Heiko Mueller

Archiving and Maintaining Curated Databases

Database archiving is important for the retrieval of old versions of a database and for temporal queries over the history of data. Over the past two years, we have developed a management system for maintaining, populating, and querying archives of hierarchical data. Our archiver is based on a nested merge approach that efficiently stores multiple versions of hierarchical data in a compact archive. Because elements from all versions are merged into one data structure, any specific version can be retrieved from the archive in a single pass over the data, and the history of an object can be tracked efficiently. In the first part of this presentation, I will give an overview of the latest features of our archiver. In the second part, I will discuss our ongoing efforts regarding (i) archiving evolving databases, (ii) supporting distributed archives, and (iii) using the archiver as the basis of a system to create, maintain, and publish curated databases.
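
To illustrate the nested merge idea, here is a minimal sketch in Python. It is not the archiver's actual implementation: the names ArchiveNode, merge, retrieve, and history are hypothetical, and the sketch assumes that versions are numbered with integers, that hierarchical data is given as nested dictionaries, that each element is identified by its dictionary key, and that leaf values are hashable.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Set


@dataclass
class ArchiveNode:
    """One merged element; all names here are illustrative assumptions."""
    versions: Set[int] = field(default_factory=set)        # versions containing this element
    children: Dict[str, "ArchiveNode"] = field(default_factory=dict)
    values: Dict[Any, Set[int]] = field(default_factory=dict)  # leaf value -> versions


def merge(node: ArchiveNode, data: Any, version: int) -> None:
    """Merge one version of the data into the archive, top down."""
    node.versions.add(version)
    if isinstance(data, dict):
        for key, child in data.items():
            # Elements with the same key are merged into the same archive node.
            merge(node.children.setdefault(key, ArchiveNode()), child, version)
    else:
        # An unchanged leaf value only gains a version number, not a new copy.
        node.values.setdefault(data, set()).add(version)


def retrieve(node: ArchiveNode, version: int) -> Any:
    """Rebuild a single version in one pass over the archive."""
    present = {k: c for k, c in node.children.items() if version in c.versions}
    if present:
        return {k: retrieve(c, version) for k, c in present.items()}
    for value, versions in node.values.items():
        if version in versions:
            return value
    return {}  # element existed in this version but had no content


def history(root: ArchiveNode, path: List[str]) -> Dict[Any, List[int]]:
    """Return every value an element has held, with the versions holding it."""
    node = root
    for key in path:
        node = node.children[key]
    return {value: sorted(vs) for value, vs in node.values.items()}


# Two versions of a small, made-up hierarchical database.
archive = ArchiveNode()
merge(archive, {"gene1": {"name": "abc", "length": 100}}, 1)
merge(archive, {"gene1": {"name": "abc", "length": 120},
                "gene2": {"name": "xyz"}}, 2)
version_1 = retrieve(archive, 1)
length_history = history(archive, ["gene1", "length"])
```

In this sketch, the unchanged value of gene1/name is stored once and tagged with both versions, which is what keeps the archive compact. Retrieval and history lookup each walk the merged structure once. A real archiver would likely store version intervals rather than sets of integers.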

