Copying Archives Project • Group Members: Mushashu Lumpa, Ngoni Munyaradzi
What is a Digital Repository? • A storage system where digital contents can be stored
Problem Statement • Preservation and replication of content in archives are important aspects of digital libraries • Heterogeneous archives cannot be easily connected to transfer data • Our client: Stellenbosch University
Archives tools for project: • LOCKSS (Lots of Copies Keep Stuff Safe) • A digital preservation system • Difficult to integrate with archives
Archives Tools • DSpace • Open source software package • Provides tools for management of digital assets • Commonly used as an institutional repository • EPrints • Free software for creating online archives • Primarily used for institutional repositories and scientific journals
Project Aim • To develop and test a common mechanism to interconnect archives with LOCKSS or with one another, thereby enabling preservation, replication and migration of content • Using this common mechanism, it should be possible for an archive: • to connect into a LOCKSS network for preservation • to transfer its contents to another archive • to reload from LOCKSS when an archive fails
Research Questions • Is it possible to use a generic package format to import/export data from various repositories? • Is it possible to create an import and export interface for LOCKSS, utilising the generic package format? • Is it possible to create an import and export interface for DSpace and EPrints, utilising the generic package format?
Mushashu Lumpa: Related Work and Approach
Interoperability Background • The Open Archives Initiative defines a protocol called OAI-PMH • OAI-PMH allows for metadata harvesting • Dublin Core is the standard representation of the metadata • More complex metadata descriptions exist • e.g. METS, MPEG-21 DIDL, PREMIS, IMS etc.
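The harvesting step above can be sketched in Python as follows. This is a minimal illustration, not the project's code: the base URL http://example.org/oai, the function names and the sample response are all assumptions. Only the `ListRecords` verb, the `oai_dc` metadata prefix and the XML namespaces come from the OAI-PMH specification.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None):
    """Build a ListRecords request URL (base_url is a hypothetical endpoint)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        # Optional selective harvesting: only records changed since this date.
        params["from"] = from_date
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Return (identifier, title) pairs from a ListRecords response."""
    root = ET.fromstring(xml_text)
    pairs = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        title = record.findtext(".//dc:title", namespaces=NS)
        pairs.append((identifier, title))
    return pairs


# Hand-written sample response, for illustration only.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:example.org:1</identifier>
        <datestamp>2010-05-01</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample thesis</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

print(list_records_url("http://example.org/oai"))
print(parse_records(SAMPLE))
```

Note that this harvests metadata only; moving the digital objects themselves needs a richer package, which is where formats such as METS come in.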
Related work • University of Wales Aberystwyth integrated its local library content, running on DSpace, with: • University of Wales Swansea: DSpace • National Library of Wales: Fedora • Electronic Theses Online Service (EThOS – www.ethos.ac.uk/): EPrints • Used OAI-PMH with METS • Message Queue Based Approach • Used METS to encapsulate the data transferred • Used message queues as a file stream transfer mechanism • Ad-hoc one-off scripts • EPrints to Fez Fedora, University of Queensland Library, Australia
Our Approach • Develop a common interchange package for LOCKSS, DSpace and EPrints • Develop plug-ins to handle the export and import
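One way to picture this approach is a repository-neutral package plus a plug-in interface that each archive implements. The Python sketch below is an assumption made for illustration: `Package`, `ArchivePlugin`, `InMemoryArchive` and `migrate` are hypothetical names, and the in-memory archive merely stands in for a real DSpace, EPrints or LOCKSS plug-in.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass, field


@dataclass
class Package:
    """Hypothetical repository-neutral unit of exchange."""
    identifier: str
    metadata: dict                              # e.g. Dublin Core fields
    files: dict = field(default_factory=dict)   # file name -> bytes


class ArchivePlugin(ABC):
    """Hypothetical interface that every archive plug-in would implement."""

    @abstractmethod
    def export_packages(self):
        """Return all items of the archive as a list of Package objects."""

    @abstractmethod
    def import_package(self, package):
        """Store one Package in the archive."""


class InMemoryArchive(ArchivePlugin):
    """Stand-in for a real archive, keeping packages in a dict."""

    def __init__(self):
        self.items = {}

    def export_packages(self):
        return list(self.items.values())

    def import_package(self, package):
        self.items[package.identifier] = package


def migrate(source, target):
    """Copy every package from source to target; return how many were copied."""
    count = 0
    for package in source.export_packages():
        target.import_package(package)
        count += 1
    return count


source = InMemoryArchive()
source.import_package(
    Package("item-1", {"title": ["Sample thesis"]}, {"thesis.pdf": b"%PDF-"})
)
target = InMemoryArchive()
print(migrate(source, target), "package(s) migrated")
```

Because `migrate` only talks to the interface, adding a further archive type would mean writing one new plug-in rather than one converter per pair of archives.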
Modes of Package Interchange • Once-off batch migration • We want to develop a system that can be applied to other archives, not just to DSpace, LOCKSS and EPrints • This is more of a backup-and-restore feature • Incremental online repository updates • Automated incremental updates of archives • Lessons learnt from the use of message queues, OAI-PMH and complex metadata formats
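The difference between the two modes can be sketched with a checkpoint timestamp: a batch migration copies everything, while an incremental update copies only items modified since the last run. The Python below is a minimal sketch under assumed data structures (plain dicts mapping an item id to a `(modified, content)` pair); the function name `incremental_sync` is hypothetical.

```python
from datetime import datetime, timezone


def incremental_sync(source, target, last_sync):
    """Copy items modified after last_sync.

    source and target map item id -> (modified datetime, content).
    Returns the copied ids and the checkpoint to use for the next run.
    """
    copied = []
    checkpoint = last_sync
    for item_id, (modified, content) in sorted(source.items()):
        if modified > last_sync:
            target[item_id] = (modified, content)
            copied.append(item_id)
            checkpoint = max(checkpoint, modified)
    return copied, checkpoint


def utc(year, month, day):
    return datetime(year, month, day, tzinfo=timezone.utc)


source = {
    "item-a": (utc(2010, 1, 1), b"old content"),
    "item-b": (utc(2010, 6, 1), b"new content"),
}
target = {"item-a": (utc(2010, 1, 1), b"old content")}

copied, checkpoint = incremental_sync(source, target, utc(2010, 3, 1))
print(copied, checkpoint.date())
```

A once-off batch migration is then the special case where the checkpoint predates every item in the source archive.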
Ngoni Munyaradzi: Project Plan and Wrap-up
Project Plan • Work Allocation: • Ngoni Munyaradzi – development of plug-ins for the LOCKSS system • Mushashu Lumpa – development of DSpace and EPrints plug-ins • Main Project Deliverables: • Export/Import plug-ins for LOCKSS, DSpace and EPrints • Generic interchange package
Key Success Factors • Dependent on whether the research questions have been answered • Design of a generic interface package • Implementation of incremental updates of the content • Implementation of import and export plug-ins for LOCKSS, DSpace and EPrints
Evaluation • Data migration consistency and integrity test • Efficiency tests • System usability tests
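A data migration integrity test could, for example, compare checksums of each item before and after transfer. The Python sketch below assumes SHA-256 and plain dicts mapping an item id to its bytes; both choices, and the function names, are assumptions for illustration rather than the project's stated method.

```python
import hashlib


def checksum(data):
    """Hex SHA-256 digest of a bytes object."""
    return hashlib.sha256(data).hexdigest()


def verify_migration(source, target):
    """Compare two archives given as dicts of item id -> bytes.

    Returns a dict of item id -> problem description; empty means
    every source item arrived intact.
    """
    problems = {}
    for item_id, data in source.items():
        if item_id not in target:
            problems[item_id] = "missing"
        elif checksum(target[item_id]) != checksum(data):
            problems[item_id] = "checksum mismatch"
    return problems


source = {"item-1": b"thesis", "item-2": b"dataset", "item-3": b"image"}
target = {"item-1": b"thesis", "item-2": b"datase"}   # truncated copy, one item lost
print(verify_migration(source, target))
```

In practice the checksums would be recorded at export time and checked after import, so the two archives never need to be read side by side.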
Impact of Project • Contribution to the digital library community by providing a solution for interoperability and preservation • Safeguard locally produced content (academic, heritage) against accidental loss
Timeline • Plug-ins prototype (by 4th June) • LOCKSS • DSpace • Design and implementation of common interchange package (by 12th September, 2010) • Design and implementation of plug-ins (by 19th September, 2010) • System integration and testing (by 29th September, 2010)