90 likes | 495 Views
When SRB Meets NARA Collection-Based Long-Term Preservation. Case Studies: E-mail & AMICO Collections. Preservation : archival storage of the original collection. Access: technologies used to query the archived collection. Presentation: using stylesheets: CSS/HTML XSL/XML
E N D
Preservation: • archival storage of the original collection. • Access: • technologies used to query the archived collection. • Presentation: • using stylesheets: • CSS/HTML • XSL/XML • Consistency: • type of quality assurance to be performed.
E-mail Collection • Collection of 1 million records. • E-mail DTD based on RFC 1036. • Turnaround time: 24 hours. • Scalability: 40-million E-mail turnaround time --> 1 month. • Main steps: • assembling the collection, • tagging each message using XML, • archival storage of the digital objects, • instantiation as a new collection, • indexing the collection, • presentation through a Web interface, and • support for queries against the collection.
AMICO Image Collection • collection of high-resolution images of art pieces (tiff, jpeg) • associated meta-data (object & media meta-data) in proprietary markup format • given data dictionary => XML DTD & relational storage schema • conversion (wrapping) of meta-data to XML • storage of images as binary objects in SRB • querying of meta-data using XML query language and/or SRB • www.npaci.edu/DICE/AMICO/
Request for image (X.509) tif file SRB/MCAT HPSS California Digital Library (CDL) PrototypeThe Art Museum Image Consortium (AMICO) Q1: Find title, type, and image ID of paintings BBQ Interface Q2: Find creator and related metadata of paintings XMAS query XML doc MIXm View based on AMICO DTD Wrapper AMICO XML Database AMICO XML Database MARC Database