910 likes | 1.09k Views
Emerging Standards for Complex Works. Howard Besser UCLA School of Education & Information http://www.gseis.ucla.edu/~howard. Emerging Standards for Complex Works. Background & Context for Standards MOA2: Structural & Administrative Metadata NISO/DLF: Technical Imaging Standards
E N D
Emerging Standards for Complex Works Howard Besser UCLA School of Education & Information http://www.gseis.ucla.edu/~howard
Emerging Standards for Complex Works • Background & Context for Standards • MOA2: Structural & Administrative Metadata • NISO/DLF: Technical Imaging Standards • Identification/provenance • Rich Media • Longevity
Key problems we’re facing • Discovery • Longevity • Interoperability
DL DL DL DL search & presentation search & presentation search & presentation search & presentation user user Traditional Digital Library Model
DL DL DL DL search & presentation user user Ideal Digital Library Model
For Interoperability Digital Libraries Need Standards • Descriptive Metadata for consistent description • Discovery Metadata for finding • Administrative Metadata for viewing and maintaining • Structural Metadata for navigation • ... Terms & Conditions Metadata for controlling access...
Why are Standards and Metadata consensus important? • Managing digital files over time • Longevity • Interoperability • Veracity • Recording in a consistent manner • Will give vendors incentive to create applications that support this
Collaborative Metadata Projects • Dublin Core • NSF/ERCIM Digital Collaboratory • OCLC CORC Project- • Visual Resources Association (VRA) Core • Encoded Archival Description (EAD) • Computerized Interchange of Museum Information (CIMI)- • Records Export for Art and Cultural Heritage (REACH)
CORC--Cooperative Online Resource Catalog • both bib records & webliographies (pathfiinders) • supports both AACR2/MARC and DC • began 1/99, scheduled availability 7/00 • 100-200 participants • Academic libraries • OCLC networks, special libraries, public libraries, state & national libraries, consortia
Making of America II- • Background of the DLF Project • Administrative Metadata • Structural Metadata
MOA2 Goal is Interpoerability • Book example
DLF Metadata for Interoperability Testbed:the MOA II Project • R & D • Distributed Repositories • Transportation, 1869-1900 • Testbed Project • Best Practices • Structural and administrative metadata
Previous Projects/Background • Library Standards Background • UC Berkeley Background • Finding Aids • EAD • SGML • EAD “Digital Archives”
MOA II Classes of Objects • Continuous Tone Photos • Photo Albums • Diaries, journals, letterpress books • Ledgers • Correspondence
MOA II Metadata • Administrative Metadata • for enhancing resource management • Structural Metadata • for reflecting internal hierarchies and relationships btwn parts • Raw/Seared/Cooked
MOA II Behaviors • Navigation • Display/Print
MOA II Best practices • Use/Users/Collection: • Benchmarking • Masters vs. Derivatives • Scanning- • Administrative Metadata- • Structural Metadata-
Think about users (and potential users), uses, and type of material/collection Scan at the highest quality that does not exceed the likely potential users/uses/material Do not let today’s delivery limitations influence your scanning file sizes; understand the difference between digital masters and derivative files used for delivery Many documents which appear to be bitonal actually are better represented with greyscale scans Include color bar and ruler in the scan Use objective measurements to determine scanner settings (do NOT attempt to make the image good on your particular monitor or use image processing to color correct) Don’t use lossy compression Store in a common (standardized) file format Capture as much metadata as is reasonably possiple (including metadata about the scanning process itself) Scanning Best Practices
Administrative Metadatato uniquely identify a digital resource and manage it over time • Information about where the various pieces/versions of the object reside • Information to view the digital object • Information about the scanning process
Structural Metadata:that which is relevant to presentation of the digital object to the user • metadata defining the "object”: a book, a diary, a photo album • metadata defining the “sub-objects”: pages (physical) or chapters and subheads (intellectual)
SGML, XML, HTML • TEI for structured humanities text • EAD for Finding Aids
NISO/DLF Image Metadata WorkshopPossible Goals • Metadata fields • Rules for Field Contents (authority control) • Core set of necessary fields • Syntax for expressing fields and contents (headers)
Image MetadataFocus on Metadata that may prove helpful for • management • use • preservation • ...
Image MetadataBreak-out Groups: Work Done • Characteristics and Features of Images • Image Production and Reformatting Features • Image Identification and Integrity
NISO/DLF Image Metadata Workshop (4/99)Image Technical Information :Possible Goals • Metadata fields • Rules for Field Contents (authority control) • Core set of necessary fields • Syntax for expressing fields and contents (headers)
Image MetadataFocus on Metadata that may prove helpful for • management • use • preservation • ...
Image MetadataBreak-out Groups: Work Done- • Characteristics and Features of Images • Image Production and Reformatting Features • Image Identification and Integrity
Image Metadata Elements for Data DictionaryData Dictionary Entries • Element Name • Definition (short) of the element name • Is the element required? (Identified as: Mandatory, Mandatory if Applicable, Recommended, Optional) • How is the value of the element represented? • Examples • When is this data collected? • What is the purpose of this data? • Who would the identified users be? • How is the metadata used? • What other metadata standards reference it?
Image Metadata Elements for Data DictionaryCharacteristics and Features Element List • Format Issues: • Resolution Issues: • Encoding: • Compression: • Others:
Image Metadata Elements for Data Dictionary Image Production Element List (Pertaining to the Image) • In-image target(s): • System target(s), associated with the object: • Responsible agent • Rationale: • Hardware: • Software:
Image Metadata Elements for Data Dictionary Image Production Element List (Pertaining to the Process) • Format of the image • Intrinsic characteristics of the image • Identification • Provides a means for defining methodology including documentation and rationale • Who is involved with the file? • Who created the image file? • Who commissioned the creation of the image file (i.e., the chartering entity), as opposed to: Who is the responsible agency? Who is the owner? • Where • What • When: necessary dates including: capture date/time, modification • Checksum • Navigational aid • Encoding tools
Image MetadataNISO/DLF Image Metadata:In Progress • Data Dictionary for both “Characteristics & Features” and for “Image Production Elements” due end of 6/00
Identification/Provenance (Images)- • The number of variant forms of a work can be enormous • Image Families • A digital image frequently has many layers of parentage • Information about the parentage that can indicate the quality and veracity of the image (Dublin Core "Source" and "Relation") • how to deal with different versions derived from the same scan or different encoding schemes • Vocabulary Standards to express this
The number of variant forms of a work can be enormous • different views of the same object • different lighting of the same object • different scans of the same photo • different resolutions • different compression schemes • different compression ratios • different file storage formats • different details of the same image • ...
Identification/Provenance • how to deal with different versions (browse, hi-res, medium res) derived from the same scan or different encoding schemes (TIFF, PICT, JFIF) • Vocabulary Standards to express this • VRA Surrogate Categories • CIMI's "Image Elements”
Other Metadata • Description of depiction/surrogate (What VRA calls its "Surrogate Categories") • Description of original object • Rights and Reproduction Information • Location Information
Metadata for Digital Commerce • DOI • <indecs>-
<Indecs> • formal structure for describing and uniquely identifying intellectual property itself, the people and businesses involved in its trading, and the agreements which they make about it (primarily for publishing, music, and visual arts) • will develop high-level specifications for the services that will be required to implement a global IP trading system based on this <indecs> generic data model • focus is on encoding rights at a high level, not on resource discovery • likely to involve metadata schma registration and directory to allow interoperation of personal identifiers for rightsholders and users • supported by EEC DG-13 • First meeting July 1999 • http://www.indecs.org/
Problems & Potentialsof Rich Media- • Types of Rich Media • Technologies and problems • Opportunities--a scenario • Metadata • Indexing
Some Types of Rich Media • Moving image materials • Multimedia • Interactive programs • Computer art
After an uphill battle, tech and Tinseltown find common ground (USA Today, 3/3/00)
Projected Changes: Prospect of digitized movies already has some mourning loss of film(SF Chronicle, 3/5/00)