330 likes | 577 Views
Dublin Core Metadata. Howard Besser UCLA School of Education & Information http://www.gseis.ucla.edu/~howard. Metadata for Digital Libraries -. Models for Digital Libraries Importance of Metadata Standards Types and Uses of Metadata Discovery Metadata: The Dublin Core.
E N D
Dublin Core Metadata Howard Besser UCLA School of Education & Information http://www.gseis.ucla.edu/~howard
Metadata for Digital Libraries- • Models for Digital Libraries • Importance of Metadata Standards • Types and Uses of Metadata • Discovery Metadata: The Dublin Core
Key problems we’re facing • Discovery • Longevity- • Interoperability-
DL DL DL DL search & presentation search & presentation search & presentation search & presentation user user Traditional Digital Library Model
DL DL DL DL search & presentation user user Ideal Digital Library Model
For Interoperability Digital Libraries Need Standards • Descriptive Metadata for consistent description • Discovery Metadata for finding • Administrative Metadata for viewing and maintaining • Structural Metadata for navigation • ... Terms & Conditions Metadata for controlling access...
Why are Standards and Metadata consensus important? • Managing digital files over time • Longevity • Interoperability • Veracity • Recording in a consistent manner • Will give vendors incentive to create applications that support this
Why Standards? • Why do we need standards? • To make information universally available to users • facilitate sharing and interchange of information • To preserve information (make it safe from changes in hardware and software) • Standards only work if communities widely accept them, but they’re necessary for communities to work together
Why are you Managing this Information? • Organizational mission & type • Users • Uses
Questions to Ask • What communities is this standard designed for? • What type of information is this standard designed to handle? • What functions is this standard designed to serve? • What previous standards is it built upon? • Does the standard prescribe how to create new records (or parts of records), or how to map from existing records? • How far does the standard go? Semantics: Does it define element sets? Rules? Syntax?-
What is Metadata • Structured data describing other data used to find or help manage information resources • Aids in interoperability • Titles, dates, captions, cataloging and indexing data, file headers, rights info, provenance, code books, transaction logs, ... • One person’s metadata is another’s data
Sorting through the Standards Morass • Data Structures (DC, CDWA, MARC, VRA Core, TEI, EAD, MESL data dict) • Data Interchange (Z39.50) • Data Values/vocabularies (LCSH, AAT, ULAN, TGN) • Data Content/syntax (AACR2)
Semantics/Syntax/Structure • Semantics • meaning, as defined by a community to meet their particular needs (DC) • Syntax • a systematic arrangement of data elements for machine processing • facilitates the exchange and use of metadata among various applications (HTML, XML, RDF) • Structure • a formal arrangement of the syntax with the goal of consistent representation of the semantics (rules defining field contents like 1/11/99)
What is MetadataTypes & Uses • lots of different ways of dividing the clusters
Uses of Metadata • Discovery & Retrieval • Identification/Provenance • Rights Management • Viewing • Integrity • Longevity • Content rating
Containers and Packages of MetadataWarwick, not MARC • modular • overlapping • extensible • community-based • designed for a networked world to aid commonality btwn communities while still providing full functionality within each community
Some different schemes where Metdata is kept • embedded withing the object (HTML tags) • in a separate related DB maintained by same organization (OPAC, MOA II) • in a separate DB maintained by a separate organization (Books in Print, ratings systems) • derived on-the-fly from a different scheme (MARC-to-DC)
Collaborative Metadata Projects • Dublin Core • NSF/ERCIM Digital Collaboratory • OCLC CORC Project- • Visual Resources Association (VRA) Core • Encoded Archival Description (EAD) • Computerized Interchange of Museum Information (CIMI)- • Records Export for Art and Cultural Heritage (REACH)
Dublin Core (3/95) • improve resource discovery • anticipate precision problems of Web Crawler-based searching tools • existing metadata could be “dumbed down” • elements should be simple to understand and use, so that any individual should be able to assign terms him/herself • software might eventually automatically generate very base-level metadata
Title Creator Subject Description Publisher Contributors Date Type Format Identifier Source Language Relation Coverage Rights Dublin Core
Dublin Core • every element is both optional and repeatable • elements are cross-disciplinary • elements are extensible by organized communities • can employ a syntax such as html’s <META> tagset for use by Spiders and Harvesters • May 2000 DLF Metadata Harvesting Project
DC Qualifiers • allows one community to express important nuances and qualifications, while still making the basic importance available to communities with simple needs • our community can reflect alternate title, transliterated title, and main title, yet they will all be found under a simple Web search under “title”
Discovery Metadata:Recent History • Dublin Core (3/95) • Warwick Framework (4/96) • Image Metadata Workshop (9/96) • Canberra, Helsinki, ... DC (98) • Digital Library Collaboratory (97-) • DC-8, Frankfurt 10/99
Dublin Core--further work • Warwick Framework • metadata packages for extensible functions • layed groundwork for RDF • Canberra Qualifiers • refining the semantics of the element set to provide more precise info • SUBELEMENT, SCHEME, LANG • Granularity • no hierarchical relationships w/i a given DC record; only one record per discrete object (collection or item-level), and relationship field plus qualifier links them
The Research Process and Functional Categories of Metadata • Discovery • Retrieval • Collation • Analysis • Re-presentation
Metadata Mapping- • Crosswalks • Resource Description Framework (RDF)
Crosswalks • mapping btwn differing metadata structures • eliminate the need for monolithic, universally adopted standards • focus on flexibility and interoperatiblity • RDF-based metadata registries
Resource Description Framework (RDF, spec released 2/99) • W3C Metadata activity • designed to move the Web beyond simple links to semantically-rich relationships btwn resources • metadata application using XML as a common syntax for exchange and processing • flexible architecture for managing diverse application-specific metadata packets that can be processed by machines • associates resources, property types, and corresponding values • http://www.w3.org/RDF/
RDF • Resources (character strings, names, digital objects) • Property (“is the author of”) • Value • resources+properties=relationships • many different relationships can be reflected
XML-encoded RDF • <?xml:namespace ns=http://www.w3.org/RDF/RDF prefix="RDF" ?> • <?xml:namespace ns=http://purl.oclc.org/DC/ prefix="DC" ?> • <RDF:RDF> • <DC:Creator>Howard Besser</DC:Creator> • </RDF:Description> • </RDF:RDF>
Should you start building with RDF today? • Tools are primitive • Standard still likely to evolve
Metadata for Digital Libraries Howard Besser UCLA School of Education & Information Baca, Murtha (ed). Introduction to Metadata, Los Angeles: Getty Information Institute, 1998 http://www.getty.edu/gri/standard/intrometadata/ http://sunsite.Berkeley.EDU/Imaging/Databases/#standards http://sunsite.Berkeley.EDU/moa2/ http://sunsite.Berkeley.EDU/Longevity/ http://www.ifla.org/II/metadata.htm http://purl.oclc.org/metadata/dublin_core/ http://purl.oclc.org/corc/ http://lcweb.loc.gov/ead/ http://www.gseis.ucla.edu/~howard/image-meta.html http://www.gseis.ucla.edu/~howard/Metadata/UC-May00/ http://sunsite.berkeley.edu/Metadata/sp2000.html