330 likes | 416 Views
Metadata 101. Sandy McIntyre Colby SOASIS- Dayton 2000-11-30. Outline. Environment Scan Metadata Basics Dublin Core 101 Selected Standards Discussion & Questions. Environment Scan. How big?.
E N D
Metadata 101 Sandy McIntyre Colby SOASIS- Dayton 2000-11-30
Outline • Environment Scan • Metadata Basics • Dublin Core 101 • Selected Standards • Discussion & Questions
How big? • 7.1 million unique web sites, a 50 percent increase over the previous year's total of 4.7 million • 41 percent of the Web, or about 2.9 million sites are Private • “OCLC Researchers Measure the World Wide Web” Oct. 16, 2000 (http://www.oclc.org/oclc/press/20001016a.htm)
Metadata • Known items vs. a topic • Data about data • Or: StructuredData about data • Structure • Lots of communities do metadata
Metadata • What is “metadata”? • Data about data • Or: StructuredData about data • Sound familiar? • Lots of communities do metadata
Why metadata? • Improves discovery • Enables retrieval • Supports administration
Discovery Navigating large collections is challenging! • Used to build databases to answer key what, who, where, when questions like: • What exists on a topic, in a genre, by an author, for a specific audience, published in a given year? • Brings out content, value, relationships that are not expressed in the resource • Supports fast, arm’s-length evaluation of resources to optimize retrieval, save users’ time • Is often used to “market” resources to users • Catalogs / directories / search engines • Selective Dissemination of Information (SDI))
Retrieval • Identifiers that assist manual and automated systems in retrieval • Shelf location for physical resource • File location for electronic resource • System requirements for e-resources • User’s system responds to file type with correct application • Captures rights and privileges information • Circulation • Document delivery • Interlibrary loan
So what’s the big fuss? • The Web is large and growing quickly • Many producers, many users on the Web • Navigating networked resources is difficult • Good description = • better access • better control • Control and access = big business • Convergence of interests = collaboration in building standards (interoperability)
Concepts to know: • Types of metadata Descriptive Structural Administrative Title = Nitty Gritty Dirt Band 1 File type = jpg 2 Rights holder = NGDB 3
Concepts to know (cont.): • Semantics • What’s in a name? • Syntax • We gots grammar • Interoperability • Sharing...
Concepts to know (cont.): M • Metadata objects can be: • Embedded in the resource • Separate from the resource • Both embedded and separate M M M
“Dublin Core” • Common name for the Dublin Core Metadata Element Set (DCMES) • DCMES is a • a common core of semantics for resource description • it appears to be very useful in facilitating: • retrieval of described resources • as a lingua franca for the exchange of resource descriptions • DCMES is maintained by the Dublin Core Metadata Initiative (DCMI) hosted by OCLC purl.org/dc
International in Scope Purl.oclc.org/dc/project/index.htm
Dublin Core Metadata Element Set (DCMES) • A set 15 elements designed to enhance discovery and retrieval of resources • Goals of DCMES: • Simplicity of creation and maintenance • Commonly understood semantics • Conformance to existing and emerging standards • International scope and applicability • Extensibility • Interoperability among collections and indexing systems
“Rules” for DCMES • DCMES is extensible: • Additional elements, schemes, qualifiers may be defined and used in conjunction with DCMES • DCMES may be modified by DCMI to add more elements, schemes, qualifiers over time • Approved elements, schemes qualifiers may only be used with appropriate elements • All elements, qualifiers, schemes are optional • All elements, qualifiers, schemes are repeatable • DCMES special practice may be defined by individuals, agencies, communities
Selected metadata standards • ISBD (AACR2 / MARC) • Text Encoding Initiative (TEI) headers • Encoded Archival Description (EAD) • VRA Core Categories (VRA CC) • Global Information Locator Service (GILS) • Content Standard for Digital Geospatial Metadata (CSDGM, formerly FGDC)
Metadata transport standards • MARC (MAchine Readable Cataloging) • SGML (Standard Generalized Markup Language) • HTML (Hypertext Markup Language) • XML (Extensible Markup Language) • RDF (Resource Description Framework) • Character encoding • MARC 21 repertoire • Unicode
Character Encoding • Many standards available • Of critical importance to be sure that systems correctly process, index, display textual data • MARC 21 uses various ISO standards plus EACC, etc. • Global standard gaining acceptance: Unicode http://lcweb.loc.gov/marc/specifications/speccharintro.html http://www.unicode.org/
Who uses metadata? • Elementary students • Publishers, authors, institutions • Librarians Reference/Catalogers • International in scope
Elementary students Journal of the American Society for Information Science, 51(2): 193-201, 2000: 193- 201.
Publishers, authors • Crossref- Ovid • The Association of American Publishers and Andersen Consulting recommended E-Book metadata standards • Implement a document-identification scheme worldwide
Webmasters http://www.fortune.com/fortune/technology/alsop/0,5238,88063,00.html
Librarians • Librarians • Reference, catalogers • Corporate, academic, government
Academia U. of Michigan's media image services(a search system based on Dublin Core elements) 40,000+ images215,000+ recordsare in this system http://www.images.umdl.umich.edu
Applying Dublin Core • Acquisitions • Often mandated (law or management) • Determine metadata set • Controlled vocabulary • Template (tools) • Indexing • Prototype
CORC • Discovery, harvesting, template, automated HTML • Internal publishing • Leaflets (web resources that end unto themselves) • Global standard gaining acceptance: Unicode
Additional links: • Web Characterization: • Statistics, publications, related links (http://wcp.oclc.org/) • Cataloging & Metadata Resources: • Metadata (http://slis.cua.edu/ihy/catmeta.htm#D2) • Open Archives Initiative: • (http://www.openarchives.org) • Dublin Core Metadata Initiative • Home page (purl.org/DC) • Dublin Core Library Interest Group mailing list: • http://www.mailbase.ac.uk/lists/dc-libraries/ • IFLA -- Digital Libraries: Metadata Resources • http://www.ifla.org/II/metadata.htm