1 / 54

Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

This paper explores the benefits, objectives, and current status of the Virtual International Authority File (VIAF), a collaborative effort to provide authority data in the form, language, and script users want on the Internet. It discusses how VIAF facilitates exposure of authority data, reduces cataloging costs, and simplifies authority control internationally.

ramires
Download Presentation

Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Building Blocks for the Future: Making Controlled Vocabularies Available for theSemantic Web Dr. Barbara B. Tillett Chief, Policy & Standards Division Library of Congress For ELAG, May 2011

  2. Linked Data VIAF LCSH National Library of Sweden DBpedia

  3. Services Databases, Repositories Web front end Internet “Cloud” 3

  4. Internet “Cloud” Databases, Repositories Services VIAF LCSH Web front end 4

  5. VIAF Objectives • Facilitate exposure of authority data • Reduce cataloging costs • Simplify authority control (creation and maintenance) internationally • Provide authority data in form, language, and script users want

  6. VIAF 歌川, 広重2世 1826-1869 ‎ Utagawa, Hiroshige, 1826?-1869 6

  7. VIAF: The Virtual International Authority File Original VIAF partners • Library of Congress (LC) • Deutsche Nationalbibliothek (DNB) • Bibliothèque nationale de France (BnF) • OCLC - host • Virtually combining the name authority files of all institutions into a single name authority service. • http://viaf.org/

  8. Virtual International Authority File • Matches names across 21 authority files of 18 institutions • 18.4 million name records • 14.5 million clusters 8 Based on KSY Cooperative Identities Hub, CEAL 2010-03

  9. Library of Congress/NACO • Deutsche Nationalbibliothek • Bibliothèque nationale de France • National Library of Australia • National Library of the Czech Republic • Bibliotheca Alexandrina (Egypt) • Getty Research Institute • National Library of Israel • Istituto Centrale per il Catalogo Unico (Italy) • Biblioteca National de Portugal • Biblioteca Nacional de España • National Library of Sweden • Swiss National Library • Vatican Library • NUKAT Center (Poland) • Library and Archives Canada • National Széchényi Library (Hungary) • RERO (Switzerland) 9

  10. Current Status • Available as linked data with URIs (Universal Resource Identifiers) • Unicode throughout • MARC 21, UNIMARC, and RDF supported • Usage tripled this last year • Thousands of visits daily

  11. Enhancing the Authorities Derived Authority Bibliographic Record Authority Record Enhanced Authority 11

  12. Usage Mining the Bibliographic Record Language LC Control Number LC Classification Place of Publication Publisher Material Type Authors LDR 00638ncm a22002057a 450 1 5773347 5 19960820101947.4 8 960815s1965 oruuua n eng 10 $a 96753638 040 $a DLC $c DLC 019 $a 17706440 020 $c $2.95 028 22 $a 48418 $b Matrix Publ. Co. 045 2 $b d198006 $b d198007 048 $b va01 $b ve01 $a ka01 050 00 $a M1258 $b .L 100 1 $a Leigh, Mitch, $d 1928- 245 14 $a The man of La Mancha / $c by Mitch Leigh & Joe Darion; arr. By Roland Barrett & Alan Keown. 260 $a Springfield, OR : $b Matrix Publ. Co., $c c1965. 300 $a 1 score (16 p.) ; $c 18 x 27 cm. 500 $a Brief record. 650 0 $a Musicals $x Excerpts. 600 10 $a Leigh, Mitch $x Musical settings. 700 1 $a Darion, Joe. Title Date of Publication

  13. All text is normalized Subjects are grouped into broad subject areas Coauthor Publication date is by decade Material type is coded Derived Authority Record 00505cz a2200157n 450 0 1 xlc 1 1 3 OCoLC 2 5 19880921165012.4 3 8 880831n|acannaab|n aaa c 4 040 $a OCoLC $b eng $c OCoLC $f viaf 5 100 1 $a Leigh, Mitch. 6 903 $a 88030979 7 910 14 $a the man of la mancha 8 921 $a matrix publ co 9 922 $a oru 10 930 $a mitch leigh 11 940 $a eng 12 942 $a 234 13 943 $a 196x 14 944 $a cm 15 950 1 $a darian, joe $d 1928-

  14. Enhanced Authority Record 00505cz a2200157n 450 0 1 oca01144962 1 5 19880921165012.4 2 8 840702n| acannaab| |n aaa ||| 3 10 $a n 88090379 4 40 $a DLC $c DLC $d DLC 5 100 1 $a Leigh, Mitch, $d 1928- 6 670 $a the man of la mancha, c1966: $b t.p. (Mitch Leigh) 7 903 $a 84758340 $9 1 8 903 $a 93710923 $9 1 9 910 11 $a impossible dream$9 1 10 910 11 $a century library of music and sound by mitch leigh $9 1 11 921 $a matrix publ co $9 1 12 921 $a kapp $9 2 13 922 $a oru $9 2 14 930 $a mitch leigh $9 1 15 940 $a eng $9 2 16 942 $a 234 $9 2 17 943 $a 196x $9 1 18 943 $a 197x $9 1 19 944 $a cm $9 2 20 950 11 $a darian, joe $d 1928- $9 1 21 950 11 $a wasserman, dale $9 1

  15. Information in Bibliographic Records He writes music His primary subject area is music He was published in the 1960s and 1970s by Matrix Publ. Co. in Oregon and Kapp in New York Worked with Joe Darion and Dale Wasserman Mitch Leigh is the only name he has used on his publications Etc.

  16. http://www.viaf.org Hosted by 16

  17. viaf.org

  18. As viewed Nov. 1, 2010 Cervantes Saavedra, Miguel de 1547 Cervantes de Salazar, Francisco, ca. 1514 Cervantes, 1823-1898 Cervantes Juan, 1395-1458 Cervantes, Ignacio, 1847-1905 Cervantes, Juan de, 1382-1453 Cervantès, François, 1959- Cervani, Giulio, 1919- Cervantes, María Antonieta Cervantes de Haro, fl. 1908-193- cer

  19. Cervantes

  20. Cervantes

  21. Cervantes Preferred Forms

  22. Cervantes

  23. Cervantes

  24. Cervantes

  25. Cervantes

  26. Cervantes

  27. Cervantes

  28. Cervantes MARC 21

  29. Cervantes RDF

  30. VIAF and Catalogers Use as a reference tool: To resolve conflicts, questionable dates, forms of name, etc. Cite as source in 670 $a, for example: BNF in VIAF, date searched Nat. Lib. of Australia in VIAF, date searched LAC in VIAF, date searched

  31. Next steps for VIAF • Better searching • More “Linked data” • Related persons as in WorldCat Identities, Wikipedia, etc. • Participants beyond libraries • Rights management agencies, Publishers • Museums, Archives • More name types • Corporate and Family names • Uniform titles • Geographic names • … not topical terms

  32. SKOS • Simple Knowledge Organization System • “Provides a model for expressing the basic structure and content of concept schemes such as thesauri, classification schemes, subject heading lists, taxonomies, folksonomies, and other similar types of controlled vocabulary”—SKOS Primer

  33. SKOS • Based on the Resource Description Framework (RDF) • Resources can be exchanged between software applications and published on the Web • Interconnects data on the Web, helping create the Semantic Web

  34. id.loc.gov/authorities • “Authorities & Vocabularies” from the Library of Congress • Intent: To provide human and programmatic access to commonly found standards and vocabularies developed by LC

  35. “Authorities & Vocabularies” • LCSH was the first offering • Subject headings • Genre/form headings • Children’s subject headings • Subdivision records • Validation records • Provides links from LCSH headings to RAMEAU headings • Exploring Répertoire de vedettes-matière (RVM) and others

  36. “Authorities & Vocabularies” • Also includes: • Thesaurus for Graphic Materials (TGM) • MARC geographic area codes • MARC language codes • MARC relator codes • Preservation Events … etc.

  37. “Authorities & Vocabularies” • Benefits • Servers can download entire controlled vocabularies and the values within them, in multiple formats • Available for free on the Web

  38. “Authorities & Vocabularies” • Human end-users can • Search and view individual headings and data elements • Details of the record • Visualization • Suggest additions, changes

  39. “Authorities & Vocabularies” • URI for specific LCSH records/ concepts: id.loc.gov/authorities/[LCCN] id.loc.gov/authorities/sh8508803

  40. “Authorities & Vocabularies” • Contact information • Content of site: Libby Dechman, edec@loc.gov • Technical questions: Larry Dixson, ldix@loc.gov

  41. “Authorities & Vocabularies” • A comment form and discussion list are available at http://id.loc.gov/authorities/contact.html

  42. RDA Controlled Vocabularies - Registries • Free on the Web at Open Metadata Registry http://metadataregistry.org/schema/list.html

  43. http://metadataregistry.org/rdabrowse.htm

  44. Carrier type

  45. URI

More Related