1 / 32

Morphing Metadata: a Highly Automated Method of Cataloging Electronic Theses and Dissertations

Sevim McCutcheon Monographs Cataloger, Assistant Professor Kent State University University Libraries ALA Midwinter Conference Cataloging Norms Interest Group January 16, 2010. Morphing Metadata: a Highly Automated Method of Cataloging Electronic Theses and Dissertations.

penny
Download Presentation

Morphing Metadata: a Highly Automated Method of Cataloging Electronic Theses and Dissertations

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Sevim McCutcheon Monographs Cataloger, Assistant Professor Kent State University University Libraries ALA Midwinter Conference Cataloging Norms Interest Group January 16, 2010 Morphing Metadata:a Highly Automated Method of Cataloging Electronic Theses and Dissertations

  2. ETD Project Timeline and Today’s Topics OhioLINK Consortium’s ETD Center est. 2000 First Kent State University ETD, November 2004 • OhioLINK statewide cataloging standards developed, 2005-2006 • Automated process created at KSU, 2005-2006 (“Cataloging Bot,” short for Cataloging Robot)

  3. Comparison of Print and Electronic TDs

  4. Cataloging ETDs Why catalog ETDs? Keyword searching isn’t perfect! Precision & recall Differences and Similarities compared to cataloging print: • Differences: • Author-supplied metadata is the basis of a MARC record • No physical object to track or examine • Similarities: • It’s still a monograph, with a title page as the chief source of information • It’s still a dissertation, requiring complex subject analysis

  5. OhioLINK Committee Discussions • Born digital vs. reproduction • Available first, before book or microfiche = born digital • Published vs. unpublished • Born digital thus published

  6. OCLC Bib Formats + Standards 3.1 Theses and Dissertations. Two types A. Those that exist as digital originals B. Those that are scanned versions of paper originals. “Digital originals should be treated as published items and cataloged as original electronic publications. “ See also:http://www.oclc.org/support/documentation/worldcat/cataloging/electronicresources/ .

  7. When ETDs considered published, what changes? • Fixed Field (Leader/06): Record Type code is now “a,” language material; not “t” for manuscript • Fixed Field Country of Publication code and 260 $a, $b: include place of publication and publisher, which is the university • Fixed Field Government Publication Code might be affected: American state universities use GPub (008/28) = s

  8. NDLTD site has OhioLINK Documentation for Cataloging ETDs

  9. NETWORK NDLTD web site OhioLINK ETD Center SUBMIT SEARCH ETD Center web site

  10. Student 1. Submission 4. Approval College Gatekeeper 2. Submission notification 3. Notification forwarded 5. Publication notification Library ETD Coordinator 6. Retrieve metadata Catalog Cataloger 8. Cataloging notification Cataloging Bot 7. Send MARC to catalog Cataloging Bot interactions with ETD Center ETD Center

  11. Step 5: Publication Notification Standard Marcview.cgi is not true MARC = unusable!

  12. Workaround using OAI-PMH

  13. ETD Center OAI-PMH URLs ETD-MS: ETD Metadata Standard (more complete than DC) http://www.ohiolink.edu/etd/oai.php?verb=GetRecord&metadataPrefix=oai_etdms&identifier=kent1122136806 DC: Dublin Core http://www.ohiolink.edu/etd/oai.php?verb=GetRecord&metadataPrefix=oai_dc&identifier=kent1122136806

  14. How The Bot Works • Receives email • Parses email for document ID • Retrieves metadata • Parses record for useful data • Builds MARC record • Sends record to local catalog • Notifies staff via email notification

  15. 1. Receives email

  16. 2. Parses email for document ID

  17. 3. Retrieves metadata http://www.ohiolink.edu/etd/oai.php?verb=GetRecord&metadataPrefix=oai_etdms&identifier=kent1155924832 http://www.ohiolink.edu/etd/oai.php?verb=GetRecord&metadataPrefix=oai_etdms&identifier=kent1155924832

  18. 4. Parses record for useful data • Extracts data from XML file • Used regular expressions • There are other ways to do it! • Cataloging bot polishes raw material into provisional bibliographic record: • Translates some character entities • Takes out smart quotes • Replaces odd characters, like “&ndash” for numbers • Normalizes most capitalization, as when title is in all CAPITAL letters

  19. 5. Builds MARC Record • Miscellaneous reformatting • Splits subtitle from title • Adds GMD • Adds placeholder fields for pagination, etc. • Generates non-filing indicators for English • Local standards document • “Constant” data and formatting • MARC/perl module • Library for easy processing of MARC records • http://marcpm.sourceforge.net/

  20. 6. Sends MARC Record to Catalog • Direct communication between Bot and local system • “OCLC” interactive interface on III system • Perl Cookbook, recipe 17.10, (bidirectional forking client) • Record download a possible alternative • Import to OCLC or local system • Manual process

  21. 7. Sends email notification: ETD has MARC record in the local catalog

  22. Provisional Record in KentLINK

  23. Once the Bot’s work ends, the Cataloger… • Exports provisional record from KentLINK to OCLC save file • Upgrades to full record • Contributes record to WorldCat • Overlays local catalog’s provisional record with OCLC Full Record

  24. Exporting a Record

  25. Full Record in OCLC

  26. Full Record in OCLC

  27. Sources OhioLINK ETD Center http://www.ohiolink.edu/etd/ OhioLINK ETD Cataloging Standards http://platinum.ohiolink.edu/dms/catstandards/etd.pdf NDLTD: promotes dissemination and use of ETDs; documentation section includes OhioLINK and KSU materials 1. OhioLINK cataloging standards; and 2. ETD Cataloging Checklist / Sevim McCutcheon http://www.ndltd.org/ OCLC Bibliographic Formats and Standards http://www.oclc.org/bibformats/default.htm

  28. Any questions? Want to implement? Sevim McCutcheon, Cataloging issues Lmccutch@kent.edu 330-672-1703 Mike Kreyche, Technology issues mkreyche@kent.edu 330-672-1918

More Related