1 / 30

Cleaning Metadata

Cleaning Metadata. Graphic by Ryan Schenk. Outline. Introduction Definitions Cleaning metadata General Thoughts Process, Tools, and Lessons Going forward. Definitions. Metadata Schema Metadata Cleaning. Metadata. Summary information about something. The Something.

salaam
Download Presentation

Cleaning Metadata

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Cleaning Metadata Graphic by Ryan Schenk

  2. Outline • Introduction • Definitions • Cleaning metadata • General Thoughts • Process, Tools, and Lessons • Going forward

  3. Definitions • Metadata • Schema • Metadata Cleaning

  4. Metadata Summary information about something The Something Summary Information Title: Recovering Gear Photographer: Jan Hahn Date: 1950 Ship: Atlantis People: Nat Corwin, Dean Bumpus

  5. Schema The structure of a set of information The Structure ID | Subject 1 |Nat and Dean aboard Atlantis in 1950 The Information A different schema with the same information: ID | Date | People | Ships 1 | 1950 | Nat Corwin, Dean Bumpus | Atlantis

  6. Cleaning Metadata Converting metadata into a more usable form Before cleaning: ID | Caption 1 | Corwin, N. and Dean Bumpus on Atlantis in 1950 Loose schema, different formats, ambiguous, not atomic After cleaning: dc.id | dc.date | dc.subject.person | dc.subject.ship 1 | 1950 | Corwin, Nathaniel; | Atlantis (Ketch) | Bumpus, Dean F. | Precise schema, standard formats, specific, atomic

  7. Cleaning Metadata • General Thoughts • Process, Tools, and Lessons

  8. General Thoughts Cleaning metadata is like... Engineering Bundesarchiv, Bild 183-1989-0523-016 / CC-BY-SA [CC-BY-SA-3.0-de (www.creativecommons.org/licenses/by-sa/3.0/de/deed.en)], via Wikimedia Commons

  9. General Thoughts Cleaning metadata is like... Archaeology Attribution, Noncommercial, Share Alike http://www.flickr.com/photos/dunechaser/665480669/

  10. A Process for Cleaning Metadata 1. atomization 2. addition 3. reconciliation 4. reassembly

  11. AARR! • Atomization • Addition • Reconciliation • Reassembly

  12. 1. Atomization Breaking down information into basic elements

  13. Review of Atomization Tools

  14. Atomization Lessons for Metadata Designers Loose schemas and free comment fields are tough to atomize. ID | Subject 1 |Nat and Dean aboard Atlantis in 1950 Structured schemas don't need to be atomized ID | Date | People | Ships 1 | 1950 | Nat Corwin, Dean Bumpus | Atlantis

  15. 2. Addition Adding information

  16. Addition Lessons for Metadata Designers Addition is time-consuming and often impossible. Record as much as you can from the start!

  17. 3. Reconciliation Standardizing information

  18. Reconciliation Lessons for Metadata Designers Free text fields tend to produce Irregular information. Temple of Doom Movie: Movie: Indiana Jones and the Temple of Doom Controlled vocabularies and selection widgets will keep your information standardized.

  19. 4. Reassembly Recombining information in a new form

  20. Reassembly Lessons for Metadata Designers Be consistent. It takes time to reassemble multiple formats.

  21. Cleaning Metadata: Review

  22. Going Forward

  23. The End

  24. Review of Tools

More Related