1 / 21

METS at UC Berkeley

METS at UC Berkeley. Generating METS Objects. Background. Kinds of materials: primarily imaged content & tei encoded content archival materials: manuscripts and pictorial collections oral histories Kinds of Metadata Structural metadata: physical structure Descriptive metadata

carlow
Download Presentation

METS at UC Berkeley

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. METS at UC Berkeley Generating METS Objects

  2. Background • Kinds of materials: • primarily imaged content & tei encoded content • archival materials: manuscripts and pictorial collections • oral histories • Kinds of Metadata • Structural metadata: physical structure • Descriptive metadata • BasicTechnical metadata about digital files and how they were produced

  3. Tools For Producing METS Objects • GenDB • Gathers structural, descriptive and technical metadata • GenX • Generates METS objects from GenDB

  4. GenDB • Consists of: • Relational database (Currently SQL Server) • Locally developed software for gathering metadata and facilitating digital processing

  5. Object 1 Div 1 Div 2 Div 3 Object 2 Div 1 Div 2 Div 3 Div 4 GenDB Database Structure Structural Metadata Structural Md Table Object 1 Div 1 (root) Div 2 (parent = div 1) Div 3 (parent = div 1) … Object 2 Div 1 (root) Div 2 (parent = div 1) Div 3 (parent = div 2) Div 4 (parent = div 2)

  6. GenDB Database Structure Descriptive Metadata Structural Md Table Name Table Object 1 Div 1 Core Desc Md Name 1 Div 2 Core Desc Md Name 2 Div 3 Core Desc Md Name 3 Object 2 Div 1 Core Desc Md Note Tables Note 1 Div 2 Core Desc Md Div 3 Core Desc Md Note 2 Div 4 Core Desc Md Note 3

  7. GenDB Database Structure Content File/Technical Md Master Image Table Structural Md Table Mstr 1 Technical Md Object 1 Div 1 Mstr 2 Technical Md Div 2 Div 3 Derivative Image Table Drv 1 Technical Md Drv 2 Technical Md Drv 3 Technical Md Drv 4 Technical Md

  8. Populating the Database Tables • Web interface: manual input of structural and descriptive metadata • Digitization Management modules • Generate work orders to guide digitization process • Import content file information and technical metadata coming out of digitization process • Batch loader: batch input based on TEI encodings, legacy metadata

  9. Web Interface: WebGenDB Java Server Web Interface SQL Server Database jdbc rmi Java Servlet XML Config Files

  10. Digitization Management Modules Vendor Imaging/ Transcription WorkOrders Web Interface Technical MD Spreadsheets Java Server Java Servlet SQL Server Database

  11. SQL Server Database Batch Loader Java Server Web Interface TEI Docs XSLT Java Servlet Java Batch Loader XML Batch Load File

  12. WebGenDB The concepts that drove the design • Shielding user from METS complexity • Highly configurable • Unicode support • Access driven by login privileges • Use of Open Source software and components • Distributed approach

  13. XML Configuration Files • Three levels • Common to all projects elements • Common to all screens in a project elements • Specific to a screen in a project • Define fields common to all projects • Define fields used in specific project • Define screens by project & object type

  14. Relation among XML files ObjectType1.xml Proj1.xml ObjectType2.xml AlProjects.xml ObjectType1.xml Proj2.xml ObjectType2.xml

  15. Project XML file example <ObjectType> <name>workorder</name> <fileLocation> /data/_w/GenDB/WEB-INF/classes/edu/berkeley/library/propertyFiles/CalCultureWorkOrderScreensFile.xml </fileLocation> </ObjectType> <Field> <name>Image</name><type>checkbox</type><label>Image </label><size>1</size> </Field> <Field> <name>Text</name><type>checkbox</type><label>Text </label><size>1</size> </Field> <Field> <name>Title</name><type>text</type><label>Title </label><size>60</size> </Field>

  16. Software used • MSSQL running on NT • Tomcat 4.1.2 implementing servlets 2.3 • Jsdk 1.4 • Xalan 2.4 • Xerces 1.0.3 • FOP 0.12.1 • JDOM beta 8 • Opta 2000

  17. Relationship of GenDB to METS • Metadata not directly stored in METS, MODS or MIX schema formats. • Much of the database structure was developed before these standards emerged • Database structure and content adjusted to be compatible with all these formats

  18. GenX: From GenDB to METS • Allows Digital Publishing Group staff to select the objects in the GenDB database that are ready for export and to export them as METS objects.

  19. GenX Architecture App Interface Java Application METS XML Repository JDBC GenDB

  20. GenX Output • METS output corresponding to version 1.3 • Descriptive metadata exported to METS descMD in MODS 2.0 format • Technical Metadata exported to METS techMD in MIX format • Planned: • Text technical md to METS descMD in NYU TextMD • Rights to METS rightsMD in ODRL subset

  21. Links • GenDB Web Interface Demo • http://sunsite2.berkeley.edu/GenD • login: demo • password: demo • Developers: • rbeaubie@library.berkeley.edu • ghill@library.berkeley.edu • jhassan@library.berkeley.edu

More Related