100 likes | 114 Views
This overview discusses schema language choices, EAD as vocabulary/ontology, EAD subset, and tools for schema languages, including DTD, W3C XML Schema, RelaxNG. It also explores maintenance and distribution of schema, vocabulary for describing archival descriptions, vocabularies for Archival components, and implementation in different schemata and XML instances.
E N D
EAD Revision: Technical Considerations Terry Catapano EAD Roundtable Meeting 2011-08-24
Overview • Schema Language Choices • EAD as Vocabulary/Ontology • EAD Subset • Tools
Schema: Languages • Options • DTD • W3C XML Schema • RelaxNG • Maintenance? • Distribution?
Schema Languages: Features • XML • Documentation and “appinfo” • Strong Datatyping • Namespace Support • External • Internal • Allow Any? • Allow Some? (Xlink, EAC?, MODS?) • Maintenance/preservation implications • Parsing, Validation, Processing
Vocabularies/Abstract Model • Vocabulary for describing: • Archival descriptions (records/documents) [Tag Library] • Archival components (things) • Establish URIs • Formal Expression (SKOS?, RDFS?, OWL?) • Implemented in schemata, xml instances • TEI, NISO JATS, Dublin Core, DarwinCore...
Vocabularies, con't • Enumerated values • c/@level • Complement, Incorporate others • LOCAH • Aaron Rubenstein/ http://purl.org/archival/vocab/arch • Dublin Core, others...?
SKOS Straw Man <rdf:RDFxmlns:ead="http://example.info/ead/vocabulary/"xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"xmlns:skos="http://www.w3.org/2004/02/skos/core#"><skos:Concept rdf:about="http://www.example.info/ead/vocabulary/c/"><skos:prefLabel>Component</skos:prefLabel><skos:definition>A subordinate part of [described archival] materials...An intellectually logical section of... described [archival] materials...A stage within a hierarchical description of [archival materials].</skos:definition></skos:Concept><skos:Concept rdf:about="http://www.example.info/ead/vocabulary/unittitle/"><skos:prefLabel>Unittitle</skos:prefLabel><skos:definition>The name, either formal or supplied, of described [archival] materials. May consist of a word, phrase, character, or group of characters...The <unittitle> is comparable to ISAD(G) element 3.1.2 and MARC field 245</skos:definition></skos:Concept></rdf:RDF>
Profiling/Subsetting • Base Schema + Subset(s) • Endorsed/Official? • Facilitate User created? • Similar to approaches of other tagsets (TEI; JATS) • Schematron • Rules-based Validation • XPATH Based
Schematron: Example <sch:pattern name="Co-Occurrence Constraints"><sch:rule context="*[@level ='otherlevel']"><sch:assert test="string-length(@otherlevel) > 1"> If the value of a level attribute is "otherlevel', then the otherlevel attribute must be used </sch:assert></sch:rule></sch:pattern> <sch:pattern name="Elements Removed"> <sch:rule context="ead:chronlist"> <sch:report test=".">ERROR: <sch:name/> ELEMENT NOT ALLOWED</sch:report> </sch:rule> </sch:pattern> <sch:pattern name="No Mixed Content"> <sch:rule context="*"> <sch:report test="child::ead:* and text()[string-length(normalize-space(.)) != 0]" >ERROR: Mixed content is not allowed</sch:report> </sch:rule> </sch:pattern>
Tools • Migration • DTD to Schema • 2002 to EAD X • Validation (e.g., Schematron) • Conversions? • “EAD Tidy” • EAD to RDF • Pipelines (XProc) • Packaging?