100 likes | 141 Views
PREMIS Controlled vocabularies. Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress rgue@loc.gov PREMIS Implementation Fair San Francisco, CA October 7, 2009. Use of controlled vocabularies in PREMIS.
E N D
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress rgue@loc.gov PREMIS Implementation Fair San Francisco, CA October 7, 2009
Use of controlled vocabularies in PREMIS Many semantic units include a data constraint: “Value should be taken from a controlled vocabulary” Most have “suggested values” There is not currently a way to enforce selection of a controlled value LC is establishing a mechanism to develop vocabularies and to validate them
Controlled vocabularies databases Library of Congress is establishing databases with controlled vocabulary values for standards that it maintains Controlled lists are represented using SKOS as well as alternative syntaxes
About SKOS Simple Knowledge Organization System RDF application used to express knowledge organization systems such as classifications, thesauri, taxonomies, and the concepts within Allows distributed, decentralized management of KOS through Linked Data-inspired application. All concepts and schemes require a URI
The SKOS data model (Classes) ConceptSchemes (e.g., published vocabularies, thesauri, code lists, etc.) Concepts (individual entries or terms within the larger vocabulary) Collections (logical groupings of Concepts)
Advantages to using SKOS SKOS has a defined element set which is particularly relevant for controlled vocabularies Relationships between entries in a concept scheme can be expressed (broader, narrower, etc.) Relationships between entries in different concept schemes can be expressed (exactMatch, related) Having a dereferencable URI for concepts and their concept schemes enhances the ability to provide web services for consumers of these standards
Reasons for developing a web service for vocabularies Facilitate development and maintenance process for vocabularies Make controlled lists openly available Provide comprehensive information about controlled values Experiment with semantic web technologies and linked data Expose vocabularies to wider communities
Using controlled vocabularies in PREMIS Semantic units that specify a controlled vocabulary: realized as “concept scheme” Each value: realized as SKOS instance Implementers add their values within a concept scheme Mechanism to import the values into the PREMIS XML schema to enable validation A concept in multiple standards may be established for broad usage in a concept scheme There has been some interest in exploring an RDF version of PREMIS for semantic web applications Those wishing to experiment: http://id.loc.gov
Introducing id.loc.gov Library of Congress Authorities & Vocabularies service: http://id.loc.gov Allows both human-oriented and programmatic access to LC-promulgated authorities and vocabularies. First offering is Library of Congress Subject Headings, but more to come: e.g. preservation events, cryptographic hash functions Demo