Toward a Data Repository for Evolutionary Biology: <Metadata Issues> Jane Greenberg, Associate Professor, Director SILS/Metadata Research Center <MRC>, UNC-CH Jackson Dube, Visiting Scholar, SILS/MRC Ruth Monnig, Doctoral Research Assistant, SILS/MRC
Overview • Metadata defined • Role of metadata in a repository • Range of metadata standards • Principles and objectives • Domains • Architectural Layout • Issues • Discussion
Metadata • Data about the content, quality, condition, and other characteristics of data (FGDC Glossary, 1992) • Additional information necessary for data to be useful (Musik, 1997) • Structured, descriptive information about a resource (DCMI Glossary; Weibel, 1995)
Metadata types and properties *Resource = data = object = entity = document = data object
Why metadata? • Facilitate discovery of data objects • Permit use – intellectual and technical • Asset/object management and preservation • Security • Help advance the field of evolutionary biology
Range of published data objects • Table, graph • Dataset (supplementary data, entire data set) • Research methods, procedures • Coverage: temporal and spatial aspects • Agent/s: scientist/s, organizations • Project • Publication (journal volume, issue, pagination) • Related data objects (all these levels again and more…)
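One way to picture the range of data objects above is as a single record type whose fields follow the bullets. The sketch below is illustrative only: the class names, field names, and sample values are assumptions, not part of any repository schema discussed in these slides.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Coverage:
    """Temporal and spatial aspects of a data object (hypothetical fields)."""
    temporal: Optional[str] = None  # e.g. a date range as free text
    spatial: Optional[str] = None   # e.g. a place name or bounding box


@dataclass
class DataObjectRecord:
    """Illustrative metadata record; every field name is an assumption."""
    identifier: str
    object_type: str                # "table", "graph", "dataset", ...
    title: str
    agents: List[str] = field(default_factory=list)  # scientists, organizations
    project: Optional[str] = None
    publication: Optional[str] = None  # journal volume, issue, pagination
    methods: Optional[str] = None      # research methods, procedures
    coverage: Coverage = field(default_factory=Coverage)
    # Related objects are full records themselves, so every level repeats.
    related: List["DataObjectRecord"] = field(default_factory=list)


def count_objects(record: DataObjectRecord) -> int:
    """Count a record plus everything reachable through `related`."""
    return 1 + sum(count_objects(r) for r in record.related)


# Placeholder values, not real data.
table = DataObjectRecord("obj-2", "table", "Example table")
dataset = DataObjectRecord(
    identifier="obj-1",
    object_type="dataset",
    title="Example dataset",
    agents=["Example Scientist"],
    related=[table],
)
print(count_objects(dataset))
```

Because `related` holds records of the same type, a table can point to its dataset, which can point to a publication, and so on.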
Range of metadata standards • Schemes (just a few…): LSID; TEI Header; MARC bibliographic format; Dublin Core; EAD; FGDC/CSDGM; NBII; EML; DDI; ODRL (Creative Commons Profile); A Core; PREMIS • Characteristics: objectives and principles; domains (environment, object type/format); architectural layout (extent; level of complexity: flat, hierarchical; granularity)
Metadata continuum: TEI Header, MARC; Dublin Core; DDI; LSID; EML; FGDC; EAD
Range of metadata standards • Data structure standards: container/labels, semantics/data dictionaries • Data communication standards: mark-up/encoding, data interchanges • Data value standards: content representation, ontologies, authority files • Data syntax standards: encoding, element ordering, values • Data models, architectures/packaging: OAIS, METS, FRBR, DCAM, RDF, SEEK
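The distinction between structure, value, and syntax standards can be sketched with a tiny Dublin Core example. The element names (`title`, `creator`, `type`) and the namespace URI come from the Dublin Core Metadata Element Set. The `record` wrapper element, the `to_dc_xml` helper, and the three-term subset of the DCMI Type Vocabulary are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Structure standard: which elements (labels) a record may carry.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)

# Value standard: a controlled list of permitted values for one element.
# Only a small subset of the DCMI Type Vocabulary is listed here.
ALLOWED_TYPES = {"Dataset", "Image", "Text"}


def to_dc_xml(title: str, creator: str, dc_type: str) -> str:
    """Serialize three Dublin Core elements as XML (the syntax/encoding layer)."""
    if dc_type not in ALLOWED_TYPES:
        raise ValueError(f"type {dc_type!r} is not in the controlled list")
    root = ET.Element("record")  # hypothetical wrapper, not defined by Dublin Core
    for name, value in (("title", title), ("creator", creator), ("type", dc_type)):
        element = ET.SubElement(root, f"{{{DC_NS}}}{name}")
        element.text = value
    return ET.tostring(root, encoding="unicode")


# Placeholder values, not real data.
xml_text = to_dc_xml("Example dataset", "Example Scientist", "Dataset")
print(xml_text)
```

The same three elements could be encoded in another syntax (for example RDF) without changing the structure or value choices, which is why the slide treats these as separate layers.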
The Knowledge Network for Biocomplexity (KNB) *http://knb.ecoinformatics.org//data.html
The Knowledge Network for Biocomplexity (KNB): ontologies, data structures *http://knb.ecoinformatics.org//data.html
Issues • Cost: more metadata, more cost to produce; less metadata, cost to users • Metadata creation: Who, when, how? (Ensuring quality, timely creation.) What applications are needed? • Interoperability: What levels of interoperability do we need? With what systems? • Preservation • Open access
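The interoperability and cost questions above can be made concrete with a crosswalk, that is, a mapping from a richer local record to a simpler shared scheme. The sketch below is a minimal illustration: the local field names, the `CROSSWALK` table, and the sample values are assumptions, while the target names (`title`, `creator`, `coverage`, `source`) are Dublin Core elements.

```python
# Hypothetical local field -> Dublin Core element.
CROSSWALK = {
    "title": "title",
    "scientist": "creator",
    "organization": "creator",
    "temporal_coverage": "coverage",
    "spatial_coverage": "coverage",
    "journal_citation": "source",
}


def crosswalk(record: dict) -> tuple:
    """Map a local record to Dublin Core; return (mapped, unmapped field names)."""
    mapped = {}
    unmapped = []
    for key, value in record.items():
        target = CROSSWALK.get(key)
        if target is None:
            unmapped.append(key)  # detail that the simpler scheme cannot carry
        else:
            mapped.setdefault(target, []).append(value)
    return mapped, unmapped


# Placeholder values, not real data.
rich = {
    "title": "Example dataset",
    "scientist": "Example Scientist",
    "organization": "Example Lab",
    "temporal_coverage": "1990-2000",
    "sampling_method": "Example method",
}
simple, lost = crosswalk(rich)
print(simple)
print(lost)
```

Here the scientist and the organization both collapse into `creator`, and the sampling method has no target at all. The simpler scheme is cheaper to produce and exchange, but users pay for the missing detail.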
Questions for discussion • At what level do you think metadata needs to be applied to facilitate data object discovery/use? • What other issues come to mind?