Extended Metadata Registry (XMDR)
Interagency/International Cooperation on Ecoinformatics, Brussels, Belgium, September 2004
Bruce Bargmeyer, +1 (510) 495-2905, bebargmeyer@lbl.gov
Past, Present, Future • Figure labels: data sources at EEA, DOE, DoD, EPA and others, each holding text and coded data under its own subject terms, some in English (environ, agriculture, climate, human health, industry, tourism, soil, water, air) and some in Spanish (ambiente, agricultura, tiempo, salud humana, industria, turismo, tierra, agua, aire) • Users: lots of users, lots of information systems, lots of data sources
Data Standards • Avoid a combinatorial explosion of data content, description, and metadata arrangements for information storage, access and interchange. Data standards and metadata registries can help.
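The combinatorial explosion can be made concrete with a quick count. The sketch below is illustrative only: it assumes that, without a standard, every system needs a one-way translation to every other system, and that with a shared standard each system needs one mapping to it and one from it. The function names are not from the slides.

```python
def point_to_point_mappings(n_systems: int) -> int:
    """One-way translations needed when every system maps directly to every other."""
    return n_systems * (n_systems - 1)


def shared_standard_mappings(n_systems: int) -> int:
    """Translations needed when each system maps only to and from one shared standard."""
    return 2 * n_systems


# Compare the two approaches as the number of systems grows.
for n in (5, 20, 100):
    print(n, point_to_point_mappings(n), shared_standard_mappings(n))
```

Under these assumptions the direct approach grows quadratically while the shared-standard approach grows linearly, which is the motivation for data standards and metadata registries.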
Data Element Concept • Name: Country Identifiers • Unique ID: 5769 • Other attributes (blank on the slide): Context, Definition, Conceptual Domain, Maintenance Org., Steward, Classification, Registration Authority, Others
Data Elements • Three data elements (Unique IDs 4572, 3820, 1047), each with Name, Context, Definition, Value Domain, Maintenance Org., Steward, Classification, Registration Authority, Others • Value domains:
ISO 3166 English Name: Afghanistan, Belgium, China, Denmark, Egypt, France, Germany, …
ISO 3166 3-Alpha Code: AFG, BEL, CHN, DNK, EGY, FRA, DEU, …
ISO 3166 3-Numeric Code: 004, 056, 156, 208, 818, 250, 276, …
11179 Metadata Registry • Afghanistan, AFG, 004 • Belgium, BEL, 056 • China, CHN, 156 • Denmark, DNK, 208 • Egypt, EGY, 818 • France, FRA, 250 • Germany, DEU, 276 • …
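The country example can be sketched in code as one data element concept with several value domains. This is a minimal illustration, not the ISO/IEC 11179 metamodel: the class names, the `translate` helper, and the position-based alignment of values are assumptions made for the sketch. Only the concept ID 5769, the value domain names, and the seven sample countries come from the slides.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ValueDomain:
    """One permitted representation of the concept's values."""
    name: str
    values: Tuple[str, ...]


@dataclass(frozen=True)
class DataElementConcept:
    """A concept plus the value domains that represent it (hypothetical model)."""
    unique_id: int
    name: str
    domains: Tuple[ValueDomain, ...]

    def translate(self, value: str, source: str, target: str) -> str:
        """Convert a value between value domains, assuming values are aligned by position."""
        by_name = {domain.name: domain for domain in self.domains}
        index = by_name[source].values.index(value)
        return by_name[target].values[index]


COUNTRY = DataElementConcept(
    unique_id=5769,  # ID shown on the slide for "Country Identifiers"
    name="Country Identifiers",
    domains=(
        ValueDomain("ISO 3166 English Name",
                    ("Afghanistan", "Belgium", "China", "Denmark",
                     "Egypt", "France", "Germany")),
        ValueDomain("ISO 3166 3-Alpha Code",
                    ("AFG", "BEL", "CHN", "DNK", "EGY", "FRA", "DEU")),
        ValueDomain("ISO 3166 3-Numeric Code",
                    ("004", "056", "156", "208", "818", "250", "276")),
    ),
)

print(COUNTRY.translate("BEL", "ISO 3166 3-Alpha Code", "ISO 3166 3-Numeric Code"))
```

Because all three value domains hang off the same concept, a system that stores alpha codes and one that stores numeric codes can exchange data without a bespoke mapping between them.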
Metadata Registries: Semantics Management Evolution • Database (schema) integration • System design • Data use - metadata • Warehouse support – schema and metadata • XML support (schema) • “Backed into” terminology support • Next: Semantics servers for the Semantic Web and semantics-based computing
Metadata Registry • Figure labels: Terminology, Thesaurus, Themes, Ontology, GEMET • Data Standards, Structured Metadata, Metadata Registries • ISO/IEC 11179 Metadata Registries
Elements of Terminology • Concept: “any of several game fishes of the genus Salmo, related to the salmon...” • Sign: trout, Salmo trutta, brown trout, truite • Object
Terminology • Concept (UIN=6349): “any of several game fishes of the genus Salmo, related to the salmon...” • Terms and Context: trout (common name), Salmo trutta (scientific name), truite (French name)
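The concept, terms, and context structure on this slide can be sketched as a small record type. This is an illustrative model only: the `Concept` class and `find_concept` helper are hypothetical names, while the UIN, definition, terms, and contexts are taken from the slide.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Optional


@dataclass
class Concept:
    """A concept identified by a UIN, with one term per context (hypothetical model)."""
    uin: int
    definition: str
    terms: Dict[str, str] = field(default_factory=dict)  # context -> term

    def term_for(self, context: str) -> str:
        """Return the term used for this concept in the given context."""
        return self.terms[context]


def find_concept(concepts: Iterable[Concept], term: str) -> Optional[Concept]:
    """Find the concept that any of its terms refers to, ignoring case."""
    needle = term.casefold()
    for concept in concepts:
        if any(t.casefold() == needle for t in concept.terms.values()):
            return concept
    return None


trout = Concept(
    uin=6349,
    definition="any of several game fishes of the genus Salmo, related to the salmon...",
    terms={
        "common name": "trout",
        "scientific name": "Salmo trutta",
        "French name": "truite",
    },
)

match = find_concept([trout], "Truite")
print(match.uin if match else None)
```

The point of the structure is that three different strings resolve to one identifier, so systems that use different names can still agree on what they are talking about.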
Data Elements • Name: trout species • Definition: The names of species of trout. • Values: brook trout (Salvelinus fontinalis), brown trout (Salmo trutta), cutthroat trout (Oncorhynchus clarkii) • Concept (UIN=6349), Terms and Context: Brown trout (common name), Salmo trutta (scientific name), truite (French name)
Systems: STORET, Envirofacts, . . . • Federal Register, Regulations, Reports • XML Schemas, EDI Messages • Data Interchange • Documents • Publishing • DBMS Query • Data Elements • Concept (UIN=6349), Terms and Context: Brown trout (common name), Salmo trutta (scientific name), truite (French name)
Search Engine • Documents, Data, Thesaurus • Concept (UIN=6349), Terms and Context: Brown trout (common name), Salmo trutta (scientific name), truite (French name) • Search Example: Fish, Trout, Salmo trutta, Brown Trout
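The search example suggests thesaurus-driven query expansion: a search for a broad term also retrieves material indexed under narrower or equivalent terms. The sketch below is a guess at that behavior, not the engine shown on the slide. The `NARROWER` and `EQUIVALENT` tables and the `expand_query` function are hypothetical, populated only with the terms that appear in the example.

```python
from collections import deque
from typing import Dict, List

# Hypothetical thesaurus relations built from the slide's example terms.
NARROWER: Dict[str, List[str]] = {
    "fish": ["trout"],
    "trout": ["brown trout"],
}
EQUIVALENT: Dict[str, List[str]] = {
    "brown trout": ["Salmo trutta", "truite"],
}


def expand_query(term: str) -> List[str]:
    """Return the term plus all narrower and equivalent terms, breadth first."""
    seen: List[str] = []
    queue = deque([term])
    while queue:
        current = queue.popleft()
        if current in seen:
            continue  # guard against cycles in the thesaurus
        seen.append(current)
        queue.extend(NARROWER.get(current, []))
        queue.extend(EQUIVALENT.get(current, []))
    return seen


print(expand_query("fish"))
```

A search engine could then match documents against every term in the expanded list rather than the literal query string.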
EDEN-IW Intelligent Information Services (IIS) • Query Agent, Broker, Mediator, Resource Agent • Central Mapping, Local Mapping • Ontology • Concept (UIN=6349), Terms and Context: Brown trout (common name), Salmo trutta (scientific name), truite (French name) • Example: fish, trout, brown trout
Semantic Mapping • Global Ontology: Observation, Station, Unit, Determinant, AnalyticalFraction, TimeStamp, Medium • Local Ontology: NERITime, NERIObservationCharacteristics, NERIStation • Local DB Schema: Table(x), Table(y), Table(z), Table(m)
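The layered mapping on this slide (global ontology to local ontology to local database schema) can be sketched as two lookup tables. The slide does not say which global concept maps to which local concept or table, so every pairing below is a hypothetical placeholder chosen for illustration, as is the `resolve` function.

```python
from typing import Dict, List, Tuple

# Hypothetical pairings; the slide names the concepts but not the mappings.
GLOBAL_TO_LOCAL: Dict[str, str] = {
    "Station": "NERIStation",
    "TimeStamp": "NERITime",
    "Observation": "NERIObservationCharacteristics",
}
LOCAL_TO_TABLE: Dict[str, str] = {
    "NERIStation": "Table(x)",
    "NERITime": "Table(y)",
    "NERIObservationCharacteristics": "Table(z)",
}


def resolve(global_concepts: List[str]) -> Tuple[Dict[str, Tuple[str, str]], List[str]]:
    """Map global concepts to (local concept, table); collect those with no mapping."""
    mapped: Dict[str, Tuple[str, str]] = {}
    unmapped: List[str] = []
    for concept in global_concepts:
        local = GLOBAL_TO_LOCAL.get(concept)
        if local is None:
            unmapped.append(concept)
        else:
            mapped[concept] = (local, LOCAL_TO_TABLE[local])
    return mapped, unmapped


print(resolve(["Station", "Medium"]))
```

Keeping the two steps separate means a query written against the global ontology never needs to know local table names, and a local schema change touches only the second table.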
I/ICE Participants: Ecoterm, Government, State/Local, Private Enterprise, Academe • Terminology Sources: GEneral Multilingual Environmental Thesaurus (GEMET) and CNR Earth Thesaurus; DOE, NIH and NCI Safety and Health Concepts/terms • Data Elements • Concept (UIN=6349), Terms and Context: Brown trout (common name), Salmo trutta (scientific name), truite (French name)
Terminology Management • “a category of vertebrate, cold-blooded craniate animals with permanent gills...” • Dictionary, Keyword, Thesaurus, Ontology • DBMS/EDI/Documents, Data Elements • IIS, Search Engine, Semantics Server • 11179 Metadata Registry
Purpose of XMDR • Extend Semantics Management Capabilities of ISO/IEC 11179 • Test & Demo Extended Capabilities in a Reference Implementation • Produce Design for Next Generation Operational 11179 Registries • Propose Revisions to 11179 Parts 2 & 3 (Ver. 3) • Adapt & Adopt Emerging (Semantic) Technologies • Help Resolve Registration & Interrelation Issues for Complex Metadata Standards Forging Semantics Based Computing
Project Background • Collaborative, Interagency Effort • DOD, EPA, LBNL, USGS, NCI, Mayo Clinic…Others? • Draws on and Contributes to Interagency/International Cooperation on Ecoinformatics • Involves International, National, State, Local Government Agencies, other Organizations • Recognizes Great Potential of Semantics-based Computing, Management of Metadata • Improving Collection, Maintenance, Dissemination, Processing of Very Diverse Data Structures • Collaboration Arises from Need to Share Diverse Data Across Multiple Organizations • Project Duration Expected to be July 04 – Jun 05, + Many Players, Many Interests…Shared Context
Concept Of Operations • Service Oriented Architecture • Enables Heterogeneous, Disparate Systems to Interoperate • Agreement in the Interface, Not the Implementation • Publish, Find, Bind • Standards Based Design • Lifecycle Application Support • Abstract Technology Commonalities • Open Standards, Technology Agnostic • Durable: Used for Current and Future Technologies • Semantic Web Service • Publish, Find, Bind…Automatically • Component of Semantic Web • Bootstrap Semantic Web?
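The publish, find, bind pattern named above can be sketched with an in-memory registry. This is a toy illustration of the pattern, not the XMDR design: the `ServiceRegistry` class, the interface name, and the lookup service are all hypothetical.

```python
from typing import Callable, Dict, Optional

Service = Callable[[str], str]


class ServiceRegistry:
    """Toy registry: providers publish by interface name, clients find by it."""

    def __init__(self) -> None:
        self._services: Dict[str, Service] = {}

    def publish(self, interface: str, service: Service) -> None:
        """Register an implementation under an agreed interface name."""
        self._services[interface] = service

    def find(self, interface: str) -> Optional[Service]:
        """Look up an implementation; None if nothing has been published."""
        return self._services.get(interface)


def alpha3_lookup(country_name: str) -> str:
    """Hypothetical provider: map a country name to its ISO 3166 alpha-3 code."""
    table = {"Belgium": "BEL", "France": "FRA"}
    return table[country_name]


registry = ServiceRegistry()
registry.publish("country-name-to-alpha3", alpha3_lookup)  # publish

service = registry.find("country-name-to-alpha3")  # find
if service is not None:
    print(service("Belgium"))  # bind: call through the agreed interface
```

The client depends only on the interface name and call signature, which is the sense of "Agreement in the Interface, Not the Implementation": the provider can be replaced without changing the client.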
Potential Standards/Technologies • DBMS: Object, XML, Relational, RDF/Graph, Logic, Text, Document, Multimedia • Knowledge Representation: Web Ontology Language (OWL), Simple Common Logic (SCL) • Middleware/Messaging: Cocoon 2, Jini, CoABS, JMS, XMLBlaster, SOAP • XML [Semantic] Web Services: Axis, JWSDP • Agent Development: ABLE, JADE • Engines/Servers: OMS (IBM), Federator/OMS (OWI), Jess
ISO/IEC 11179 Expressed as an Ontology
<?xml version="1.0" encoding="ISO-8859-1"?>
<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
    xmlns:owl="http://www.w3.org/2002/07/owl#"
    xmlns="http://www.owl-ontologies.com/unnamed.owl#"
    xml:base="http://www.owl-ontologies.com/unnamed.owl">
  <owl:Ontology rdf:about=""/>
  <owl:Class rdf:ID="Registrar">
    <rdfs:subClassOf rdf:resource="http://www.w3.org/2002/07/owl#Thing"/>
    <rdfs:subClassOf>
      <owl:Restriction>
        <owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>
        <owl:onProperty>
          <owl:ObjectProperty rdf:ID="contact"/>
        </owl:onProperty>
      </owl:Restriction>
    </rdfs:subClassOf>
    <rdfs:subClassOf>
      <owl:Restriction>
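The one restriction that is complete on the slide says that a Registrar has exactly one contact. As a rough sketch of how a program might read such a constraint, the snippet below parses a closed copy of just that restriction with Python's standard library. The fragment is an assumption made for the example: the XML declaration and the second, truncated restriction are left out, and closing tags have been added so that it parses. The `cardinality_restrictions` helper is a hypothetical name.

```python
import xml.etree.ElementTree as ET

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "owl": "http://www.w3.org/2002/07/owl#",
}

# Closed copy of the first restriction only; the slide's listing is truncated.
FRAGMENT = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
         xmlns:owl="http://www.w3.org/2002/07/owl#">
  <owl:Class rdf:ID="Registrar">
    <rdfs:subClassOf>
      <owl:Restriction>
        <owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>
        <owl:onProperty>
          <owl:ObjectProperty rdf:ID="contact"/>
        </owl:onProperty>
      </owl:Restriction>
    </rdfs:subClassOf>
  </owl:Class>
</rdf:RDF>
"""


def cardinality_restrictions(xml_text: str):
    """Return (class, property, cardinality) for each exact-cardinality restriction."""
    root = ET.fromstring(xml_text.strip())
    rdf_id = "{%s}ID" % NS["rdf"]
    results = []
    for cls in root.findall("owl:Class", NS):
        for restriction in cls.findall("rdfs:subClassOf/owl:Restriction", NS):
            card = restriction.find("owl:cardinality", NS)
            prop = restriction.find("owl:onProperty/owl:ObjectProperty", NS)
            if card is not None and prop is not None:
                results.append((cls.get(rdf_id), prop.get(rdf_id), int(card.text)))
    return results


print(cardinality_restrictions(FRAGMENT))
```

A generic XML parser is enough for this narrow extraction; real ontology processing would normally use an RDF or OWL toolkit, since the same OWL statements can be serialized in several different XML shapes.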
Potential Content Domains • Environmental (Ecoterm, GBIF, …) • Biomedical • Chemical • Geographic Information Systems • Bibliographic Ontologies/Metadata Standards • General Terminologies/Ontologies • Economic Code Sets • Other Diverse Domains, Structures…Representative Samples
Other Calendar Events Involved in Other Disciplines
Users: World Wide Web, Companies, Universities, Agencies, Others • Environmental Data Grid: Data Services, Metadata Registries, Data Standards, Structured Metadata • Environmental Semantics Grid: Semantic Services, Terminology, Thesaurus, Ontology, Taxonomy, Agent systems, Semantic Based Computing • Environmental Computer Grid: Computation Services; Software: Models, Visualization, Analysis; High Performance, cluster, Personal
XMDR & I/ICE • How do we collaborate? • Ecoterm • GBIF • EPA EDR/TRS • EEA Data dictionary • NIH/NCI • Agriculture