200 likes | 251 Views
Tutorial on XML Tag and Schema Registration in an ISO/IEC 11179 Metadata Registry. Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30 p.m. January 2003. Kathleen Gundry Science Applications International Corporation (SAIC). Overview.
E N D
Tutorial on XML Tag and Schema Registration in anISO/IEC 11179 Metadata Registry Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30 p.m. January 2003 Kathleen Gundry Science Applications International Corporation (SAIC)
Overview • Identify various information assets to be managed in registries. • Identify the relevant standards for management of those assets. • Present the benefits of combining the management functions in a single registry. • Present the architecture of a registry that might meet the various standards for managing a variety of organizational metadata. • Present the current plans to register XML objects in a 11179 metadata registry, the EPA’s Environmental Data Registry.
Purpose of a Metadata Registry • The purpose of a metadata registry is to collect and manage information about an organization’s information assets. • The registry is supposed to centralize that information, and improve the organization’s ability to manage it. • Registries should be able to promote reuse of data, identify opportunities for information integration, identify duplicate information collections, promote standardization.
Types of Metadata and Related Registry Standards • Data Elements: ISO/IEC 11179, Data Element Metadata Registry • Information Resources: Dublin Core • Data Sets, Spatial Data: FGDC Content Standard for Digital Geospatial Metadata • XML: OASIS/ebXML Registry standard
Central Registry Proliferation Enterprise Application Inventory XML Registry Data Element Metadata Document Management System Data Set Metadata
Taming Metadata Chaos at EPA Environmental Data Registry eXtensible Markup Language Metadata Environmental Information Management System XML data elements XML objects data sets Information Resources Registry System System of Registries all sources Terminology Reference System Substance Registry System Facility Registry System terms regulated substances facility data Environmental Information Warehouse Legacy Databases
Finding a Place for XML Metadata • OASIS and ebXML have combined their efforts to build a standard for an XML Registry. • A freeware, alpha version of an XML Registry, conformant with the current version of the standard is available from Sourceforge. • There are several commercial XML registry products, but none currently available completely conform to the standard. • Some organizations, including the Environmental Exchange Network, seek to link their XML metadata to other enterprise metadata, like data standard metadata.
Managing XML Metadata in the EDR • EPA and its State and Tribal Nation partners have a long term goal of building an Environmental Exchange Network based on environmental data standards. • The Environmental Exchange Network partners identified a need to link XML schema to related data standards. • With no available commercial products meeting that need, the group chose to register XML schema in the EDR as an interim solution. • XML components will be registered in the EDR, without changing the underlying ISO/IEC 11179 conformant metamodel.
Registering XML Tags • Conceptually, an XML tag = a data element. • XML tags are registered as designations (alternate names) for data elements. • In the EDR, standard XML tags are registered as designations for standard data elements • They use the standard data element name and follow the standard XML tag naming conventions • XML tags as data elements. • In the EDR, XML tags from XML schemas will be registered as data elements because the tags do not always have a one-to-one relationship with an application data element • The application XML tags can be linked to the application data elements.
Registering XML Schemas • XML Namespaces, XML Schemas, XML Datatypes, and other grouping XML objects will be registered as Classification Schemes. • 11179 defines Classification Scheme as “the descriptive information for the arrangement or division of objects into groups.” • In the EDR Classification Scheme is used to group other elementary objects, such as data elements and value domains. For example, an application system is registered as a Classification Scheme, documented with metadata (name, definition, abstract, submitter), and linked to related data elements and value domains .
Registering XML Schemas • Classification Scheme Types will be created and assigned for various grouping XML components. • Linkages between XML objects such as Trading Partner Agreements, XML schema, and other information resources can be stored. • In the EDR, Classification Schemes can be related to other Classification Schemes through Classification Scheme items, creating hierarchical relationships.
11179 Model Elements XML schema XML data type Other XML “group objects” Classification Scheme through classification scheme item Elementary XML tag metadata - definition - type - size Data Element Value Domain enumerations Names or “XML tags” For elementary tags or XML group objects Designation Indicator for designation type - XML tag - Registry name - XML schema Context
Linking XML Tags to Schemas • The long-term plan is to link XML tags to related XML schemas through data elements. • This linkage is supported by the 11179 metamodel. • It can be used to track data standards conformance of XML schemas. • It can be used as a data administration tool to understand what information is being exchanged by the different XML schemas—to foster integration and harmonization.
Tracking XML Schema Status • The ISO/IEC 11179 standard provides registration and administrative statuses to track metadata objects through the review and approval process. • In the EDR, an additional status field will be added to record the progress of the XML schema through the review and approval process to track the progress of the schema through the process, using the W3C defined statuses.
XML Metadata in the EDR • The EDR will be used to store both XML tags and XML documents, with no changes to the underlying model. • Some new queries will be developed to allow users to search for XML schema based on submitting organization, subject, and status—these are simply modifications to current application system/information resource queries.
Registry Model Comparisons • The ISO/IEC 11179 and the OASIS/ebXML metamodels have much in common. • By extending one of the models, an organization could manage both data element and XML objects in a single registry.
Future of XML Management • XML metadata is related to data element metadata, so some linkages between the metadata is desirable. • The Environmental Exchange Network is committed to building XML schemas based on approved data standards. • Data standards will be stored in a 11179 metadata registry. • XML schemas need to be related to the data element metadata either through storage in a single registry, or through linkages between two registries.
For More Information Kathleen Gundry Science Applications International Corporation Kathleen.g.gundry@saic.com A