1 / 26

WG 2 Workshop on New Metadata Standards

WG 2 Workshop on New Metadata Standards. May 23, 2008 Sydney Australia Leader: Denise Warzel.

Download Presentation

WG 2 Workshop on New Metadata Standards

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. WG 2 Workshop on New Metadata Standards May 23, 2008 Sydney Australia Leader: Denise Warzel

  2. Many organizations have recognized the need to develop and share programmatically accessible terminologies, services and data collection forms. This workshop will focus on building a consensus and plan for advancing the development of new metadata registry standards for these types of objects: Services, Terminologies and Forms. To enable data processing systems to be written to discover and use these types of resources accurate, unambiguous and verifiable metadata is a prerequisite. With appropriate metadata standards for these types of objects, systems could be designed to allow owners to register and share electronically the essential characteristics necessary for common understanding of what and how these resources are intended for use. While the ISO/IEC 11179 addresses unambiguous representation and registration of data elements, no such standard exists for these important objects that are essential for developing interoperable information systems. A standard metadata framework to describe the essentialcharacteristics of these types of objects in ways that can be compared and interpreted is needed. The envisioned metadata standards would be similar to the ISO/IEC 11179 Metadata Registries standard Part 3 in which the registry and basic attributes of each type of object are defined and can be specified and registered in a metadata registry. This session will facilitate developing a common understanding of some of these characteristics for each type of object. Participants will share existing metamodels and emerging specifications, discuss common needs, approaches and priorities and possible next steps that could lead to the development of new ISO standards in these areas. This session would be of interest to information developers, information managers, data administrators, standards developers and others who are responsible for designing systems in which these types of metadata objects are understandable and shareable.

  3. Many organizations have recognized the need to develop and share programmatically accessible terminologies, services and data collection forms. This workshop will focus on building a consensus and plan for advancing the development of new metadata registry standards for these types of objects: Services, Terminologies and Forms. Type of Metadata Standard: (Service, Terminology, Form or Other): Existing/Recommended metamodel: Benefits: Use Cases/Justification if this standard existed: Known initiatives/individuals that could be pooled to create a Functional Specification: Plans for Implementation: Suggestednext steps if known:

  4. Goals for Today • Discuss new areas for development of metadata standards • Specifically: Terminology, Forms, Services • Reach common understanding about what each of these metamodels would describe (agree what we mean by “Terminology”, “Forms”, “Services”) • Reach consensus on priorities/areas of common interests • Learn how to progress something into a new standard • Form a Strategy for progressing these new standads

  5. Areas of Interest • Registry Metamodel • Profiles that would allow registry operators to ‘declare’ themselves, provide attributes that would enable others to understand certain characteristics about the registry, such as business rules, naming conventions, contact information etc. • Issue 127: 2005 Global Attributes for a Registry to be used with an ROR • (ISO 29002) • What is contained in the Registry (e.g. What type of registry is it? Image Registry, Service Registry, 11179 Metadata Registry, etc) • Establish “Trust” • Interest is registry for CWM mappings (Baba)

  6. ‘Terminology’ • There are different types of terminologies • This would cover all types of terminologies: • Taxonomies, Formal Ontologies, Standardized list of terms, Vocabularies, Dictionaries, etc. • The metadata would describe the Terminology as a WHOLE – (NOT the model for the terms in the terminology)

  7. Terminology Issues • Agree on a common definition of ‘Terminology’ • Look at TC 37 work (Sue Ellen Wright) • Add proper Provenance class • Related Work: • Registry for Knowledge Organization Systems (NKOS) – Gail Hodge • ‘Rights’ – UMLS has both public and restricted content • UK Joint Information Systems committee (JISC) – Dennis Nicolson(?), Doug Tudhope • NCBO – BioPortal – resource quality, attributes • Government Terminology Services – each country has some terminologies they are serving and have an implicit metamodel that could be harmonized • (JoAnne Evans – Monash University) • Open Data Linking

  8. Model - Overview

  9. Areas of Interest • Services Metamodel • Profiles that would allow a Service Registry operators to ‘declare’ themselves, provide attributes that would enable others to understand certain characteristics about the registry, such as business rules, naming conventions, contact information etc • Establish “Trust”

  10. ‘Service’ definition • OASIS (organization) defines service as "a mechanism to enable access to one or more capabilities, where the access is provided using a prescribed interface and is exercised consistent with constraints and policies as specified by the service description."

  11. Service Metadata Issues • myGrid Service Model • caGrid Service Metamodel • Need a way to describe a “Registry Service”?? Should use WSDL – service metadata would provide a way to standardize a description of the described in a WSDL • IBM has introduced Service Science as a new curriculum – may have some information that would be compatible

  12. caGrid Service Metamodel

  13. 11179:Administered Item caGrid Service Metamodel

  14. Courtesy: K. Wolstencroft et al The myGrid ontology: bioinformatics service discovery, Int. J. Bioinformatics Research and Applications, Vol. x, No. x, 200x .

  15. not sure if ‘format’ is the similar to ‘dimensionality’ • Not in caGrid Service Domain model: • Operation.task • Operation.method • Operation.resource • Operation.relationship (shim services) caGrid Service Metamodel match new Courtesy: K. Wolstencroft et al The myGrid ontology: bioinformatics service discovery, Int. J. Bioinformatics Research and Applications, Vol. x, No. x, 200x . missing

  16. Service Types • Domain: Performs scientific function • Shim: Does not perform scientific function, but is needed to make one service work with another • E.g. Courtesy: K. Wolstencroft et al The myGrid ontology: bioinformatics service discovery, Int. J. Bioinformatics Research and Applications, Vol. x, No. x, 200x .

  17. myGRID Service Operation: • Operation.resource • Operation.method (algorithm) • Operation.task These concepts could come from myGrid ontology The concepts for each “Input” could come from the semantics registered for the UML class myGrid Portions courtsey: K. Wolstencroft et al The myGrid ontology: bioinformatics service discovery, Int. J. Bioinformatics Research and Applications, Vol. x, No. x, 200x .

  18. Comparison of myGrid semantics and caGrid Semantics • myGRID semantic categories • Informatics. captures the key concepts of data, data structures, databases and metadata. The data and metadata hierarchies in the ontology contain this information. • • Bioinformatics. This builds on informatics as well as data and metadata, there are domain-specific data sources (e.g., the model organism sequencing databases), and domain-specific algorithms for searching and analysing data (e.g., the sequence alignment algorithm, clustalw). The algorithm and data_resource hierarchies contain this information. • • Molecular biology. This includes the higher level concepts used to describe the bioinformatics data types used as inputs and outputs in services. These concepts include examples such as, protein sequence, and nucleic acid sequence. • • Tasks. A hierarchy describing the generic tasks a service operation can perform. Examples include retrieving, displaying and aligning. • • Formats. A hierarchy describing bioinformatics file formats. For example, fasta format for sequence data, or phylip format for phylogenetic data. • Informatics. Not sure if we have a corollary or need this. • • Bioinformatics. We could probably use theirs for attaching a concept to Operation algorithm • • Molecular biology. We have this already, its the concepts associated with the UML Class level • • Tasks. We could probably use theirs for Operation task. • • Formats. Probably useful, this is at the Service level – not sure if this is similar to the ‘dimensionality’ attribute we have at the input/output level? Courtesy: K. Wolstencroft et al The myGrid ontology: bioinformatics service discovery, Int. J. Bioinformatics Research and Applications, Vol. x, No. x, 200x .

  19. “Forms” Description • A structured set of metadata for the collection of information and for the ‘form’ itself • There are various types of forms • Statistical forms • Surveys, Questionnaires • Population Science Measures • Eg quality of life • Link to Rules for Form Completion • Rights – can the form be reused? By whom? What is the process for authorization, etc. • Includes Standard way to describe minimal Rendering information • To describes how the form looks • ‘Behavior’ – skip pattern • Form Structure • Standard way to associate the semantics of the Question and Answer Set (e.g. to associate an item on the form with a 11179 Data Element, DEC or Value Domain

  20. Forms MetaModelRelated Projects • OASIS • UN/CEFACT – Forms Metamodel • UN eDocs – Cross Border Trade ‘form templates’ – may have an implicit form metamodel • ISO 15000-5 ebXML • ISO 7372 • Adobe pdf form metamodel • MS InfoPath – has a metamodel for their forms • XForms - language for representing forms

  21. Forms Metadata • caDSR Protocol Forms Metadata • cancerGrid Protocol Forms • HL7 Clinical Document Architecture • Westat Survey/Instrument Metamodel • ??

  22. caDSR Protocol Forms Metamodel December 2007

  23. HL7 Clinical Document Architecture (CDA) Metamodel

  24. Metadata Standards Development • SC 32 and WG 2 Options for progressing these • ServiceMetadata, Forms Metadata, Terminology Metadata and Global Registry Metadata (RoR - Issue 127) • Also need a Service Specification for Registries • Work with Issue 127, or create another issue to put into 11179 • WG 2 ‘Study Period’ • Look at this and decide what approach we should take • New part of 11179? New Standard? • ‘Study Period’ involves more open participation, not as formal as a WG 2 meeting • Could be proposed as a new Study Period next week as a ‘Resolution’ • Needs a Leader, provides an opportunity to circulate something for people to “sign-up” - Could Report back at the next WG 2 meeting in the Winter 2008 (5-8 months) • ISO 19115 (TC 211) Geographic Information • TC 184/SC 4 and ISO 8000 Data Quality – Quality of Services? Gerald Radack and Peter Benson • OMG Evan Wallace/Elisa Kendall • IT 4 Australian, New Zealand

More Related