1 / 42

Goal and Status of the OBO Foundry

Goal and Status of the OBO Foundry. Barry Smith. H ow create broad-coverage semantic annotation systems for biomedicine?. Semantic Web, Moby, wikis, crowd sourcing, NLP, etc . let a million flowers (and weeds) bloom to create integration rely on (automatically generated?) post hoc mappings

stu
Download Presentation

Goal and Status of the OBO Foundry

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Goal and Status of the OBO Foundry Barry Smith

  2. How create broad-coverage semantic annotation systems for biomedicine? • Semantic Web, Moby, wikis, crowd sourcing, NLP, etc. • let a million flowers (and weeds) bloom • to create integration rely on (automatically generated?) post hocmappings • The result is noisy

  3. Perhaps even deadly

  4. for science Foundry alternative:prospective standardization • develop high quality annotation resources in a collaborative, community effort • creating an evolutionary path towards improvement of terminologies of the sort we find elsewhere in science

  5. what makes GO so wildly successful ?

  6. The methodology of annotations • science basis of the GO: trained experts curating peer-reviewed literature • different model organism databases employ scientific curators who use the experimental observations reported in the biomedical literature to associate GO terms with gene products in a coordinated way

  7. A set of standardized textual descriptions of • cellular locations • molecular functions • biological processes • used to annotate the entities represented in the major biochemical databases • thereby creating integration across these databases and making them available to semantic search

  8. and also need to extend the GO by engaging ever broader community support for the addition of new terms and for the correction of errors need to extend the methodology to other domains, including clinical domains

  9. this requires that we • establish common rules governing best practices for creating ontologies and for using these in annotations • apply these rules to create a complete suite of orthogonal interoperable biomedical reference ontologies

  10. 2003 • shared portal + • low regimentation • http://obo.sourceforge.net NCBO BioPortal

  11. 2006 The OBO Foundryhttp://obofoundry.org/

  12. A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast to: post hoc mapping) established March 2006 12 initial candidate OBO ontologies – focused primarily on basic science domains several being constructed ab initio by influential consortia who have the authority to impose their use on large parts of the relevant communities.

  13. Building out from the original GO

  14. OBO Foundry = a subset of OBO ontologies, whose developers have agreed in advance to accept a common set of principles reflecting best practice in ontology development designed to ensure • tight connection to the biomedical basic sciences • compatibility • interoperability, common relations • formal robustness • support for logic-based reasoning The OBO Foundryhttp://obofoundry.org/

  15. CRITERIA • The ontology is OPENand available to be used by all. • The ontology is in, or can be instantiated in, a COMMON FORMAL LANGUAGE. • The developers of the ontology agree in advance to COLLABORATE with developers of other OBO Foundry ontology where domains overlap. CRITERIA The OBO Foundryhttp://obofoundry.org/

  16. UPDATE: The developers of each ontology commit to its maintenance in light of scientific advance, and to soliciting community feedback for its improvement. • ORTHOGONALITY: They commit to working with other Foundry members to ensure that, for any particular domain, there is community convergence on a single controlled vocabulary. CRITERIA The OBO Foundryhttp://obofoundry.org/

  17. for science orthogonality of ontologies implies additivity of annotations • if we annotate a database or body of literature with one high-quality biomedical ontology, we should be able to add annotations from a second such ontology without conflicts • AND WITHOUT THE NEED FOR MAPPINGS The OBO Foundryhttp://obofoundry.org/

  18. CRITERIA CRITERIA • IDENTIFIERS: The ontology possesses a unique identifier space within OBO. • VERSIONING: The ontology provider has procedures for identifying distinct successive versions to ensure BACKWARDS COMPATIBITY with annotation resources already in common use • The ontology includes TEXTUAL DEFINITIONS and where possible equivalent formal definitions of its terms.

  19. CRITERIA • CLEARLY BOUNDED: The ontology has a clearly specified and clearly delineated content. • DOCUMENTATION: The ontology is well-documented. • USERS: The ontology has a plurality of independent users. The OBO Foundryhttp://obofoundry.org/

  20. CRITERIA • COMMON ARCHITECTURE: The ontology uses relations which are unambiguously defined following the pattern of definitions laid down in the OBO Relation Ontology.* • * Smith et al., Genome Biology 2005, 6:R46 The OBO Foundryhttp://obofoundry.org/

  21. top level mid-level domain level Basic Formal Ontology (BFO) Extension Strategy – Downward Population

  22. OGMS Downward Population + Hub-Spokes Strategy

  23. OGMS Cardiovascular Disease Ontology Genetic Disease Ontology Cancer Disease Ontology Genetic Disease Ontology Immune Disease Ontology Environmental Disease Ontology Oral Disease Ontology Infectious Disease Ontology …

  24. OGMS Cardiovascular Disease Ontology Genetic Disease Ontology Cancer Disease Ontology Genetic Disease Ontology Immune Disease Ontology Environmental Disease Ontology Oral Disease Ontology Infectious Disease Ontology …

  25. BFO, OGMS, and IDO • Material Entity • Disposition • Process • Disorder • Disease • Disease Course • Infection • Infectious Disease • Infectious Disease Course

  26. OGMS Cardiovascular Disease Ontology Genetic Disease Ontology Cancer Disease Ontology Genetic Disease Ontology Immune Disease Ontology Environmental Disease Ontology Oral Disease Ontology Infectious Disease Ontology IDO Staph Aureus IDO MRSA IDO Australian MRSA IDO Australian Hospital MRSA …

  27. How IDO evolves IDOMAL IDOCore IDOHIV CORE and SPOKES: Domain ontologies IDOFLU IDORatSa IDORatStrep IDOStrep IDOSa SEMI-LATTICE: By subject matter experts in different communities of interest. IDOMRSA IDOAntibioticResistant IDOHumanSa IDOHumanStrep IDOHumanBacterial

  28. Status Successes New ontologies being added to the OBO library Advance in cross-product methodology

  29. Building out from the original GO

  30. Population-level ontologies

  31. Environment (EnvO, EO) Population-level ontologies

  32. Successes The OBO Foundry strategy for ontology collaboration and reuse is being replicated in major grant-funded projects

  33. OBO Foundry approach extended into other domains FUNDED

  34. Successes Huge and continuing expansion in the awareness of the need for re-using ontologies Huge and continuing expansion in ontology software created to support Foundry efforts (Ontobee, Mireot, …)

  35. Immunology Database and Analysis Portal (ImmPort)

  36. Current status Coordinating editors: Michael Ashburner Chris Mungall Suzanna Lewis Alan Ruttenberg Richard Scheuermann Barry Smith

  37. New operations committee • https://code.google.com/p/obo-foundry-operations-committee/wiki/OutreachWG Mathias Brochhausen Melanie Courtot Melissa Haendel Janna Hastings Chris Mungall Alan Ruttenberg Ramona Walls

  38. Ontologies admitted to full membership afte first phase of reviews • CHEBI: Chemical Entities of Biological Interest • GO: Gene Ontology • PATO: Phenotypic Quality Ontology • PRO: Protein Ontology • XAO: Xenopus Anatomy Ontology • ZFA: Zebrafish Anatomy Ontology

  39. Current status Next round of candidates for review OGMS: Ontology for General Medical Science OBI: Ontology for Biomedical Investigations CL: Cell Ontology IDO: Infectious Disease Ontology

  40. Ontology for General Medical Science Jobst Landgrebe (former Co-Chair of the HL7 Vocabulary Group, now Head of Datamining at Allianz Healthcare): • “the best ontology effort in the whole biomedical domain by far”

More Related