2.29k likes | 2.61k Views
How to Build an Ontology. Barry Smith http://ontology.buffalo.edu/smith. Options. Ontology of Experiments (proper treatment of utility classes), PATO, Upper-Level Ontologies (SUMO, DOLCE, BFO) OBO Relation Ontology GO Evidence Codes Functions, Disease
E N D
How to Build an Ontology • Barry Smith • http://ontology.buffalo.edu/smith
Options • Ontology of Experiments (proper treatment of utility classes), PATO, Upper-Level Ontologies (SUMO, DOLCE, BFO) • OBO Relation Ontology • GO Evidence Codes • Functions, Disease • BioPAX Level 2 Documentation – Commentary
Preamble • Ontologies vs. Data Structures • adapted from Alan Rector et al., Binding Ontologies and Coding Systems to Electronic Health Records and MessageKR-MED 2006
Medical IT’s odd organisational structure • Separate / independent development • Medical Ontologies / Terminologies • SNOMED, GALEN, NCI thesaurus, potential OBO Disease Ontology, etc. • Medical information models • HL7 messages • OpenEHR Archetypes
Data structures and what they carry information about are different • Information models and ontologies are at different levels • The purpose of an ontology is to represent the world • The purpose of an information structure is to specify valid data structures structures to carry information about that world • To constrain the data structures to just those which a given software system can process
Data structures and what they carry information about have different characteristics • All persons have a sex • However not all data structures about people have a field for sex • Information structures are intrinsically closed • Valid structures can be exhaustively and completely described (up to recursion) • Ontologies are intrinsically open • We can never describe the world completely
Representing Information Models and Codes:Basic approach • An information model can be thought of asa logical theory of classes of information structures • The instances of the classes are concrete data structures - EHRs, messages, etc - carrying data about specific patients, tests, organisations, cases of disease, ...
Ontologies • Ontologies represent entities in the world • Cases of diabetes • Patients • Insulin metabolism • Islet cells • The instances in data structures are data items in human artefacts • Information structures of associations and attributes, elements, etc.
“ontology” Diabetes Diabetes Diabetes Diabetes Metabolicdisorder Metabolicdisorder DiabetesType 1 DiabetesType 1 DiabetesType 2 DiabetesType 2 Model of data structures in InformationModel Model of codes in InformationModel Participa-tion Observa-tion code_for_metabolic_disorder is_subcode_of DiabetesObserva-tion code_for_diabetes ONLY is_subcode_of topic is_subcode_of Type 1DiabetesObserva-tion code_for_diabetes_type_1 code_for_diabetes_type_2 ONLY diagnosis World vs. Data structure
National Center for Biomedical Ontology • $18.8 mill. NIH Roadmap Center • Stanford Medical Informatics • University of San Francisco Medical Center • Berkeley Drosophila Genome Project • Cambridge University Department of Genetics • The Mayo Clinic • University at Buffalo Department of Philosophy
From chromosome to disease
genomics • transcriptomics • proteomics • reactomics • metabonomics • phenomics • behavioromics • connectomics • toxicopharmacogenomics • bibliomics • … legacy of Human Genome Project
where in the body ? what kind of disease process ? need for semantic annotation of data
how create broad-coverage semantic annotation systems for biomedicine? covering: in vitro biological phenomena model organisms humans
natural language labels to make the data cognitively accessible to human beings
compare: legends for maps compare: legends for maps
what lesion ? what brain function ?
ontologies are legends for mathematical equations xi = vector of measurements of gene i k = the state of the gene ( as “on” or “off”) θi = set of parameters of the Gaussian model ... ...
The OBO Foundry Idea GlyProt MouseEcotope sphingolipid transporter activity DiabetInGene GluChem
annotation using common ontologies yields integration of databases GlyProt MouseEcotope Holliday junction helicase complex DiabetInGene GluChem
annotation using common ontologies can yield integration of image data
annotation using common ontologies can support comparison of image data
maps may be correct by reflecting topology, rather than geometry
an image can be a veridical representation of reality a fully labeled image can be an even more veridical representation of reality
cartoons, like maps, always have a certain threshold of granularity
grain resolution serves cognitive accessibility we transform true imagesinto true cartoons
pathways can be represented at different levels of granularity
Joint capsule Netter
Holes and Parts • Parts • • 1 head of condyle F • • 2 neck of condyle F • • 3 disc B • • 4 retrodiscal tissue B • • 7 articular eminence F • • 8 zygomatic arch F • • 10 upper head of lateral pterygoid muscle F • • 11 lower head of lateral pterygoid muscle F • Holes • • 5 lower joint compartment B • • 6 upper joint compartment B
Temporomandibular Joint (TMJ) ANTERIOR from Thomas Bittner and Louis Goldberg, KR-MED 2006