MIAMExpress and the development of annotation ontologies for gene expression experiments Ele Holloway Microarray Informatics European Bioinformatics Institute Microarrays and Data Mining 10th-11th December 2002
Outline • Capturing information • Ontologies • MIAMExpress
Capturing information
• Lab book: only useful for the individual. Need information understandable by all.
• Annotate in a controlled way. Allows easy retrieval.
• Submit information to a database / LIMS. Available to other researchers.
What is an ontology? • A kind of controlled vocabulary (CV) expressed in a structured way.
Components of an ontology
• Class: a container for information. Has a definition and a relationship to other classes (is-a, part-of, kind-of), e.g. an exon is part of a gene.
• Instance: a term that is contained within a class.
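The class and instance components above can be sketched in a few lines of Python. This is only an illustration: the `OntologyClass` type, its field names and the placeholder definitions are assumptions made for this sketch, not part of any ontology named in the talk.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class OntologyClass:
    """A class: a named container with a definition, relationships and instances."""
    name: str
    definition: str
    # Relationship type (e.g. "is-a", "part-of") mapped to names of related classes.
    relations: Dict[str, List[str]] = field(default_factory=dict)
    # Instances are the terms contained within the class.
    instances: List[str] = field(default_factory=list)

    def relate(self, relation: str, other: "OntologyClass") -> None:
        """Record that this class stands in `relation` to `other`."""
        self.relations.setdefault(relation, []).append(other.name)


# Placeholder definitions, written for this sketch only.
gene = OntologyClass("gene", "placeholder definition of a gene")
exon = OntologyClass("exon", "placeholder definition of an exon")

# The example from the slide: an exon is part of a gene.
exon.relate("part-of", gene)
```

Storing relationships by type keeps is-a and part-of links separate, so a query can follow one kind of link without the other.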
An ontology – what can it do? • Captures knowledge • Shared understanding • Structure enriches CV • Computer ‘readable’
Why do we need an ontology for the database? • To help users annotate their data usefully and easily • To perform structured queries • To accurately compare data • To avoid problems with free text searching • To avoid excessive curation workload in future
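A small sketch of why free-text searching causes problems and how a controlled vocabulary supports structured queries. The vocabulary, its variant spellings and the sample annotations below are hypothetical, chosen only to illustrate the point.

```python
# Hypothetical controlled vocabulary: each accepted term with free-text
# variants that a submitter might type instead.
SPECIES_CV = {
    "Mus musculus": {"mouse", "mice", "m. musculus"},
    "Homo sapiens": {"human", "h. sapiens"},
}


def normalise(free_text):
    """Return the controlled term for a free-text value, or None if unknown."""
    key = free_text.strip().lower()
    for term, variants in SPECIES_CV.items():
        if key == term.lower() or key in variants:
            return term
    return None  # unknown values would need a curator's attention


# Hypothetical sample annotations as typed by different submitters.
samples = {"s1": "Mouse", "s2": "mus musculus", "s3": "M. musculus", "s4": "human"}

# Free-text search: an exact match on "Mus musculus" misses every variant.
free_text_hits = [s for s, v in samples.items() if v == "Mus musculus"]

# Structured query: compare controlled terms instead of raw strings.
cv_hits = [s for s, v in samples.items() if normalise(v) == "Mus musculus"]
```

Here the free-text query finds no samples, while the query on controlled terms finds all three mouse samples. Annotating with the controlled term at submission time removes the need for the variant table altogether.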
[Diagram. Labels: Free text; Natural language processing; Controlled vocabulary; Annotation; Database; Data mining]
Standards and Ontologies for Functional Genomics, 17th-20th November 2002, Hinxton. Aim: to bring together scientists (biologists and bioinformaticians) developing standards and ontologies. http://www.ebi.ac.uk/SOFG
Examples of ontologies and CVs • NCBI Taxonomy - All organisms represented in the genetic databases • GO – Gene Ontology • EMAP – Edinburgh Mouse Atlas Project • FlyBase – Drosophila genome database • MGED Ontology – For describing samples used in microarray experiments
Infrastructure [Diagram. Labels: ArrayExpress (Oracle); MIAMExpress; local MIAMExpress installations; EBI; array manufacturers; LIMS; submissions and queries via www; MAGE-ML data pipelines; MAGE-ML import/export; data analysis with Expression Profiler; data analysis software; microarray software; external bioinformatics databases; other microarray databases]
MIAME requirements • Experimental design • Array design • Samples • Hybridizations • Measurements • Normalization controls Nature Genetics 29(4): 365-371
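The six MIAME parts listed above can be treated as a checklist. The following is a minimal sketch, assuming a submission is represented as a plain dictionary keyed by section name. That representation and the `missing_sections` helper are assumptions for illustration, not how MIAMExpress stores data.

```python
# The six sections named on the slide.
MIAME_SECTIONS = [
    "Experimental design",
    "Array design",
    "Samples",
    "Hybridizations",
    "Measurements",
    "Normalization controls",
]


def missing_sections(submission):
    """Return the MIAME sections that are absent or empty in a submission."""
    return [name for name in MIAME_SECTIONS if not submission.get(name)]


# Hypothetical, partly completed submission.
draft = {
    "Experimental design": "time course",
    "Array design": "A-EXAMPLE-1",  # made-up identifier
    "Samples": ["sample 1", "sample 2"],
    "Hybridizations": [],  # present but still empty
}

todo = missing_sections(draft)
```

For this draft, `todo` lists Hybridizations, Measurements and Normalization controls, which is the kind of feedback a submission tool could show before accepting an experiment.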
External links [Diagram. 6 parts of a microarray experiment: Experiment, Sample, Hybridization, Array, Normalization, Data. External resources: MGED, MEDLINE, EMBL, NCBI taxonomy, GO, CAS/Merck, Genew, EMAP. Linked fields: experiment details, publication details, gene acc. no., species, chemical compd., gene name, mouse stage]
MGED Ontology • Community effort - MGED Society • Supports efforts of MAGE • Describes the parts of a microarray experiment • References out to external ontologies
MGED Ontology • Structured in DAML+OIL using OilEd 3.4
MIAMExpress • Submission and annotation tool • Based on MIAME concepts • Array, Experiment and Protocol submissions • Perl-CGI, MySQL database
Submission process
• Login, New/Pending, Experiment
• Samples 1-4 (sample protocol)
• Extracts 1…n per sample: E1, E2, … En (extraction protocol)
• Labeled extracts 1…n per extract: LE (labeling protocol)
• Hybridizations on Array1, Array2, Array3, … Arrayn (hybridization protocol)
• Data1, Data2, Data3, … Datan (scanning protocol, image analysis protocol)
• Combined experiment data (transformation protocol)
• Submit
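The submission process is a chain: samples give extracts, extracts give labeled extracts, and labeled extracts are hybridized to arrays that yield data. A rough Python sketch of that chain follows. The class and field names are assumptions for illustration and do not reflect the actual MIAMExpress (Perl-CGI, MySQL) schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class LabeledExtract:
    name: str
    labeling_protocol: str


@dataclass
class Extract:
    name: str
    extraction_protocol: str
    labeled_extracts: List[LabeledExtract] = field(default_factory=list)


@dataclass
class Sample:
    name: str
    sample_protocol: str
    extracts: List[Extract] = field(default_factory=list)


@dataclass
class Hybridization:
    array: str
    hybridization_protocol: str
    labeled_extracts: List[LabeledExtract]
    data_file: str = ""  # filled in after scanning and image analysis


def labeled_extracts_of(sample):
    """Collect every labeled extract derived from one sample."""
    return [le for extract in sample.extracts for le in extract.labeled_extracts]


# Hypothetical example: one sample, one extract, two labeled extracts.
le1 = LabeledExtract("LE1", "labeling protocol A")
le2 = LabeledExtract("LE2", "labeling protocol A")
sample1 = Sample("Sample 1", "sample protocol A",
                 [Extract("E1", "extraction protocol A", [le1, le2])])

hyb = Hybridization("Array1", "hybridization protocol A",
                    labeled_extracts_of(sample1), data_file="Data1")
```

Because each level holds a list of the next, the 1…n fan-out in the diagram maps directly onto nested lists, and a hybridization can combine labeled extracts from several samples.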
Tour of MIAMExpress • Login and password • Multi-user environment • Control over data access • http://www.ebi.ac.uk/miamexpress
[Diagram built up over four slides: Login, New/Pending, Experiment; then Samples 1-4; then Extracts 1…n (E1, E2, … En) per sample; then labeled extracts 1…n (LE) per extract; then Hybridizations on Array1, Array2, Array3, … Arrayn with Data1, Data2, Data3, … Datan]
Submission successful • Curation • Export of MAGE-ML • Loading to ArrayExpress
MGED Ontology class hierarchy: MGED Ontology > BiomaterialDescription > Sex > Gender
documentation: Subclass of sex applicable to heterogametic species (i.e., those in which the sexes produce gametes of markedly different size). Males produce small numerous gametes. Females produce small numbers of large gametes. Hermaphrodites are individuals with both male and female characteristics. Mixed refers to a population of individuals with more than one type of gender.
used in individuals: female, hermaphrodite, male, mixed_sex, unknown_sex
• Ontology instances propagated to submission/annotation web forms
• User defined terms collected via forms
• Curation of user defined terms, before inclusion in the ontology
• MAGE-ML data exchange: MIAMExpress, ArrayExpress, RAD
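The loop described on this slide (ontology instances feed the web forms, user-defined terms are collected, curators decide what enters the ontology) can be sketched as follows. The function names, the "other" form option and the example term are hypothetical. Only the five Gender instances come from the slide.

```python
# Instances of the Gender class as listed on the slide.
gender_instances = ["female", "hermaphrodite", "male", "mixed_sex", "unknown_sex"]

# User-defined terms waiting for a curator's decision.
pending_curation = []


def form_options(instances):
    """Options for a web-form drop-down: ontology instances plus free entry."""
    return sorted(instances) + ["other (please specify)"]


def record_choice(value, instances, pending):
    """Accept an ontology term as-is; queue anything else for curation."""
    if value not in instances and value not in pending:
        pending.append(value)
    return value


def curate(term, approved, instances, pending):
    """Curator decision: approved terms join the ontology, others are dropped."""
    pending.remove(term)
    if approved:
        instances.append(term)


record_choice("male", gender_instances, pending_curation)
record_choice("example_user_term", gender_instances, pending_curation)  # made-up term
queued = list(pending_curation)

curate("example_user_term", True, gender_instances, pending_curation)
```

After curation the approved term appears in `form_options(gender_instances)`, so later submitters can pick it from the list instead of typing it again.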
Resources
• Microarray Informatics Group http://www.ebi.ac.uk/microarray/
• MIAMExpress http://www.ebi.ac.uk/miamexpress/
• MGED Ontology Working Group http://mged.sourceforge.net/ontologies/
• Sourceforge http://sourceforge.net/
Acknowledgements
• ArrayExpress: Ugis Sarkans, Gonzalo Garcia, Ahmet Oezcimen, Anjan Sharma, Alvis Brazma
• Curation: Helen Parkinson, Gaurab Mukherjee, Philippe Rocca-Serra, Susanna Sansone
• MIAMExpress: Mohammad Shojatalab, Niran Abeygunawardena, Sergio Contrino
• MGED Ontology: Chris Stoeckert (U. Penn)
• GO http://www.geneontology.org
• EMAP http://genex.hgu.mrc.ac.uk/
• FlyBase http://flybase.bio.indiana.edu/
• NCBI Taxonomy http://www.ncbi.nlm.nih.gov/Taxonomy/taxonomyhome.html/