720 likes | 739 Views
Discover how the Gene Ontology (GO) project helps analyze gene expression related to mesoderm development. Learn how to leverage GO annotations for microarray data analysis and understand the hierarchical structure of GO terms. Find genes involved in apoptosis, growth control, and more with the GO browser.
E N D
Gene Ontology (GO) Project http://www.geneontology.org/ Jane Lomax
There is a lot of biological research output
You’re interested in which genes control inmesoderm development…
You get 6752 results! How will you ever find what you want?
time Defense response Immune response Response to stimulus Toll regulated genes JAK-STAT regulated genes Puparial adhesion Molting cycle hemocyanin Amino acid catabolism Lipid metobolism Peptidase activity Protein catabloism Immune response Immune response Toll regulated genes control attacked Microarray data shows changed expression of thousands of genes. How will you spot the patterns? Bregje Wertheim at the Centre for Evolutionary Genomics, Department of Biology, UCL and Eugene Schuster Group, EBI.
Scientists work hard
There are lots of papers to read http://www.teamtechnology.co.uk/f-scientist.jpg
More papers… http://www.teamtechnology.co.uk/f-scientist.jpg
more and more and more… http://www.teamtechnology.co.uk/f-scientist.jpg
Help! more and more and more! http://www.teamtechnology.co.uk/f-scientist.jpg
The Gene Ontology provides a way to capture and represent biological all this knowledge in a computable form
Search on ‘mesoderm development’ mesoderm development
Definition of mesoderm development Gene products involved in mesoderm development
Microarray process: • Treat samples • Collect mRNA • Label • Hybridize • Scan • Normalize • Select differentially regulated genes • Understand the biological phenomena involved
Traditional analysis • gene by gene basis • requires literature searching • time-consuming
Gene 1 Apoptosis Cell-cell signaling Protein phosphorylation Mitosis … Gene 2 Growth control Mitosis Oncogenesis Protein phosphorylation … Gene 3 Growth control Mitosis Oncogenesis Protein phosphorylation … Gene 4 Nervous system Pregnancy Oncogenesis Mitosis … Gene 100 Positive ctrl. of cell prolif Mitosis Oncogenesis Glucose transport … Traditional analysis
GO:0006915 : apoptosis Using GO annotations • But by using GO annotations, this work has already been done
Grouping by process Mitosis Gene 2 Gene 5 Gene45 Gene 7 Gene 35 … Glucose transport Gene 7 Gene 3 Gene 6 … Apoptosis Gene 1 Gene 53 Positive ctrl. of cell prolif. Gene 7 Gene 3 Gene 12 … Growth Gene 5 Gene 2 Gene 6 …
GO for microarray analysis • Annotations give ‘function’ label to genes • Ask meaningful questions of microarray data e.g. • genes involved in the same process, same/different expression patterns?
How does the Gene Ontology work?
GO structure • GO isn’t just a flat list of biological terms • terms are related within a hierarchy
gene A GO structure
GO structure • This means genes can be grouped according to user-defined levels • Allows broad overview of gene set or genome
How does GO work? What information might we want to capture about a gene product?
How does GO work? What information might we want to capture about a gene product? • What does the gene product do?
How does GO work? What information might we want to capture about a gene product? • What does the gene product do? • Where and when does it act?
How does GO work? What information might we want to capture about a gene product? • What does the gene product do? • Where and when does it act? • Why does it perform these activities?
GO structure • GO terms divided into three parts: • cellular component • molecular function • biological process
Cellular Component • where a gene product acts
Cellular Component • Enzyme complexes in the component ontology refer to places, not activities.
Molecular Function • activities or “jobs” of a gene product glucose-6-phosphate isomerase activity
Molecular Function insulin binding insulin receptor activity
Molecular Function drug transporter activity
Molecular Function • A gene product may have several functions; a function term refers to a reaction or activity, not a gene product • Sets of functions make up a biological process
cell division Biological Process a commonly recognized series of events
Biological Process transcription
Biological Process regulation of gluconeogenesis
Biological Process limb development
Biological Process courtship behavior
Ontology Structure • Terms are linked by two relationships • is-a • part-of
cell membrane chloroplast mitochondrial chloroplast membrane membrane is-a part-of Ontology Structure
Ontology Structure • Ontologies are structured as a hierarchical directed acyclic graph (DAG) • Terms can have more than one parent and zero, one or more children
Ontology Structure cell membrane chloroplast mitochondrial chloroplast membrane membrane Directed Acyclic Graph (DAG) - multiple parentage allowed
Anatomy of a GO term unique GO ID id: GO:0006094 name: gluconeogenesis namespace: process def: The formation of glucose from noncarbohydrate precursors, such as pyruvate, amino acids and glycerol. [http://cancerweb.ncl.ac.uk/omd/index.html] exact_synonym: glucose biosynthesis xref_analog: MetaCyc:GLUCONEO-PWY is_a: GO:0006006 is_a: GO:0006092 term name ontology definition synonym database ref parentage