310 likes | 465 Views
Representing Biological Processes: The Reactome Database Gopal Gopinathrao 1 & Peter D’Eustachio 1,2 1 Cold Spring Harbor Laboratory 2 NYU School of Medicine gopinath@cshl.edu deustp01@med.nyu.edu. Reactome is
E N D
Representing Biological Processes: The Reactome Database Gopal Gopinathrao1 & Peter D’Eustachio1,2 1Cold Spring Harbor Laboratory 2NYU School of Medicine gopinath@cshl.edu deustp01@med.nyu.edu
Reactome is • - reductionist. All of biology can be represented as events that convert input physical entities into output physical entities. • - a generic parts list. Tissue and state specificity of events are not captured. • - qualitative. Kinetic parameters and data are not captured. • - human-centric. Experiments can use reagents from diverse sources, but most biological processes take place in single species, and our focus is on human biological processes. • manually curated. Events are annotated by expert curators, and linked to published data. • open source. All data and software are freely downloadable and reusable.
Regulation Input 1 Output 1 Reaction Pathway Pathway Reaction Reaction Input 2 Output 2 CatalystActivity Data model in a nutshell
Annotating more details • post-translational modifications of proteins • exact locations of entities and events • Annotating more ambiguities • sets of entities - defined, open, and candidate • incompletely specified entities • “black box” reactions
A geometrical compartment set for locating molecules in human cells
The starry sky view of all of Reactome Nucleotidemetabolism Cell cycle& DNA replication Lipid metabolism Notchsignal-ing Carbohydratemetabolism Translation Amino acidmetabolism Apop-tosis DNArepair Transcription Hemo-stasis TCAcycle Posttransla-tional modifi-cations HIV & Influenza life cycles Sterol metab- olism Insulinsignal-ing Glucagonsignaling Xenobioticmetabolism
Reactome Home Page http://brie8.cshl.edu/cgi-bin/frontpage?DB=gk_central
Reactome Event Page http://brie8.cshl.edu/cgi-bin/eventbrowser?DB=gk_central&ID=163767&
Export Formats <owl:Ontology rdf:about=""> <owl:imports rdf:resource="http://www.biopax.org/release/biopax-level2.owl" /> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string">BioPAX pathway converted from "DNA Replication" in the Reactome database.</rdfs:comment> </owl:Ontology> <bp:pathway rdf:ID="DNA_Replication"> <bp:PATHWAY-COMPONENTS rdf:resource="#Regulation_of_DNA_replicationStep" /> <bp:PATHWAY-COMPONENTS rdf:resource="#DNA_strand_elongationStep" /> <bp:PATHWAY-COMPONENTS rdf:resource="#DNA_replication_initiationStep" /> <bp:PATHWAY-COMPONENTS rdf:resource="#Switching_of_origins_to_a_post_replicative_stateStep" /> <bp:PATHWAY-COMPONENTS rdf:resource="#DNA_Replication_Pre_InitiationStep" /> <bp:ORGANISM rdf:resource="#Homo_sapiens" /> <bp:NAME rdf:datatype="http://www.w3.org/2001/XMLSchema#string">DNA Replication</bp:NAME> <bp:SHORT-NAME rdf:datatype="http://www.w3.org/2001/XMLSchema#string">DNA Replication</bp:SHORT-NAME> <bp:XREF rdf:resource="#Reactome69306" /> <bp:XREF rdf:resource="#REACT_383.2" /> <bp:COMMENT rdf:datatype="http://www.w3.org/2001/XMLSchema#string">Studies in the past decade have suggested that the basic mechanism of DNA replication initiation is conserved in all kingdoms of life. Initiation in unicellular eukaryotes, in particular Saccharomyces cerevisiae (budding yeast), is well
Bioinformatics Access • BioMart API • MySQL/Perl API • MySQL/Java API • SOAP/WSDL Interface (multiple languages) • Flat files • Database dumps • Local site install (instructions going into CPBI)
direct curation underway Inference Statistics
Validation of inference • Comparison of manually curated yeast reactions from YBP with inferred reactions from human Reactome • Sensitivity: 72% • Specificity: 78%
Gaps in Reactome Gopal Gopinathrao, PhD Reactome, CSHL
1) Gaps in Reactome annotation 2) Gaps in annotate-able information 3) What a network / pathway ontology can do to fill this gap?
Information Cell ular P a thogens
Information Metabolism P a thogens
Signaling Signaling Information
Domains of Biology waiting to be Reactomized Protozoan/Host interactions Developmental pathways Transcriptional regulation Feedback loops Neuroscience topics Degenerative diseases Synaptic processes Cancer processes OMIM-functional (biochemical) Complex diseases Cellular differentiation, Regulation
Unique human proteins used in pathways (in March 2008) 2500 ~16,000 Swissprot section of UniProt Pathogens/ Host interactions 376 Cellular housekeeping 414 476 Metabolism 600 Information Signaling 755
Gaps in Reactome annotation 6000 5000 4000 total proteins 3000 2000 unique proteins unique + isoforms 1000 0 10 20 30 40 release
Mind what gets filled in… Are all Swissprot proteins annotatable for pathways/interactions? Can all interactions can be placed in a biologically relevant ‘pathway’ or even sub-graphs of a network? If yes, who is going to validate and how, the biological ‘truth’ of any subgraphs derived from a network? [Terms of biological truth - tissue, regulation, developmental stage, expression …]
Watching the gap… Adding in pathway data decomposed to interactions … Adding PPI data to the above …
How a network / pathway ontology may help to fill the gap in pathway annotations..
ABCD complex A+B+C+D Feedback loop C<----->D Novel A<----->B C<----->A C<----->B A<-----| B A<----->D Known New regulatory event
Updated model for curation would be: 1. A+B+C+D ABCD complex Feedback loop C<----->D Novel A<----->B A<-----| B New regulatory event Interaction of C and D may regulate ABCD complex formation Evidence from a network ontology 3. Post-translational inhibition of B by A may result in down regulation of A, there by affecting the stability of complex ABCD Evidence from a network ontology in a model organism
The Team • CSHL • Lincoln Stein (PI) • Gopal Gopinathrao (managing editor) • Marc Gillespie, Lisa Matthews, Bruce May, Mike Caudy (curators) • Guanming Wu, Alex Kanapin (developer) • EBI • Ewan Birney (coPI) • Esther Schmidt, Imre Vastrik, David Croft (developers) • Bernard de Bono, Bijay Jassal, Phani Garapati (curators) • NYU • Peter D’Eustachio (co-PI; editor-in-chief) • Shahana Mahajan (curator) P41 HG003751