410 likes | 539 Views
23.09.2014. Master title. Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI. 5. EBI is an Outstation of the European Molecular Biology Laboratory. Living cells contain crowded and diverse molecular environments.
E N D
23.09.2014 Master title • Molecular Interactions – the IntAct Database • Sandra Orchard • EMBL-EBI 5 EBI is an Outstation of the European Molecular Biology Laboratory.
Living cells contain crowded and diverse molecular environments • Proteins constitute ~30% of E. coli and ~5% of yeast cytoplasm by weight • ~2000 protein types are • co-expressed • co-localized in yeast cytoplasm
Example of a PPI Network >80% of proteins are all connected in one giant cluster of PPI network Small-world effect median network distance – 6 steps • Nodes – proteins • Edges – interactions
Why is it useful to study PPI networks? • Proteins are the workhorses of cell, carrying out catalytic reactions, transport, forming viral caspids, transmitting information from DNA to RNA, traverse membranes, forming regulated channel, make possible synthesis of new proteins, responsible for degradation of unnecessary proteins, vehicles of immune response • One way to predict protein function is through identification of binding partners – Guilt by Association • If the function of at least one of the components with which the protein interacts is known, that should let us assign its function(s) and the pathway(s) • Hence, through the intricate network of these interactions we can map cellular pathways, their interconnectivities and their dynamic regulation
Why is it useful to study the structure of PPI networks? • Common properties of biological networks • Can help us relate network structure to biological function • Protein’s relative position in a network • Correlate conserved functional modules with protein complexes
IntAct goals & achievements • Publicly available repository of molecular interactions (mainly PPIs) - >285K binary interactions taken from >5,100 publications (November 2011) • Data is standards-compliant and available via our website, for download at our ftp site or via PSICQUIC • Provide open-access versions of the software to allow installation of local IntAct nodes. http://www.ebi.ac.uk/intact ftp://ftp.ebi.ac.uk/pub/databases/intact www.ebi.ac.uk/Tools/webservices/psicquic/view/main.xhtml
Sanity Checks(nightly) reject Public web site . exp annotate accept FTP site p2 I p1 IMEx check CVs Curation manual Mint DIP MatrixDB report report Super curator curator IntAct Curation “Lifecycle of an Interaction” Publication (full text) Master headline
UniProt Knowledge Base Interactions can be mapped to the canonical sequence… .. to splice variants.. http://www.ebi.uniprot.org/ .. or to post-processed chains
Relationship with UniProtKB Other IMEx databases Interaction curation Protein sequence Data filters In place Other DBs Early 2012 High confidence PPIs Master headline
Interacting domains Overlay of Ranges on sequence: Data model • Support for detailed featuresi.e. definition of interacting interface
How to deal with Complexes • Some experimental protocol do generate complex data: • Eg. Tandem affinity purification (TAP) • One may want to convert these complexes into sets of binary interactions, 2 algorithms are available:
PSI-MI XML Standard format PSI-MITAB XML Java API Data format Tools MITAB Java API XMLMakerFlattener Semantic Validator RPsiXML (Bioconductor) Control vocabulary PSI-MI CV Reporting guideline MIMIx Data submission PSI-MI XML files PSI-MI PSI Excel Sheet Tools PSI Web Form Servers Data distribution PSICQUIC Registry Clients Servers PSISCORE Registry Clients
Performing and visualing a Simple Search Data, Standards and Tools EBI Walthrough May 2009 EBI
IntAct – Home Page http://www.ebi.ac.uk/intact
Visualizing - networkView From search to networkView…
Cytoscape Web Cytoscape Web - web-based network visualization tool Modeled after Cytoscape – open-source, interactive, customizable and easily integrated into web sites. Contains none of the plugin architecture functionality of Cytoscape
Visualization Opening the network in Cytoscape… Master headline
Visualization Applying a better graph layout… Master headline
Visualization Applying a better graph layout… Master headline
Visualization Highlighting network properties… Master headline
Visualization Highlighting network properties… Master headline
Visualization Highlighting network properties… Master headline
Visualization Highlighting network properties… Master headline
Interaction detail First search from the home page… Choice of UniProtKB or Dasty View Details of interaction
Detail of interaction UniProt Taxonomy Expansion method Details of interaction PubMed
Participant information Search result for ‘RAD1’
Interaction detail First search from the home page… Details of interaction
Viewing Interaction Details Additional information
Advanced search: Fields Filtering options Add more filtering options
Searching with MIQL • Using the Molecular Interaction Query Language (MIQL), one can also build complex queries • List of terms one can query on : First search from the home page…
Browsing – Molecule View Binary view of o60671_human
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?