220 likes | 380 Views
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes. Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia, PA 19111. Agenda. Background MolIDE in retrospect MolIDE2 Demonstration Discussion.
E N D
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia, PA 19111
Agenda • Background • MolIDE in retrospect • MolIDE2 • Demonstration • Discussion Dunbrack Lab, FCCC - NIGMS Workshop 2009
MQEQLTDFSKVETNLISW-QGSLETVEQMEPWAGSDANSQTEAY | |..|. ||| ... |..||.|.| | |||..| MHQQVSDYAKVEHQWLYRVAGTIETLDNMSPANHSDAQTQAA |= Identity .= positively related Background Homology Modelling • What is it and why do we need it? Given a protein without known structure, predict its 3D structure based on its sequence: • Search structure databases for homologous sequences • Transfer coordinates of known protein onto unknown Dunbrack Lab, FCCC - NIGMS Workshop 2009
Background An inconvenient truth: huge gap between known structures and known sequences. • Experimentally determined structures (through x-ray crystallography & NMR spectroscopy.) As of 2/24/2009, PDB has 56,066 entries (< 52K protein structures) • Decoded protein sequences • As of 2/10/2009, UniProtKB/Swiss-Prot (Release 56.8 ) contains 410,518 sequence entries. • As of 2/10/2009, UniProtKB/TrEMBL(Release 39.8 ) contains 7,157,600 sequence entries Dunbrack Lab, FCCC - NIGMS Workshop 2009
Background Homology modeling • Various methods • Swiss-model – fully automated modeling server • Modeller – satisfaction of spatial restraints • Nest – based on artificial evolution • Similar steps • Template identification • Sequence alignment • Backbone generation • Side-chain prediction & loop modeling • Structure refinement Dunbrack Lab, FCCC - NIGMS Workshop 2009
In Retrospect MolIDE: A graphical IDE for homology modeling A. Canutescu and R.L. Dunbrack, Bioinformatics, 2005 Dunbrack Lab, FCCC - NIGMS Workshop 2009
In Retrospect MolIDE: open-source, cross-platform • sequence search • multiple-round psiblast alignments • secondary structure prediction • assisted alignment editing (joint with a template viewer) • side chain conformation prediction • loop building Dunbrack Lab, FCCC - NIGMS Workshop 2009
In Retrospect MolIDE1.6 (released on July 1, 2008) • MolIDE automatically downloads these large sequence databases (nr or uniref100) and formats them for use with BLAST. • Easy installer for Windows version • One-step updating of PDB and Non-redundant protein sequence databases • PSI-BLAST search of non-redundant database can be opened as a sortable table for browsing homologues of query • List of templates from PDB includes protein names and species • Works with current remediated XML files from the PDB • NCBI's non-redundant protein database (nr) can be replaced with Uniprot's Uniref100 database: Wang et al. Nature Protocols, Dec., 2008 Dunbrack Lab, FCCC - NIGMS Workshop 2009
In Retrospect Previous MolIDE CAN NOT deal with protein oligomers and protein complexes Some examples: • Structure with ligands • Multi-chain protein complex • Modeling of biological unit • Modeling with multiple templates • Structural or functional restraints in modeling • … Dunbrack Lab, FCCC - NIGMS Workshop 2009
Goal: Develop a new homology modeling program capable of modeling protein oligomers and protein complexes. Challenge: • Identify an appropriate template; • One-to-many sequence alignment; • Better understanding of Biological Unit (BU). MolIDE2 Dunbrack Lab, FCCC - NIGMS Workshop 2009
MolIDE2 Key features: • Able to model protein oligomers and complexes; • Modeling process based on Biological Unit (BU). • Identifying structural template based on domain and family information; • Integrated database providing protein structural and sequence information; • Better organization and representation of template information; • User-friendly graphical interface for selecting template; • Integration of profile-profile sequence alignment method; • Improved graphical editing of sequence alignment; Dunbrack Lab, FCCC - NIGMS Workshop 2009
MolIDE2 Screenshot Dunbrack Lab, FCCC - NIGMS Workshop 2009
MolIDE2 Operation flowchart Dunbrack Lab, FCCC - NIGMS Workshop 2009
Demonstration A typical modeling process: • Open sequence file • Run hmmpfam (generate domain file) • 3. Run psiblast (generate query profile) • 4. Run psipred (predict secondary structure of query sequence) • Open domain file • Search PDB for potential template; get profile-profile alignment result • Open alignment file • Manually modify sequence alignment (optional) • Copy backbone structure • Run SCWRL for sidechain replacement. • Build loops (not implemented yet). Dunbrack Lab, FCCC - NIGMS Workshop 2009
Demonstration Sequences and domains Dunbrack Lab, FCCC - NIGMS Workshop 2009
Demonstration Finding template (1) Dunbrack Lab, FCCC - NIGMS Workshop 2009
Demonstration Finding template (2) Dunbrack Lab, FCCC - NIGMS Workshop 2009
Demonstration Editing sequence alignment & generating model Dunbrack Lab, FCCC - NIGMS Workshop 2009
Demonstration Editing sequence alignment & generating model (cont’d) Dunbrack Lab, FCCC - NIGMS Workshop 2009
Discussion Some future work • Loop modeling component dealing with space symmetry • Involvement of protein-protein interaction information (ProtBuD) • Modeling with multiple templates • modeling of ligands • refinement of models with Rosetta and RosettaDock • … Dunbrack Lab, FCCC - NIGMS Workshop 2009
Acknowledge Dr. Adrian Canutescu, Dr. Mark Andrake NIH R01 GM84453 NIH R01 GM73784 Dunbrack Lab, FCCC - NIGMS Workshop 2009
Q & A Thanks Dunbrack Lab, FCCC - NIGMS Workshop 2009