MolIDE2: Homology Modeling Of Protein Oligomers And Complexes

MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia, PA 19111

Agenda • Background • MolIDE in retrospect • MolIDE2 • Demonstration • Discussion Dunbrack Lab, FCCC - NIGMS Workshop 2009

MQEQLTDFSKVETNLISW-QGSLETVEQMEPWAGSDANSQTEAY | |..|. ||| ... |..||.|.| | |||..| MHQQVSDYAKVEHQWLYRVAGTIETLDNMSPANHSDAQTQAA |= Identity .= positively related Background Homology Modelling • What is it and why do we need it? Given a protein without known structure, predict its 3D structure based on its sequence: • Search structure databases for homologous sequences • Transfer coordinates of known protein onto unknown Dunbrack Lab, FCCC - NIGMS Workshop 2009

Background An inconvenient truth: huge gap between known structures and known sequences. • Experimentally determined structures (through x-ray crystallography & NMR spectroscopy.) As of 2/24/2009, PDB has 56,066 entries (< 52K protein structures) • Decoded protein sequences • As of 2/10/2009, UniProtKB/Swiss-Prot (Release 56.8 ) contains 410,518 sequence entries. • As of 2/10/2009, UniProtKB/TrEMBL(Release 39.8 ) contains 7,157,600 sequence entries Dunbrack Lab, FCCC - NIGMS Workshop 2009

Background Homology modeling • Various methods • Swiss-model – fully automated modeling server • Modeller – satisfaction of spatial restraints • Nest – based on artificial evolution • Similar steps • Template identification • Sequence alignment • Backbone generation • Side-chain prediction & loop modeling • Structure refinement Dunbrack Lab, FCCC - NIGMS Workshop 2009

In Retrospect MolIDE: A graphical IDE for homology modeling A. Canutescu and R.L. Dunbrack, Bioinformatics, 2005 Dunbrack Lab, FCCC - NIGMS Workshop 2009

In Retrospect MolIDE: open-source, cross-platform • sequence search • multiple-round psiblast alignments • secondary structure prediction • assisted alignment editing (joint with a template viewer) • side chain conformation prediction • loop building Dunbrack Lab, FCCC - NIGMS Workshop 2009

In Retrospect MolIDE1.6 (released on July 1, 2008) • MolIDE automatically downloads these large sequence databases (nr or uniref100) and formats them for use with BLAST. • Easy installer for Windows version • One-step updating of PDB and Non-redundant protein sequence databases • PSI-BLAST search of non-redundant database can be opened as a sortable table for browsing homologues of query • List of templates from PDB includes protein names and species • Works with current remediated XML files from the PDB • NCBI's non-redundant protein database (nr) can be replaced with Uniprot's Uniref100 database: Wang et al. Nature Protocols, Dec., 2008 Dunbrack Lab, FCCC - NIGMS Workshop 2009

In Retrospect Previous MolIDE CAN NOT deal with protein oligomers and protein complexes Some examples: • Structure with ligands • Multi-chain protein complex • Modeling of biological unit • Modeling with multiple templates • Structural or functional restraints in modeling • … Dunbrack Lab, FCCC - NIGMS Workshop 2009

Goal: Develop a new homology modeling program capable of modeling protein oligomers and protein complexes. Challenge: • Identify an appropriate template; • One-to-many sequence alignment; • Better understanding of Biological Unit (BU). MolIDE2 Dunbrack Lab, FCCC - NIGMS Workshop 2009

MolIDE2 Key features: • Able to model protein oligomers and complexes; • Modeling process based on Biological Unit (BU). • Identifying structural template based on domain and family information; • Integrated database providing protein structural and sequence information; • Better organization and representation of template information; • User-friendly graphical interface for selecting template; • Integration of profile-profile sequence alignment method; • Improved graphical editing of sequence alignment; Dunbrack Lab, FCCC - NIGMS Workshop 2009

MolIDE2 Screenshot Dunbrack Lab, FCCC - NIGMS Workshop 2009

MolIDE2 Operation flowchart Dunbrack Lab, FCCC - NIGMS Workshop 2009

Demonstration A typical modeling process: • Open sequence file • Run hmmpfam (generate domain file) • 3. Run psiblast (generate query profile) • 4. Run psipred (predict secondary structure of query sequence) • Open domain file • Search PDB for potential template; get profile-profile alignment result • Open alignment file • Manually modify sequence alignment (optional) • Copy backbone structure • Run SCWRL for sidechain replacement. • Build loops (not implemented yet). Dunbrack Lab, FCCC - NIGMS Workshop 2009

Demonstration Sequences and domains Dunbrack Lab, FCCC - NIGMS Workshop 2009

Demonstration Finding template (1) Dunbrack Lab, FCCC - NIGMS Workshop 2009

Demonstration Finding template (2) Dunbrack Lab, FCCC - NIGMS Workshop 2009

Demonstration Editing sequence alignment & generating model Dunbrack Lab, FCCC - NIGMS Workshop 2009

Demonstration Editing sequence alignment & generating model (cont’d) Dunbrack Lab, FCCC - NIGMS Workshop 2009

Discussion Some future work • Loop modeling component dealing with space symmetry • Involvement of protein-protein interaction information (ProtBuD) • Modeling with multiple templates • modeling of ligands • refinement of models with Rosetta and RosettaDock • … Dunbrack Lab, FCCC - NIGMS Workshop 2009

Acknowledge Dr. Adrian Canutescu, Dr. Mark Andrake NIH R01 GM84453 NIH R01 GM73784 Dunbrack Lab, FCCC - NIGMS Workshop 2009

Q & A Thanks Dunbrack Lab, FCCC - NIGMS Workshop 2009

MolIDE2: Homology Modeling Of Protein Oligomers And Complexes

MolIDE2: Homology Modeling Of Protein Oligomers And Complexes

Presentation Transcript

Protein Homology Modelling

Homology Modeling via Protein Threading

Homology Modeling Workshop

Homology modeling

Homology Modeling

Protein Networks / Protein Complexes

Predicting 3D Protein Structure using Homology Modeling

Identification of Macromolecular Assemblies and Protein-Protein Complexes

Content Protein fold and structure Homology modeling Protein-protein docking

Protein Networks / Protein Complexes

Applications of Homology Modeling

Applications of Homology Modeling

6. Homology Modeling

Protein structure and homology modeling

Homology modeling of G protein-coupled receptors

Homology Modeling

Homology Modeling

A Definition of Interfaces for Protein Oligomers

Homology Modeling

Protein Homology Modelling