180 likes | 335 Views
Protein Homologue Clustering and Molecular Modeling. L. Wang. Background. Project done in Mt. Sinai School of Medicine, New York City. Lab of Dr. Roberto Sanchez. A major aim of this project is to answering: how accurate are the protein structure models from Comparative Modeling?.
E N D
Background • Project done in Mt. Sinai School of Medicine, New York City. • Lab of Dr. Roberto Sanchez. • A major aim of this project is to answering: how accurate are the protein structure models from Comparative Modeling?
Protein Structure and Function • Protein function is largely decided by its 3D structure • 3D structure is from 1D sequence folding • Protein structure can be determined by experimental methods, X-ray, NMR • Or predicted by modeling
What is Comparative Modeling • Many protein structures are similar • Proteins sharing at least 30% sequence identity are generally similar to each other, and considered as homologues • Building target protein structure model based on a template protein structure already known
How to Comparative Modeling • Find template • Get Alignment of target and template • Based on 3-D structure of template protein • Build model structure of target protein
Accuracy of the Comparative Model • The structural difference between the target and templates • The alignment between target and templates
Effect of Genome Project • Generated a lot of protein that we know the sequence but not the structure (need) • Generated more experimental structures (template) • By programs like psi-blast, more remote homologues can be determined (coverage) • Larger database helps more accurate alignment (accuracy)
Modeller, an implementation of Comparative Modeling • Use scripts to control its behavior • Can perform a lot of tasks from aligning, to model building • Modeller can be easily automated.
Project Overview • Building protein homologues using single linkage clustering algorithm • Automatically align target and template and building 3-D protein models • Comparing the models with the experimental structure
Building Protein Homologue Cluster • From PDB, run Blast • Build table • Get first cluster • Find representatives • Running PsiBlast on representatives • Build second table • Merge groups and get final cluster
PDB Chains Unique Chains Blast Cluster (90% coverage,e-value<10-4., single linkage (1)) Find Representatives Psi-blast Re-cluster & Merge Groups (1) Refer to Duba, R.O., et al’s “Pattern Classification”, P566. Final Cluster Nice Group Keep the Representative Find Representative Pickup X-ray Data (<=2.5Å)
Homologue cluster Using the good X-ray structures Building models (pair-wise align. & 3-D align.) Comparing the models with the experimental structures
Error source:Alignment, plus structure difference between target and template