490 likes | 506 Views
Supervised By: Dr.Habib Bokhari (Ass.Professor) Presented By: Shazia Sabir (FA04-BSB-005). Introduction of GFP. Strategy used for Modeling of HriGFP and HriCFP. Tour of Database of GFP. Tour of web-Interface of developed Database & developed tool. Introduction of GFP.
E N D
Supervised By: Dr.Habib Bokhari(Ass.Professor) Presented By: Shazia Sabir (FA04-BSB-005)
Introduction of GFP. • Strategy used for Modeling of HriGFP and HriCFP. • Tour of Database of GFP. • Tour of web-Interface of developed Database & developed tool.
Introduction of GFP • Existed for more than one hundred and sixty million years. • Found in one species of jellyfish, Aequorea victoria (monomeric) • First time clone in 1992. • Fluorescent GFP has been expressed in bacteria, yeast, slime mold, plants, , drosophila, zebrafish, and in mmmalian cells. • Since 1999, numerous GFP homologues have been discovered in Anthozoa, Hydrozo species • GFP as the microscope of the twenty-first century.
GFP found in Aqurea Victoria is comprised of 238 amino acids with mol.wt of w25 kDa. • Its wild-type absorbance/ excitation peak is at 395 nm with a minor peak at 475 nm. • The emission peak is at 508 nm. • Analysis of a hexapeptide derived by proteolysis of purified GFP led to the prediction that the fluorophore originates from an internal Ser-Tyr-Gly sequence.
Florophore Ref: Fan Yang, Larry G. Moss, and George N. Phillips, Jr. The Molecular Structure of Green Fluorescent Protein
Applications • Monitoring of gen expression • The gene encoding a FP is cloned under the control of the target promoter, whereby activity of the promoter can be monitored by the magnitude of the fluorescent signal • split FP->the fluorescent signal occurs only when both promoters are active. Ref: Dmitriy M. Chudakov, Sergey Lukyanov and Konstantin A. Lukyanov, Fluorescent proteins as a toolkit for in vivo imaging
HriGFP& HriCFP • Recently discovered proteins in Hydnophora rigida with emission maxima of 527nm & 495nm. • Have high similarity with each other while have very low similarity with related members. • Deletion of only one nucleotide in HriGFP at 446th position causes the shift in emission from green to cyan.
Goals of Project • Structural Modeling of HriGFP and HriCFP. • Development of Database of Green Flourescent Proteins. • Development of web-Interface of developed Database. • Development of Tool for prediction of Fluorophore in the protein sequence.
Introduction of Homology Modeling • Predicts the three-dimensional structure of a given protein sequence (TARGET) based on an alignment to one or more known protein structures (TEMPLATES) • Homology models are of great interest for planning and analyzing biological molecules when no experimental three dimensional structure is available.
Continue…. • The number of structurally characterized proteins (20,000) is small compared with the number of known protein sequences (1,000,000).(RefEswar et all, 2003) • Protein structures are more conserved than protein sequences, detectable levels of sequence similarity usually imply significant structural similarity
Step I & II i) Three regimes of the sequence-structure relationship • The easily detected relationships, characterized by >30% sequence identity • The “twilight zone”(with identities 10% to 30%) , corresponds to relationships with statistically significant sequence similarity • The “midnight zone” (less than 10%), corresponds to statistically insignificant sequence similarity,
Continue…. ii) Fold Recognition Approaches • Pairwise sequence alignment methods • Profile-Sequence alignment methods • Profile-Profile alignment methods • Sequence-Structure threading methods iii) Template Selection & Alignment
Step I & IIFold Assignment & alignment with template Ref: Sali, A., and Blundell, T.L. (1993). Comparative protein modelling by satisfaction of spatial restraints, J. Mol. Biol.
Step III (Model Building) • i) Modeling by Satisfaction of Spatial Restraints • Homology-derived restraints • Stereochemical restraints • Optimization of the objective function • Restraints derived from experimental data
Loop Modeling • Loops play an important role in defining the functional specificity of a given protein (e.g. forming active & Binding site). • important aspect of comparative modeling for 30% to 50% sequence identity
Step III (Model Building) Ref: Sali, A., and Blundell, T.L. (1993). Comparative protein modelling by satisfaction of spatial restraints, J. Mol. Biol.
Step IV(Predicting Model Errors) • Errors in side-chain packing • Distortions and shifts in correctly aligned regions • Errors in regions without a template • Errors due to misalignments
Sequence of Template (1EMA) with 2D Structure Ref: http://www.rcsb.org/pdb/explore/explore.do?structureId=1EMA
3D Structure of Template (1EmA) Ref:http://www.rcsb.org/pdb/explore/explore.do?structureId=1EMA
SWISS-MODEL • First Approach Mode - more than 25% seq similarity. - automatic template selection. • Alignment Mode - less than 25% seq similarity. - input alignment with template. • Project Mode - input file is deep view project file.
Alignment of HriGFP with Template Ref: http://www.ebi.ac.uk/Tools/clustalw2/index.html?
Alignment of HriCFP with Template Ref: http://www.ebi.ac.uk/Tools/clustalw2/index.html?
Alignment w.r.t structure Ref: http://swissmodel.expasy.org/workspace/index.php?func=modelling_align1
MODELLER • Input Files • Atom file (.pdb, .ent, .atm) • Alignment file (.ali) • Script file (.py) • Output Files • Log file (text file) • Output of Script
Alignment Generated by MODELLER for HriGFPPap file (Output file)
Alignment Generated by MODELLER for HriCFPPap file (Output file)
Phase III (Model Generation) Five Models were generated DOPE score : -6536.567383 DOPE score : -6770.957031 DOPE score : -6814.932129 DOPE score : -7199.573242 DOPE score : -6945.205078 DOPE, or Discrete Optimized Protein Energy, is a statistical potential used to assess homology models in protein structure prediction.
Phase IV (Evaluation of Model) • The recognition of errors in experimental and theoretical models of protein structures is a major problem in structural biology. • If this score is outside a range characteristic for native proteins the structure probably contains errors. • A plot of local quality scores points to problematic parts of the model
Z-Score (Evaluation Criteria) • The z-score indicates overall model quality. • value is displayed in a plot that contains the z-scores of all experimentally determined protein chains in current PDB. • Can be used to check the input structure is within the range of scores typically found for native proteins of similar size.
Z-Score Ref: https://prosa.services.came.sbg.ac.at
Molecular Interactive View & Plot Generation Ref: https://prosa.services.came.sbg.ac.at
Ref: http://laboheme.df.ibilce.unesp.br/cluster/parmodel_mpi/
Ref: http://laboheme.df.ibilce.unesp.br/cluster/parmodel_mpi/
Conclusion • Both HriGFP and HriCFP are capable of forming only five beta-sheets that are not enough for formation of complete beta-can and hence may not provide full protection to fluorophore. So these gaps must be filled in order to give complete protection to fluorophore.
Future Prospects • Development of a Tool that would determine change in fluorescence upon mutagenesis.
Tour of • Database • Web-Interface • Tool