1 / 61

Computational Analysis of Proteins

Dr. K. Sivakumar Department of Chemistry SCSVMV University chemshiva@gmail.com. Computational Analysis of Proteins. National Workshop on Modern Techniques in Analytical Chemistry. Chemistry – Our Life, Our Future. www.kanchiuniv.ac.in/DrKSivakumar_chemistry.html. Amino Acid.

olin
Download Presentation

Computational Analysis of Proteins

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Dr. K. Sivakumar Department of Chemistry SCSVMV University chemshiva@gmail.com Computational Analysis of Proteins National Workshop onModern Techniques in Analytical Chemistry Chemistry – Our Life, Our Future www.kanchiuniv.ac.in/DrKSivakumar_chemistry.html

  2. Amino Acid Triple letter code Single letter code General structureof an amino acid Alanine Ala A Cysteine Cys C Aspartic acid Asp D Glutamic acid Glu E Phenylalanine Phe F Glycine Gly G Histidine His H Isoleucine Ile I Lysine Lys K Leucine Leu L Methionine Met M Asparagine Asn N Proline Pro P Glutamine Gln Q Arginine Arg R Serine Ser S Threonine Thr T Valine Val V Tryptophan Trp W Tyrosine Tyr Y AMINO ACIDS: THE BUILDING BLOCKS OF PROTEINS Triple & single letter codes of amino acids

  3. Protein sequencer Alanine Leucine Serine Phenylalanine Methionine MALSFTVGQLIFLFWTMRITEASPD Protein sequence PROTEIN SEQUENCING ( Order of amino acids in proteins) • Protein sequencing - determining the order of amino acid sequence • Methods– Mass Spec., Edman degradation,…. • Amino acids in a protein -determines the properties of proteins • Proteins are sequenced - by microbiologists and biotechnologists for various purposes.

  4. Refer “GENOME” by Sujatha, for simple explanations on sequencing process www.writersujatha.com

  5. Various levels of protein structure……..

  6. Methane C for carbon C for single atom Primary structure Secondary structure Tertiary structure M for Metheonine M for group of atoms Protein Primary structure Tertiary structure Secondary structure

  7. Protein sequences are continuously submitted by sequencing centers and updated in protein databases. • Till date more than 10 Lac proteins are sequenced and publicly made available through protein databases. For example, Protein Sequence Databases No. of Sequences 524,420 1,365,912 13,593,921

  8. Sequence growth in Protein sequence databases: Ref: SwissProt – Feb’ 2011 Ref: GenomeNet – Feb’ 2011

  9. Protein Sequence Databases No. of Sequences 524,420 - ~ 5 Lac 1,365,912 - > 10 Lac 13,593,921 - ~ 1 Cr The ONLY Protein Structure Database No. of Structure 70,947 Till 01, Feb, 2011 Ref: K. Sivakumar, Advanced BioTech, V (9), 20-27 (2007)

  10. PDB contains (70,947) structures determined by X-ray, NMR & Electron microscopy EM ~350 X-ray ~60,500 NMR ~8,700

  11. Most of the sequenced proteins lack a descriptive, documented physico-chemical and STRUCTURAL characterization. • Because, experimental methods (X-ray, NMR, EM) are, • Trial and error based • Time consuming • Expensive Computational methods are, • Minimizing the number of experimental trials. • Reduces the cost of experimental investigation. • Facilitates experimental analysis be more focused. Ref: K. Sivakumar, S. Balaji, Ganga Radhakrishnan, Journal of Theoretical and Computational Chemistry, 6 (1), 127-140 (2007).

  12. Need for computational analysis • > 10 Lac sequences are available in public databases • Sequences are highly valuable resources, because… • Huge amount of structural, functional & evolutionary information are locked up in sequences • By contrast, the # of unique protein structures is very less • - this represents a hugeinformation deficit • So, We need to construct 3D Models by COMPUTATIONAL METHODS

  13. 3D Structure can be modelled by… • Homology Modeling • Threading • Ab initio

  14. Homology Modeling – Principle… Repeated with other suitable templates Ref: K. Sivakumar, Advanced BioTech, IV (11), 18-23 (2006)

  15. Predicting Protein Structure: Comparative Modeling (formerly, homology modeling) Homologous KQFTKCELSQNLYDIDGYGRIALPELICTMFHTSGYDTQAIVENDESTEYGLFQISNALWCKSSQSPQSRNICDITCDKFLDDDITDDIMCAKKILDIKGIDYWIAHKALCTEKLEQWLCEKE KVFGRCELAAAMKRHGLDNYRGYSLGNWVCAAKFESNFNTQATNRNTDGSTDYGILQINSRWWCNDGRTPGSRNLCNIPCSALLSSDITASVNCAKKIVSDGNGMNAWVAWRNRCKGTDVQAWIRGCRL Share Similar Sequence 1alc ? Use as template & model 8lyz Template sequence Target sequence Template structure

  16. What is Homology Modeling? • Predicts the three-dimensional structure of a given protein sequence (TARGET) based on an alignment to one or more known protein structures (TEMPLATES) • If similarity between the TARGET sequence and the TEMPLATE sequence is detected, structural similarity can be assumed. • In general, 30% sequence identity is required for generating useful models.

  17. Homology Modeling • Get protein sequence from sequence database http://expasy.org/sprot/

  18. Click to get protein details

  19. Click to get protein sequence

  20. protein sequence in fasta format • Save it in a notepad for further use

  21. Using Protein Blast server to find similar STRUCTURE http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins Paste sequence in Fasta format Choose PDB Click to search, similar structures in PDB

  22. Graphical summary of Blastp suite Blast search of O70456 Vs PDB

  23. List of similar structure - Blastp suite

  24. Detailed summary of Blastp suite

  25. Method1: EsyPred3D server - Submit the sequence and PDB ID Paste sequence only Type the PDB ID Click to submit

  26. Get built in structure through email in Inbox

  27. Download the attached the *.pdb file and save it

  28. Open and visualize the *.pdb file in RasMol

  29. Open and visualize the *.pdb file in RasMol

  30. Method2: SWISS-MODEL server Click for modeling

  31. Submit sequence only in Fasta format (without PDB ID) • Similarity search (BlastP) will be done by SWISS-MODEL server Paste sequence Click to submit

  32. Get built in structure through email in Inbox

  33. The links in the email will lead to Click to download 3D structure

  34. Open and visualize the *.pdb file in RasMol

  35. Structure retrieval from Protein 3D Structure Database – PDB……….

  36. Structure retrieval from Protein 3D Structure Database – PDB………. 491 sequence in SwissProt for « Keratin » PDB ID Click for protein details

  37. Structure retrieval from Protein 3D Structure Database – PDB………. Click for downloading structure

  38. Structure retrieval from Protein 3D Structure Database – PDB………. Save & Know the location

  39. Open and visualize the *.pdb file in RasMol Structure of 3EUU

  40. MNRVDLSLFIPDSLTAETGDLKIKTYKVVLIARAASIFGVKRIVIYHDDADGEARFIRDILTYMDTPQYLRRKVFPIMRELKHVGILPPLRTPHHPTGMNRVDLSLFIPDSLTAETGDLKIKTYKVVLIARAASIFGVKRIVIYHDDADGEARFIRDILTYMDTPQYLRRKVFPIMRELKHVGILPPLRTPHHPTG Sequence data Structural data (in notepad) Structural data (in RasMol)

  41. Built model validation by ProQ server Click for uploading structure

  42. Built model validation by ProQ server Click & upload the structure

  43. Built model validation by ProQ server Submit after uploading

  44. Built model validation by ProQ server result

  45. Built model validation by Ramachandran Plot Click & upload the structure

  46. Built model validation by Ramachandran Plot…. Submit after uploading

  47. Built model validation by Ramachandran Plot…. RESULTS G.N.Ramachandran

  48. 3D structure modeling and validation Ref: K. Sivakumar, S. Balaji, Ganga Radhakrishnan, Journal of Chemical Sciences,119 (5), 571-579 (2007)

  49. Disulphide bridges in 3D structure of Q01758 • Backbone of Q01758 (rainbow smelt fish) • 10 Cysteines - ball and stick • 10 Sulphur in Cysteines and 5 SS bonds (dotted lines)

  50. Disulphide bridges in 3D structure of P05140 • Ribbon model of P05140 (sea raven) • 10 Cysteines - ball and stick • 10 Sulphur in Cysteines and 5 SS bonds (dotted lines)

More Related