1 / 15

Clustering of peptide fragment structures reveals nature’s building block approach

Clustering of peptide fragment structures reveals nature’s building block approach. Ashish V. Tendulkar Research Scholar Kanwal Rekhi School of I.T. I.I.T. Bombay. Guide: Prof. P. Wangikar Co-guide: Prof. Sunita Sarawagi. Outline. Terms Objectives Approach Results Conclusion. Terms.

alaqua
Download Presentation

Clustering of peptide fragment structures reveals nature’s building block approach

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Clustering of peptide fragment structures reveals nature’s building block approach Ashish V. Tendulkar Research Scholar Kanwal Rekhi School of I.T. I.I.T. Bombay Guide: Prof. P. Wangikar Co-guide: Prof. Sunita Sarawagi

  2. Outline • Terms • Objectives • Approach • Results • Conclusion

  3. Terms • Protein is made up of amino acids. There are in all 20 different types amino acids. • Protein is a linear sequence of amino acid. • Protein takes up 3-D structure. The structure is result of its amino acid sequence.

  4. Protein Structure • Primary Structure: ACGADSTYKSTYSCPLA • Secondary structure • 3-D structure

  5. Objectives • Prediction of protein structure from merely its sequence. • Protein sequence is believed to take up vast number of conformations • Learn relation between sequence and structure by example of known protein structures. • Build library of sequence-structure mapping

  6. Salient Features • Geometric invariant: A quantity, which is unchanged under a group of geometric transformations, in this case, the group of translations and rotations in 3-dimensional space. • Examples of continuous invariants: signed volumes, areas, lengths. • For our group of transformations, it has been shown that invariants suffice to decide superimposability of two structures. Thus, if two patterns K1 and K2are not superimposable then there is an invariant f such that f(K1)≉f(K2).

  7. Salient Features • We discretize a structure by its evaluations on a fixed suite of N invariants and mapped into the N-dimensional space as a vector. • We examine 1.2 million peptides from 4,500 non-redundant protein structures. • This collection may now be subjected to the tools of data-mining. • Clustering of Patterns: A cluster is a small region in this N-space, which has a large number of pattern-vectors. • Closeness of points and density is decided via a training regime

  8. All overlapping octapeptide fragments from PDB_95 Geometric invariant based representation of each peptide as a point in 56-dimensional space and clustering Dense cluster of peptides in a 56-dimensional box GIk GI56 Wi Training regime to decide the tolerance window Wi in each dimension based on known superimposable peptides. GI2 GI1 Categorization of clusters “Structural” clusters “Functional” clusters with majority of peptides drawn from a single SCOP superfamily. Hierarchical clustering based on closeness of centroinds of clusters

  9. C4 C4 C5 C5 C3 C3 C7 C7 C6 C6 C1 C1 C8 C8 C2 C2 b) Tetrahedron_gap_1: constructed from alternate C  atoms. a) Tetrahedron_gap_0: constructed from consecutive C  atoms. • Examples of G.I. • Surface area • Volume • Perimeter • Sum of squares of edges • Sum of centroid to node distances c) Geometric invariants associated with a tetrahedron

  10. Summary of Peptide Library • 12000 clusters, size range from 5-160,000. • 2000 functional clusters. • Demonstrates nature’s bias toward a selected conformations. • Potential applications in protein structure prediction.

  11. Distribution of clusters By Information Content Distribution of clusters By Cluster size. No. of clusters No. of clusters Avg. information content of the cluster No. of peptides in a cluster

  12. Structural Clusters Twisted -strand (S.2.10.1.23.389) Known -hairpin (S.1.6.1.6.19)

  13. Functional Clusters Acid Proteases: Active site loop conformation I (F.b.50.1.3.11.7870) Acid Proteases: Active site loop conformation II (F.b.50.1.4.9.3460)

  14. Conclusions • Century old “Geometric Invariant theory” applied to protein structure for the first time. • Peptide fragment library(DPFS) can be used in protein structure prediction. It is available on web at www.it.iitb.ac.in/dpfs/

  15. Acknowledgements • Prof. Milind Sohoni for his inputs on Geometric Invariants • Anand Joshi for his contribution in the project

More Related