1 / 55

Thinking Outside the Box: Applications Including Finding Off-targets for Major Pharmaceuticals

Thinking Outside the Box: Applications Including Finding Off-targets for Major Pharmaceuticals. Philip E. Bourne pbourne@ucsd.edu. Agenda. Overall Theme - Thinking differently about proteins: Spherical harmonics and phylogeny The Gaussian Network Model and new modes of motion

ethel
Download Presentation

Thinking Outside the Box: Applications Including Finding Off-targets for Major Pharmaceuticals

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Thinking Outside the Box: Applications Including Finding Off-targets for Major Pharmaceuticals Philip E. Bourne pbourne@ucsd.edu

  2. Agenda • Overall Theme - Thinking differently about proteins: • Spherical harmonics and phylogeny • The Gaussian Network Model and new modes of motion • The Geometric Potential for Describing Ligand Binding Sites • SOIPPA for finding off-site targets

  3. The Curse of the Ribbon The conventional view of a protein (left) has had a remarkable impact on our understanding of living systems, but its time for new views It is not how a ligand sees a protein after all.

  4. Limitations • A local viewpoint – does not capture the global properties of the protein • A local viewpoint does not capture the global properties of a protein • Cartesian coordinates do not necessarily capture the properties of the protein • Comparative analysis is limited

  5. Agenda • Overall Theme- Thinking differently about proteins: • Spherical harmonics and phylogeny • The Gaussian Network Model and new modes of motion • The Geometric Potential for Describing Ligand Binding Sites • SOIPPA for finding off-site targets

  6. Protein Kinase A – Open Book View Scheeff & Bourne 2005 PLoS Comp. Biol. 1(5): e49

  7. Superfamily Members – The Same But Different Scheeff & Bourne 2005 PLoS Comp. Biol. 1(5): e49

  8. An Alternative Approach: Multipolar Representation • Roots in spherical harmonics • Parameter space and boundary conditions can be a variety of properties • Order of the multipoles defines the granularity of the descriptors • Bottom line – interpreted as shape descriptors Gramada & Bourne 2006 BMC Bioinformatics 7:242

  9. Geometric Comparison Does Not Reflect Biological Reality Gramada & Bourne 2006 BMC Bioinformatics 7:242

  10. Results – Protein Kinase Like Superfamily Alignment • Clear distinction between families. • Some clustering seen inside TPKs that resemble various groups, even though there is little shape discrimination at this level. Gramada & Bourne 2006 BMC Bioinformatics 7:242

  11. Results – Protein Kinase Like Superfamily Alignment Gramada & Bourne 2006 BMC Bioinformatics 7:242

  12. Possibilities – Structure Based Phylogenetic Analysis Scheeff & Bourne Multipoles Gramada & Bourne 2007 PLoS ONE submitted

  13. Agenda • Overall Theme- Thinking differently about proteins: • Spherical harmonics and phylogeny • The Gaussian Network Model and new modes of motion • The Geometric Potential for Describing Ligand Binding Sites • SOIPPA for finding off-site targets

  14. Structures exist in a spectrum from order to disorder Protein Motion Ordered Structures Disordered Structures Gu, Gribskov & Bourne 2006 PLoS Comp. Biol. 2(7) e90

  15. Obtaining Protein Dynamic InformationProtein Structures Treated as a 3-D Elastic Network Bahar, I., A.R. Atilgan, and B. Erman Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. Folding & Design, 1997. 2(3): p. 173-181.

  16. Gaussian Network Model • Each Cais a node in the network. • Each node undergoes Gaussian-distributed fluctuations influenced by neighboring interactions within a given cutoff distance. (7Å) • Decompose protein fluctuation into a summation of different modes.

  17. Functional Flexibility Score • Utilize correlated movements to help define regional flexibility with functional importance. Functionally Flexible Score For each residue: Find Maximum and Minimum Correlation. Use to scale normalized fluctuation to determine functional importance. Gu, Gribskov & Bourne 2006 PLoS Comp. Biol. 2(7) e90

  18. Identifying FFRs in HIV Protease Gu, Gribskov & Bourne 2006 PLoS Comp. Biol. 2(7) e90

  19. Other Examples BPTI and Calmodulin Gu, Gribskov & Bourne 2006 PLoS Comp. Biol. 2(7) e90

  20. Side Note: Gaussian Network Model vs Molecular Dynamics • GNM relatively course grained • GNM fast to compute vs MD • Look over larger time scales • Suitable for high throughput

  21. Agenda • Overall Theme- Thinking differently about proteins: • Spherical harmonics and phylogeny • The Gaussian Network Model and new modes of motion • The Geometric Potential for Describing Ligand Binding Sites • SOIPPA for finding off-site targets

  22. Motivation • What if we can characterize a protein-ligand binding site from a 3D structure (primary site) and search for that site on a proteome wide scale? • We could perhaps find alternative binding sites (secondary sites) for existing pharmaceuticals? • We could use it for lead optimization and possible ADME/Tox prediction

  23. Background – PDB Contains Major Pharmaceuticals Bound to Receptors

  24. Background – Superfamily (Derived from Structure) Covers 38% of the Human Proteome http://supfam.mrc-lmb.cam.ac.uk/SUPERFAMILY

  25. Background – Advantage to Using Functional Site Similarity • Poor correlation between structure and activity • Infinite chemical space Small molecule Similarity . Not adequately reflecting functional relationship . Not directly addressing drug design problem Protein Sequence/Structure Similarity . Build closer structure- function relationships . Limit chemical space through co-evolution Protein Functional Site Similarity

  26. Overview of Algorithm • Protein structure is represented with Ca atoms only and is characterized with a geometric potential • tolerant to protein flexibility and model uncertainty • Optimum superimposition is achieved with a maximum weighted sub-graph algorithm with geometric constraints • sequence order independent to detect cross-fold relationships • to identify sub site similarity • Functional site similarity is measured with both evolutionary correlation and physiochemical similarity • to distinguish divergent and convergent evolution Xie and Bourne 2007 BMC Bioinformatics, 8(Suppl 4):S9

  27. 1 2 a b 3 c 4 5 Characterization of the Ligand Binding Site - Conceptual • Represent the protein structure • Determine the environmental boundary • Determine the protein boundary • Computation of the geometric potential • Computation of the virtual ligand Xie and Bourne 2007 BMC Bioinformatics, 8(Suppl 4):S9

  28. Characterization of the Ligand Binding Site - Conceptual • Conceptually similar to hydrophobicity or electrostatic potential that is dependant on both global and local environments • Initially assign Ca atom with a value that is the distance to the environmental boundary • Update the value with those of surrounding Ca atoms dependent on distances and orientation – atoms within a 10A radius define i Xie and Bourne 2007 BMC Bioinformatics, 8(Suppl 4):S9

  29. Discrimination Power of the Geometric Potential • Geometric potential can distinguish binding and non-binding sites 100 0 Geometric Potential Scale

  30. Boundary Accuracy of Ligand Binding Site Prediction • ~90% of the binding sites can be identified with above 50% sensitivity • The specificity of ~70% binding sites identified is above 90%

  31. So Far… • Geometric potential dependant on local environment of a residue – relative to other residues and the environmental boundary • Geometric potential reasonably good at discriminating between ligand binding sites and non-ligand binding sites • Boundary of the binding site reasonably well defined • How to compare sites ???

  32. Agenda • Overall Theme- Thinking differently about proteins: • Spherical harmonics and phylogeny • The Gaussian Network Model and new modes of motion • The Geometric Potential for Describing Ligand Binding Sites • SOIPPA for finding off-site targets

  33. Identification of Functional Similarity with Local Sequence Order Independent Alignment • Geometric and graph characterization of the protein structure • Chemical similarity matrix and evolutionary relationship with profile-profile comparison • Optimum alignment with maximum-weight sub-graph algorithm Xie and Bourne 2007 PNAS, Submitted

  34. Similarity Matrix of Alignment • Chemical Similarity • Amino acid grouping: (LVIMC), (AGSTP), (FYW), and (EDNQKRH) • Amino acid chemical similarity matrix • Evolutionary Correlation • Amino acid substitution matrix such as BLOSUM45 • Similarity score between two sequence profiles fa, fb are the 20 amino acid target frequencies of profile a and b, respectively Sa, Sb are the PSSM of profile a and b, respectively Xie and Bourne 2007 PNAS, Submitted

  35. Local Sequence-order Independent Alignment with Maximum-Weight Sub-Graph Algorithm Structure A Structure B L E R V K D L L E R V K D L • Build an associated graph from the graph representations of two structures being compared. Each of the nodes is assigned with a weight from the similarity matrix • The maximum-weight clique corresponds to the optimum alignment of the two structures

  36. Efficient Functional Site Comparison with Evolutionary and Geometric Constraints • The search space is segmented with the residue clusters determined from the geometric potential • The nodes and edges are greatly reduced with the robust residue boundary orientation and neighbors 2 2 a a 1 a a 1 a 1 + b c b c 2 2 1 1 b 2 c b b c c 2 2 1 1 • The time complexity is almost linearly dependant on the number of residues

  37. Improved Performance of Alignment Quality and Search Sensitivity and Specificity RMSD distribution of the aligned common fragments of ligands from 247 test cases showing four scores: amino acid grouping, chemical similarity, substitution matrix and profile-profile. .

  38. So What is the Potential of this Methodology?

  39. Lead Discovery from Fragment Assembly • Privileged molecular moieties in medicinal chemistry • Structural genomics and high throughput screening generate a large number of protein-fragment complexes • Similar sub-site detection enhances the application of fragment assembly strategies in drug discovery 1HQC: Holliday junction migration motor protein from Thermus thermophilus 1ZEF: Rio1 atypical serine protein kinase from A. fulgidus

  40. Lead Optimization from Conformational Constraints • Same ligand can bind to different proteins, but with different conformations • By recognizing the conformational changes in the binding site, it is possible to improve the binding specificity with conformational constraints placed on the ligand 1ECJ: amido-phosphoribosyltransferase from E. Coli 1H3D: ATP-phosphoribosyltransferase from E. Coli

  41. Finding Secondary Binding Sites for Major Pharmaceuticals • Scan known binding sites for major pharmaceuticals bound to their receptors against the human proteome • Try and correlate strong hits with known data from the literature, databases, clinical trials etc. to provide molecular evidence of secondary effects

  42. A Case Study

  43. Selective Estrogen Receptor Modulators (SERM) • One of the largest classes of drugs • Breast cancer, osteoporosis, birth control etc. • Amine and benzine moiety Xie, Wang and Bourne 2007 Nature Biotechnology, Submitted.

  44. Adverse Effects of SERMs cardiac abnormalities loss of calcium homeostatis thromboembolic disorders ????? ocular toxicities Xie, Wang and Bourne 2007 Nature Biotechnology, Submitted.

  45. Ligand Binding Site Similarity Search On a Proteome Scale SERCA ERa • Searching human proteins covering ~38% of the drugable genome against SERM binding site • Matching Sacroplasmic Reticulum (SR) Ca2+ ion channel ATPase (SERCA) TG1 inhibitor site • ERa ranked top with p-value<0.0001 from reversed search against SERCA Xie, Wang and Bourne 2007 Nature Biotechnology, Submitted.

  46. Structure and Function of SERCA • Regulating cytosolic calcium levels in cardiac and skeletal muscle • Cytosolic and transmembrane domains • Predicted SERM binding site locates in the TM, inhibiting Ca2+ uptakes Xie, Wang and Bourne 2007 Nature Biotechnology, Submitted.

  47. Binding Poses of SERMs in SERCA from Docking Studies • Salt bridge interaction between amine group and GLU • Aromatic interactions for both N-, and C-moiety 6 SERMS A-F (red)

  48. Off-Target of SERMs cardiac abnormalities loss of calcium homeostatis thromboembolic disorders SERCA ! ocular toxicities • in vivo and in vitro Studies • TAM play roles in regulating calcium uptake activity of cardiac SR • TAM reduce intracellular calcium concentration and release in the platelets • Cataract results from TG1 inhibited SERCA up-regulations • EDS increases intracellular calcium in lens epithelial cells by inhibiting SERCA • in silico Studies • Ligand binding site similarity • Binding affinity correlation

  49. Conclusion • By thinking differently about how to represent proteins we have seen potential value in: • Phylogenetic analysis • The study of the dynamics of proteins • Improvements to the drug discovery process

  50. Acknowledgements Lei Xie Jian Yang Jenny Gu Protein Motions Apostol Gramada Multipole Analysis Support Open Access

More Related