1 / 40

Introduction to Chemoinformatics

Introduction to Chemoinformatics. Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course. Drug Discovery Process. The drug candidate.

Download Presentation

Introduction to Chemoinformatics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course

  2. Drug Discovery Process

  3. The drug candidate • ... is a (ligand) compound that binds to a biological target (protein, enzyme, receptor, ...) and in this way either initiates a process (agonist) or inhibits it (antagonist/inhibitor) • The structure/conformation of the ligand is complementary to the space defined by the protein’s active site • The binding is caused by favorable interactions between the ligand and the side chains of the amino acidsin the active site. (electrostatic interactions, hydrogen bonds, hydrophobic contacts...)

  4. Wet-lab drug discovery process HTS Screening collection Actives 106 cmp. 103 actives

  5. Wet-lab drug discovery process HTS Screening collection Actives 106 cmp. 103 actives High rate of false actives!!! High throughput is not enough to get high output…..

  6. Wet-lab drug discovery process Chemical structure Purity Mechanism Activity value HTS Follow-up Screening collection Actives 106 cmp. 103 actives

  7. Wet-lab drug discovery process HTS Follow-up Screening collection Actives Hits 106 cmp. 103 actives 1-10 hits Analogues synthesis and testing ADMET properties

  8. Wet-lab drug discovery process HTS Follow-up Hit-to-lead Screening collection Lead series Actives Hits 106 cmp. 103 actives 1-10 hits 0-3 lead series Analogues synthesis and testing ADMET properties

  9. Wet-lab drug discovery process Lead-to-drug HTS Follow-up Hit-to-lead Screening collection Lead series Drug candidate Actives Hits 106 cmp. 103 actives 1-10 hits 0-3 lead series 0-1 Analogues synthesis and testing ADMET properties

  10. Failures

  11. We need more.. to find less..

  12. Drug Discovery Process Chemoinformatics

  13. Wet-lab + Dry-lab drug discovery in vitro in silico+ in vitro Diverse set of molecules tested in the lab Computational methods to select subsets (to be tested in the lab) based on prediction of drug-likeness, solubility, binding, pharmacokinetics, toxicity, side effects, ...

  14. The Lipinski ‘rule of five’ for drug-likeness prediction • Molecular weight ≤ 500 • # hydrogen bond acceptors (HBA) ≤ 10 • # hydrogen bond donors (HBD) ≤ 5 • Octanol-water partition coefficient (logP) ≤ 5 (MlogP ≤ 4.15) • If two or more of these rules are violated, the compound might • have problems with oral bioavailability. (Lipinski et al., Adv. Drug Delivery Rev., 23, 1997, 3.)

  15. Exercise : Prediction of drug-likeness • Go to the following webpage www.molsoft.com/mprop • Draw proguanil and decide if it is a drug-like compound

  16. Proguanil antimalarian tablets

  17. Gathering and systematic use of chemical information, and application of this information to predict the behavior of unknown compounds in silico. Chemoinformatics data prediction

  18. Major Aspects of Chemoinformatics • Databases:Development of databases for storage and retrieval of small molecule structures and their properties. • Machine learning: Training of Decision Trees, Neural Networks, Self Organizing Maps, etc. on molecular data. • Predictions: Molecular properties relevant to drugs, virtual screening of chemical libraries, system chemical biology networks…

  19. Representinga chemicalstructure • How much information do you want to include? • atoms present • connections between atoms • bond types • stereochemical configuration • charges • isotopes • 3D-coordinates for atoms C8H9NO3

  20. Representinga chemicalstructure • How much information do you want to include? • atoms present • connections between atoms • bond types • stereochemical configuration • charges • isotopes • 3D-coordinates for atoms

  21. Representinga chemicalstructure • How much information do you want to include? • atoms present • connections between atoms • bond types (aromatic ring identification) • stereochemical configuration • charges • isotopes • 3D-coordinates for atoms

  22. Representinga chemicalstructure • How much information do you want to include? • atoms present • connections between atoms • bond types • stereochemical configuration • charges • isotopes • 3D-coordinates for atoms

  23. Representinga chemicalstructure • How much information do you want to include? • atoms present • connections between atoms • bond types • stereochemical configuration • charges • isotopes • 3D-coordinates for atoms

  24. Representinga chemicalstructure • How much information do you want to include? • atoms present • connections between atoms • bond types • stereochemical configuration • charges • isotopes • 3D-coordinates for atoms

  25. Representinga chemicalstructure • How much information do you want to include? • atoms present • connections between atoms • bond types • stereochemical configuration • charges • isotopes • 3D-coordinates for atoms

  26. From chemists to representations

  27. Structural representation of molecules Structural representation of molecules Line notations Connection tables

  28. SMILES (Simplified Molecular Input Line Entry System) Canonical SMILES: unique for each structure Isomeric SMILES: describe isotopism, configuration around double bonds and tetrahedral centers, chirality

  29. InChI(IUPAC International Chemical Identifier)

  30. MOLfile format (.sdf)

  31. Small molecule databases

  32. Try it yourself! • Go to PubChem: pubchem.ncbi.nlm.nih.gov/ • Type proguanil and press Go • Click on the first result on the list

  33. Try it yourself! • Scroll down and find the SMILES and InChI

  34. Try it yourself! • Click on SDF (top right icon) • Select: 2D SDF: Display

  35. Try it yourself! • Go back and click again on SDF • Select: 3D SDF: Display

  36. Questions?

More Related