600 likes | 618 Views
PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant. The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing February 3, 2009. PubChem Overview … . … NIH “Molecular Libraries” … Basic design / approach
E N D
PubChem: An Open Repository forChemical Structure andBiological Activity InformationSteve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing February 3, 2009
PubChem Overview … … NIH “Molecular Libraries” … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ?
NIH Molecular Libraries Program … Technology Development Informatics Screening Instrumentation Cheminformatics Research Centers Molecular Libraries Screening Centers Network (MLSCN) Assay Development Chemical Diversity Compound Repository (MLSMR) Predictive ADMET
Hit List Customized Assay Optimization Chemistry Compound Repository Molecular Libraries BioAssays … Peer review Assay Investigator Screen Hit picking, confirmation, secondary screens
PubChem Overview … … NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ?
PubChem Approach … … “GenBank model” … direct depositions by investigators … highly automated (low database cost) … 25 year precedents in biology … less precedent in chemical biology
PubChem Contents … … Contributed substance records … with chemical structure … chemical names and comments … links to contributor web sites … contributed links to other NCBI biomedical databases
PubChem Contents … … Contributed bioassay records … with assay description / protocol … links to tested substances … summary and detailed test results … links to contributor web sites and other NCBI databases
PubChem Overview … … NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ?
PubChem Retrieval System … … Optimize “discoverability” for molecular biologists by integrating PubChem into NCBI’s Entrez / PubMed Search Engine … Chemical structure search … Bioassay result search … Structure-activity tools
Entrez Links and Neighbors ... 2,000,000 users ... 60,000,000 hits ... … per day VAST Structure Similarity Protein 3D Structure Activity Profile Similarity Bioactivity Screens Target Sequence Similarity PubChem Small Molecules Chemical Structure Similarity Term Frequency Statistics PubMed Literature Protein Sequences
Entrez Links and Neighbors ... 2,000,000 users ... 60,000,000 hits ... … per day VAST Structure Similarity Protein 3D Structure Activity Profile Similarity Bioactivity Screens Target Sequence Similarity PubChem Small Molecules Chemical Structure Similarity Term Frequency Statistics PubMed Literature Protein Sequences
PubChem Retrieval System … … Optimize “discoverability” for molecular biologists by integrating PubChem into NCBI’s Entrez / PubMed Search Engine … Chemical structure search … Bioassay result search … Exploratory structure-activity tools