940 likes | 1.05k Views
Java Solutions for Cheminformatics. March 2005. About Us. About Us Molecule Drawing and Visualization Structure Searching Cartridge Structure Standardization Molecular Predictions Chemical Expressions Screening Clustering Fragment Analysis Virtual Synthesis Current Developments.
E N D
Java Solutions for Cheminformatics March 2005
About Us About Us Molecule Drawing and Visualization Structure Searching Cartridge Structure Standardization Molecular Predictions Chemical Expressions Screening Clustering Fragment Analysis Virtual Synthesis Current Developments
Formed: 1998 Budapest, Hungary Skills base: Chemistry, Software development, Predictive tools Aim: Platform independent software forchemistry Highlights 1998: Custom projects 1999: Java tools for sketching/viewing structures 2000: Structure database support 2001: Clustering and diversity analysis 2003: Pharmacophore screening, property predictions, reaction processing, fragmenting 2004: Cartridge technology, virtual synthesis, improved SMARTS support History
People Developers: 17(7 Phd, 10 MSc) Technical expertise • Cheminformatics • Synthetic and physico-chemistry • Virtual drug design • Java • Web technology Business Support: 3(1 MSc, 2 BSc) Commercial expertise • Negotiation & contracting • Relationship management • Collaboration steering and development • Strategic marketing • Mutually benefitial (win win) business relationships
Selected Application Areas Global licenses Custom development projects Value added constructions Websites/portal front and back end Educational
Chemical drawing Product development 1998 1999 2000 2001 2002 2003 2004 Marvin SDF, RDF, XYZ animations, CML, templates, compressed formats, Swing, 3D rendering SMILES, SMARTS, PDB, Rgroups, isotopes, shortcuts, Marvin Beans Ball and stick, JPG, PNG, SVG, Cut&Paste with Isis/ChemDraw, 2D cleaning, (de)aromatization, reactions Partial charge, pKa, logP, logD, 3D generation, radicals, Sgroups Marvin file format, enhanced stereo, enhanced SMARTS support, shapes, text boxes, multiple groups, TPSA, Donor/Acceptor... Mac support, signed applets, Java Web Start, atom mapping Applets, Molfiles, stereo support, Windows, Unix JChem reaction searching, reaction processing, pharmacophoreanalysis. screening,standardization, fragmentation cartridge, enhanced stereo searching, recursive SMARTS, chemical expressions, virtual synthesis… clustering, diversity Oracle, MySQL, SQLServer, Access, hashed fingerprints, substructure and similarity searching DB2, PostgreSQL, Rgroup searching Structure Database and Cheminformatics toolkit
Multiple Deployment Formats • Applications • Java Applets • Signed Java Applets • Java Web Start • Java Beans • Plugins • JSP
Why ChemAxon? • Sophisticated virtual chemistry technology • Platform independence and Web (Java) • High performance tools (speed, capacity) • Client oriented development • Comprehensive API for the developers • Detailed documentation • Competitive prices • Fast and reliable support
Fast response to support question – max. 24 hourresponse (fast solution also!) Final and beta releases available online. Detailed documents available online and extensive help bundled within software Skilled and relevant human support quality (direct developer to developer) Product development based on support requests „Developers supporting developers” Product Support
Molecule Drawing and Visualization About Us Molecule Drawing and Visualization Structure Searching Structure Standardization Molecular Predictions Chemical Expressions Screening Clustering Fragment Analysis Virtual Synthesis Current Developments
Operating Systems • 100% pure java • Windows • 95, 98, Me, NT, 2000, XP • Macintosh • OS 9, OS X • Unix • Linux, Solaris, Irix, etc.
Web Browsers • Internet Explorer • Netscape • Mozilla • Safari • Opera
Various file formats Isotopes, charges, radicals Alias, pseudo atoms Templates Abbreviated groups Reactions Atom maps R-groups Stereo bonds, stereo configurations (R/S, E/Z) Enhanced stereo(ABS/AND/OR) SMARTS properties (atoms, bonds, recursive SMARTS) Chemical errorchecking Generic atoms and bonds Atom lists and not lists 2D cleaning 3D cleaning Various 3D models Shapes, text boxes Plugins Marvin
Structure Cleaning CC(C)NCC(O)COC1=C2C=C(C)NC2=CC=C1 topology 2D 3D
Structure Searching About Us Molecule Drawing and Visualization Structure Searching Cartridge Structure Standardization Molecular Predictions Chemical Expressions Screening Clustering Fragment Analysis Virtual Synthesis Current Developments
JChem Base Features • Rapid fingerprint-based database scanning • Sophisticated graph-based searching • Integration with databases • Oracle • MS SQL Server • DB2 • MYSQL • PostgreSQL • InterBase • Access • Custom standardization • JChem Cartridge for searching in Oracle • JSP integration
Exact structure Substructure Atom lists and notlists Explicit hydrogens Generic atoms Generic bonds SMARTS atom properties Aliphatic, aromatic Hydrogen count Connection count Valence Ring count Smallest ring size Recursive SMARTS Stereo atoms Stereo bonds R-group queries R-groups Occurence if / then conditions RestH Reaction search Transformation recognition Component identification Stereospecific reactions (inversion, retention) Diastereomers Enhanced stereo groups (Abs, And, Or) Query Features
JChem Base JSP Integration Thin client support: only a web browser and Java required
Cartridge Technology About Us Molecule Drawing and Visualization Structure Searching Cartridge Technology Structure Standardization Molecular Predictions Chemical Expressions Screening Clustering Fragment Analysis Virtual Synthesis Current Developments
JChem Cartridge for Oracle JChem Cartridge for Oracle Oracle can be extended to support chemical database operations using the JChem Cartridge for Oracle Examples: Substructure search displaying ID, SMILES codes, and molweight: SELECT cd_id, cd_smiles, cd_molweight FROM my_structuresWHERE jc_contains(cd_smiles, 'CC(=O)Oc1ccccc1C(O)=O') = 1; Finding benzene derivatives conforming the Lipinski’s rule of five: SELECT count(*) FROM my_structures WHERE jc_compare(structure, 'c1ccccc1','sep=!t:s!ctFilter:(mass() <= 500) &&(logP() <= 5) && (donorCount() <= 5) &&(acceptorCount() <= 10)') = 1;
JChem Cartridge for Oracle JChem Cartridge for Oracle Example Oracle search returning similar structures with logP >1, which were acquired after April 14th, 2002. MarvinView below.
Structure Standardization About Us Molecule Drawing and Visualization Structure Searching Cartridge Technology Structure Standardization Molecular Predictions Chemical Expressions Screening Clustering Fragment Analysis Virtual Synthesis Current Developments
Standardization • Explicit hydrogens • Aromatic bonds • Mesomers • Tautomers • Counterions
Standardization Example before after
Molecular Predictions About Us Molecule Drawing and Visualization Structure Searching Cartridge Technology Structure Standardization Molecular Predictions Chemical Expressions Screening Clustering Fragment Analysis Virtual Synthesis Current Developments
Available Calculations Elemental analysis Charge distribution Polarizability pKa logP logD Polar surface area Huckel Analysis H-bond donor-acceptor Major microspecies Refractivity Calculation Interface Marvin GUI Command line Chemical Terms API Calculator Plugins
Partial Charge Distribution Calculation Partial Equalization of Orbital Electronegativities (PEOE) Orbital electronegativity defined by Mulliken Orbital electronegativity of atom i: ci=at+btqi+ctqi2 qi: partial charge Partial charge of atom i is iteratively calculated based on Gasteiger’s method: ci(0)= at, qi(0)= 0 qi(n+1)= qi (n) + S(0.5)n(ci- ck)/ max(ci, ck) k: index of a neighbor of atom i
logP Example logP = Sfi fI: atomic logP increment
logD Example 1+(1) 2+(2) 3-(3) k4 1+2+(4) 1+3-(5) 2+3-(6) k1 k7 k5 123(0) k2 1+2+3-(7) k6 k3 logD is computed using micro ionization constants (ki), micro partition coefficients (pi), and pH
pKa Plugin - Microconstants Micro ionization constants (logk) are calculated from regression equations that have three types of calculated parameters: Intramolecular interactions Partial charges logk Polarizabilities
pKa Plugin - Macroconstants 1 3 2 Macro ionization constants (pKa) are calculated from the microconstants (logk) Ionization scheme 1- 1-2+ 123 2+ 1-3- 1-2+3- 3- 2+3-
Hydrogen Bonds in pKa Calculation Dlogk = a (qi - qk) + b a,b:regression parameters Intramolecular hydrogen bonds are also taken into account