790 likes | 837 Views
Proof of Concept Studies & Consortia Building Networks. The Basic Technology Research Programme. Background. Cross research council endeavour administered by EPSRC Funding for research to create a new technology Change the way we do science Underpin the future industrial base. Background.
E N D
Proof of Concept Studies & Consortia Building Networks The Basic Technology Research Programme
Background • Cross research council endeavour • administered by EPSRC • Funding for research to create a new technology • Change the way we do science • Underpin the future industrial base
Background • 15 research projects funded up to April 2003 • Total funding for this period - £41M • To support large, long term, high risk, high impact research consortia • Encourage investigation of speculative ideas
Background • Two levels of funding • One year start up • Full grant up to five years • Two types of start up funding • Proof of concept • Consortia building networking
Proof of Concept Studies • One year funding up to £100K • Research to investigate feasibility of developing the new technology • Output – a business case for the next step of investigation to be submitted in May 2004 • Basic Technology Programme • Existing Research Council initiatives • DTI programmes
Consortia Building Networks • Involvement of the users of the new technology at a very early stage • Funding to form networks & hold workshops
ParaSurf – in silico Screening Technology • Basic Technology Funding for October 2003 to September 2004 • Proof of concept • Consortia building networking • Academic partners • University of Portsmouth • University of Erlangen • University of Southampton • University of Oxford • University of Aberdeen
ParaSurf – Proof of Concept Research Programme • Development of techniques to describe irregular solids & surfaces • Development of projection & pattern recognition techniques for non-planar colour-coded surfaces • spherical harmonics, molecular topology • Conformational analysis • Rigid body dynamics incorporating surface features • rigid parts of molecule treated as anisotropic solids linked by rotatable bonds • Investigate how best to generate prediction models using surface properties that define a low dimensional chemical space • QSAR, pattern recognition, artificial intelligence, analysis of surfaces • Bench marking using Grid computing
Potential applications of the in silico screening technology • High throughput virtual docking • Physical property mapping • ADMET prediction • Long time-period simulation techniques • Crystallisation and solubility • Prediction of tautomers • Chemical reactivity and metabolism
Letchworth, 16th March 2004 ParaSurf Progress Report
Main Areas • Molecular Surfaces and Property Calculation • RGB Encoding & Pattern Recognition • Conformational Analysis • Rigid Body Molecular Dynamics • Analysis of Variables & QSAR models • Grid Computing • Consortium Building
Datasets Small Consensus Set of 74 Drug Molecules (diverse) QSAR set (31 CoMFA steroids) Medium WDI subset (2,400 comps) Harvard Chembank dataset (2,000 comps) Large WDI (50,000) Maybridge (50,000)
Example Molecule Allopurinol
Calculations 3D co-ordinates from CORINA QM calculations with VAMP Local Properties and surfaces from ParaSurf
ParaSurf v1.0 Surfaces Isodensity Surfaces Shrink Wrap Marching Cube Surfaces fit to Spherical Harmonics Properties MEP, LIE, LEA and LP Encoded at points on the surface Encoded as Spherical Harmonic Expansions
RGB Encoding Each Local Property encoded as a colour LIE encoded on Red channel LEA encoded on Green Channel LP encoded on Blue Channel
RGB Encoding Alternative Encoding LIE LEA Absolute value of MEP
Conformational Analysis Efficient All Atom MD analysis (DASH) Treated as time series (not Cluster Analysis) Scales linearly with simulation length No need for arbitrary choice of number of clusters Can be analysed using Markov Chain methodology
Rigid body molecular dynamics Well founded methodology e.g. CNS / XPLOR (Axel T. Brunger, Stanford University) Idea is to use rigid groups to model flexibility: In the ligand and the protein binding site. Allows time-steps of 10fs to 20fs.
LIE LEA LP MEP LIE 1 0.44 0.26 0.39 LEA 0.44 1 0.58 0.47 LP 0.26 0.58 1 -0.1 MEP 0.39 0.47 -0.1 1 Correlation Matrix
Descriptors 34 descriptors based on Normal Distribution Principal Components Spherical Harmonic Co-efficients
Maximum value of the local ionization energy Minimum value of the local ionization energy Mean value of the local ionization energy Range of the local ionization energy Variance in the local ionization energy Descriptors for LIE
Other Descriptors Moments Order 1 – Mean Order 2 – Variance Order 3 – Skewness Order 4 – Kurtosis Overlapping Gaussians Derived from previous work on MD analysis
QSAR models Models derived from Local Properties Surface Integral Model for Solvation Energy RMS Error ~ 0.75 Kcal Drug Likeness SOMs trained on WDI (drugs) & Maybridge (general) Parameters from PC of Local Property Descriptors Medium sized datasets superimposed on SOMs
GRID Computing ParaSurf compiled on SGI IRIX Windows Linux (SUSE) IBM AIX Future Platforms SUN Solaris GRID enabling at Portsmouth (Mark Baker), Southampton and Oxford.
Provisional Timings SGI R10k, 256MB VAMP ~ 30s/compound ParaSurf ~ 10s/compound Intel 1.8 Xeon/ AMD Athlon XP-2000+ ParaSurf ~ 2s/compound SGI FUEL Workstation R14K ParaSurf ~ 2s/compound
Conclusions • Properties can be calculated • Properties can be RGB encoded • Properties are local • Properties can be used for QSAR models
Computer vision methods for comparing molecular surfaces • Comparing and recognising 3D objects is an active research area in robotics and AI. • Fast methods have been developed for database indexing. • Rotationally invariant descriptors of 3D objects are possible.
Pattern matching on molecular surfaces • Can we recognise similar surfaces? • Can we recognise similar surfacefragments? • Can we identify the most similar surface to our target? • How do we compare field descriptors on the molecular surface?
Rotationally invariant 3D object descriptors • Internal coordinates e.g. a distance matrix. • Energy distributions based on the spherical harmonics. • The spherical harmonic coefficients. • Radial integration, radial scanning, and invariant moments.
Surface comparison Two different approaches: • Using spherical harmonic molecular surfaces [J. Comp. Chem. 20(4) 383-395; Ritchie and Kemp 2000; University of Aberdeen]. • Partial molecular alignment via local structure analysis [J. Chem. Inf. Comput. Sci. 40(2) 503-512 ; Robinson, Lyne and Richards 1999; University of Oxford].
An example grid of surface points A grid is placed on a ParaSurf surface in order to reduce the number of surface points from 4038 to 55.