1 / 66

Introduction to Biomolecular Structure and Modeling

Introduction to Biomolecular Structure and Modeling. Dhananjay Bhattacharyya Biophysics Division Saha Institute of Nuclear Physics Kolkata dhananjay.bhattacharyya@saha.ac.in. Biomolecular Structures. These are determined experimentally by X-Ray Crystallography

iona
Download Presentation

Introduction to Biomolecular Structure and Modeling

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to Biomolecular Structure and Modeling Dhananjay Bhattacharyya Biophysics Division Saha Institute of Nuclear Physics Kolkata dhananjay.bhattacharyya@saha.ac.in

  2. Biomolecular Structures These are determined experimentally by • X-Ray Crystallography • Nuclear Magnetic Resonance Spectroscopy • Neutron Diffraction Study • Raman Spectroscopy And also by theoretical methods

  3. 2d sinq=nl

  4. Nucleic Acid Backbone is Connected to Either of Four Different Bases

  5. C A G T

  6. A-DNA Z-DNA B-DNA

  7. Proteins (polymers) are made up of Amino Acids (monomer units) • There are Twenty different Amino Acids • with different shape, size and electrostatic • properties. • These amino acids form covalent • bonds to form a linear polypeptide chain.

  8. Alanine Phenylalanine Serine Cystine

  9. Glutamic Acid (Negatively charged) Arginine (Positively charged)

  10. Amino Acids are joined together by covalent bonds, called peptide bond, which is structurally very important

  11. a-helix: Hydrogen bonding between every i  i+4 residues

  12. b-sheet: Hydrogen bonding between ij, i+1j-1 (Antiparallel), or ij, i+1j+1 (parallel)

  13. Coordinate System: • External coordinates, such as (x,y,z), (r,q,f), (r,f,z) • Internal coordinates (BondLength, BondAngle, TorsionAngle)

  14. Torsion Angle Bond Angle Bond Length

  15. Internal  External Coordinate

  16. Generated coordinates • H 0.000000 0.000000 0.000000 • C 0.000000 0.000000 1.089000 • C 1.367073 0.000000 1.572333 • C 2.050610 -1.183920 1.089000 • C 3.417683 -1.183920 1.572333 • H -0.513360 0.889165 1.452000 • H -0.513360 -0.889165 1.452000

  17. y f

  18. Theoretical Modeling of Biomolecules: • Quantum Mechanics based Methods • Statistics based Methods • Classical or Molecular Mechanics methods

  19. Peptide modeling initiated in India by G.N. Ramachandran (1950s) • Postulates: • Impenetrable spherical volumes for each atom • Radius of the sphere depend on atom type • No two atomic spheres can overlap if they are not covalently bonded y f

  20. Between H N O C P S H 2.0 (1.9) 2.4 (2.2) 2.4 (2.2) 2.4 (2.2) 2.65(2.5) 2.65(2.5) N 2.7 (2.6) 2.7 (2.6) 2.9 (2.8) 3.2 (3.1) 3.1 (3.0) O 2.7 (2.6) 2.8 (2.7) 3.2 (3.1) 3.1 (2.9) C 3.0 (2.9) 3.4 (3.2) 3.3 (3.1) P 3.5 (3.3) S Normal and Extreme Limit (within parenthesis) distances (Å) used by Ramachandran co-workers

  21. Fully Allowed Regions Partially Allowed Regions Original Ramachandran Plot

  22. Ramachandran plot for 202 proteins at 1.5A or better resolution

  23. Variation of angle by 5o allowed to fit observed phi-psi of protein structures.

  24. Schrodinger Equation: Quantum Mechanics Time dependent (3 Dimensional) Time independent

  25. DFT formalism with B3LYP Pseudoeigenvalue equation: where Potential due to exchange-correlation, is defined by with a, b and c as parameters obtained from fit with experimental data for sample compounds, Ex are for electron exchange and Ec are for correlation. Essentials of Computational Chemistry by C.J. Cramer (2002) John Wiley & Sons Ltd,

  26. Input data (atom coordinates, basis sets) Generate input guess density (overlap integrals) Construct the potential and Solve Kohn-Sham equation Repeat the cycle using the output density as the input density Generate output densities from Solutions to Kohn-Sham equations NO YES Are input and output density same? Analyze electronic population FLOW CHART DESCRIBING THE DFTMETHODOLOGY

  27. G:C W:W C DE = -26 kcal/mol A:U W:W C DE = -14 G:U W:W C DE = -15 A:G H:S T DE = -10 Strengths of different H-bonds from 33 non-canonical Base Pairs A:G s:s T DE = -6 A:U H:W T DE = -13 A:A H:H T DE = -10 G:A W:W C DE = -15 G:A S:W T DE = -11 A:A W:W T DE = -12 A:U W:W T DE = -13 A:A H:W T DE = -11 2=>NH..N 2=>NH..O 1=>NH..N 1=>NH..O 1=>NH..N 2=>NH..O 1=>NH..O 1=>NH..N 1=>NH..N 1=>CH..O 1=>NH..O 1=>NH..N 2=>NH..N 2=>NH..N 1=>NH..O 1=>NH..N 1=>NH..O 1=>NH..N 2=>NH..N

  28. Considered Energy components, ENHO, ENHN, etc are additive. Additional stabilities, di may come from van der Waals, dipole-dipole etc interactions. Least Squares Fit indicates di, errors should be smallest for best Fit A. Roy, M. Bhattacharyya, S. Panigrahi, D. Bhattacharyya, (2008) J. Phys. Chem. B (in press)

  29. Netropsin like drugs bind in the B-DNA narrow and deep minor groove

  30. Actinomycin D like drugs make their place in between two stacked base pairs by distorting the DNA double helix

  31. DNA kinks by 90o at the dyad location while binding to two subunits of Catabolite Activator Protein (CAP)

  32. TATA-box binding protein transforms the interfacing DNA region to A-DNA like structure

  33. DNA Smooth Curvature induced by Histone proteins in Chromatin (Nucleosome)

  34. Definition and Nomenclature of Base Pair Doublet Parameters

  35. Calculation of Base Pair parameters by NUPARM Local Step Parameters: Mean Local Helix Axis: Zm = XmYm, where Xm = Xaxis1 + Xaxis2 and Ym = Yaxis1 + Yaxis2 M is Base Pair Center to Center Vector Tilt : 2.0 * sin-1 ( -ZmY1) Roll: 2.0 * sin-1 ( ZmX1) Twist: cos-1 (( X1Zm)  ( X2Zm)) Shift (Dx) MXm Slide(Dy) MYm Rise(Dz) MZm

  36. Partial list of DNA crystal structures available at http://ndbserver.rutgers.edu bd0001 12: A C C G A C G T C G G T bd0003 12: A C C G G T A C C G G T bd0004 12: C G C G A A T T C G C G bd0006 10: G G C C A A T T G G bd0011 12: C G C A A A T A T G C G bd0014 12: C G C G A A T T C G C G bd0015 10: C C G C C G G C G G bd0017 9: C G CG C G G A G bd0018 11: G C G A A T T C G C G bd0019 12: G G C G A A T T C G C G bd0022 12: A C C G G CG C C A C A bd0023 10: C C A G T A C T G G Bd0024 10: C C G A A T G A G G

  37. Base-Pair Step Size of Database Tilt Roll Twist Rise G:G 37 -0.24 5.80 30.99 3.46 G:C 106 -0.33 -5.37 38.52 3.32 C:G 157 0.66 3.81 36.26 3.46 A:A 116 -0.01 0.67 35.92 3.21 A:T 54 0.20 -0.60 32.76 3.25 T:A 18 -0.02 0.07 40.39 3.30 A:C 20 -0.37 0.97 32.73 3.43 C:A 47 -0.19 2.17 37.75 3.48 A:G 34 0.16 5.34 31.92 3.44 G:A 55 -0.23 0.52 38.40 3.14 Average Structural Parameters from Crystal Structures

  38. DNA Bending: Experimental and Theory

  39. Curved DNA models built from Crystal parameters (A3G7)n (A10)n (A6G4)n

  40. Bond Angle Deformation q Deformation from equilibrium value q costs energy. Simplest form of energy penalty is: Eq=1/2 k(q-qo)2

  41. Bonds are also stretchable but at a cost of energy Bond Breaking energy

  42. Ethane (three fold symmetry) Ethiline (two fold symmetry)

  43. Between H N O C P S H 2.0 (1.9) 2.4 (2.2) 2.4 (2.2) 2.4 (2.2) 2.65 (2.5) 2.65 (2.5) N 2.7 (2.6) 2.7 (2.6) 2.9 (2.8) 3.2 (3.1) 3.1 (3.0) O 2.7 (2.6) 2.8 (2.7) 3.2 (3.1) 3.1 (2.9) C 3.0 (2.9) 3.4 (3.2) 3.3 (3.1) P 3.5 (3.3) S Normal and Extreme Limiting (within parenthesis) distances (Å) used by Ramachandran co-workers Minimum Energy position: rijo Interaction between Instantaneous Atomic dipoles and Induced Atomic dipoles

  44. Force Field for Biomolecular Simulation

  45. E(Dx, Dy, Dz) E(Dx+1, Dy, Dz) E(Dx+2, Dy, Dz) ….. Search for Conformation with Lowest Energy

  46. Multivariable Optimization: NP-hard Problem • Systematic Grid Search procedure: Impossible, large no. variables • Guided Grid Search: Depends on Choice • Approximate Method based on Taylor series • Newton-Rhapson Method:

  47. Energy Landscape of typical bio-molecules Energy Positional Variables

  48. Conformation 0: Calculate energy (Ei) Alter conformation randomly Calculate energy (Ei+1) Calculate ρ = exp(-(Ei+1-Ei)/kT) If ρ > random no accept the conformation Repeat the procedure Energy Uniformly generated Random numbers are used to accept if exp(-U/kT) > random no and reject otherwise Reject Always Accept Accept

  49. Deterministic Method Molecular Dynamics Verlet Algorithm:

More Related