950 likes | 1.43k Views
Serkan Apaydın. Normal mode analysis (NMA) tutorial and lecture notes by K. Hinsen . Protein flexibility. Frequency spectrum of a protein
E N D
Serkan Apaydın Normal mode analysis (NMA) tutorial and lecture notes by K. Hinsen
Protein flexibility Frequency spectrum of a protein Over half of the 3800 known protein movements can be modelled by displacing the studied structure using at most two low-frequency normal modes. Gerstein et al. 2002
Outline • NMA • What it is • Vibrational dynamics • Brownian modes • Coarse grained models • Essential dynamics
Harmonic approximation Energy (U) 0 Rmin Conformation (r)
Harmonic approximation U 0 Rmin r U(r) =0.5(r − Rmin)’ · K(Rmin) · (r − Rmin)
NMA U(r) =0.5(r − Rmin)’ · K(Rmin) · (r − Rmin)
NMA Normal mode direction 1 U(r) =0.5(r − Rmin)’ · K(Rmin) · (r − Rmin)
NMA Normal mode direction 2 -e2 U(r) =0.5(r − Rmin)’ · K(Rmin) · (r − Rmin)
NMA (2) U(r) =0.5(r − Rmin)’ · K(Rmin) · (r − Rmin) min min O(n3)
Properties of NMA • The eigenvalues describe the energetic cost of displacing the system by one length unit along the eigenvectors. • For a given amount of energy, the molecule can move more along the low frequency normal modes • The first six eigenvalues are 0, corresponding to rigid body movements of the protein
4 ways of doing NMA A. Using minimization to obtain starting conformation, and computing the Hessian K: • Vibrational NMA • Brownian NMA B. Given starting structure: • Coarse grained models C. Given set of conformations corresponding to the motion of the molecule: • Essential Dynamics
1. Vibrational NMA • derived from standard all-atom potentials by energy minimization • time scale: < residence time in a minimum • appropriate for studying fast motions • Useful when comparing to spectroscopic measurements • Requires minimization and Hessian computation
2. Brownian NMA • derived from standard all-atom potentials by energy minimization • time scale: > residence time in a minimum • appropriate for studying slow motions • Requires minimization and Hessian computation
The friction coefficients • describe energy barriers between conformational substates • Can be obtained from MD trajectories (<xi2>) • Depend on local atomic density (not a solvent effect) http://dirac.cnrs-orleans.fr/plone/Members/hinsen/
3. Coarse grained models • Around a given structure • time scale: >> residence time in a minimum • appropriate for studying slow, diffusive motions (jump between local minima) • Does not require expensive minimization and Hessian computation
3. Coarse grained models (2) • Capture collective motions • Specific to a protein • Usually related to its function • Largest amplitudes • Atoms are point masses • Springs between nearby points
Coarse grained models (3) f can be a step function or may have an exponential dependence. Elastic network model NMA (aka ANM) Find Hessian of V, then eigendecomposition Gaussian network model
Coarse grained models (4) All atom or C-alpha based models… • Or a step function…
Equilibrium fluctuations Ribonuclease T1 Disulphide bond facilitator A (DsbA) Gaussian network model: Theory and applications. Rader et al. (2006)
Difference between ENM NMA and GNM • GNM more accurate in prediction of mean-square displacements • GNM does not provide the normal mode directions
Lower resolution models • Groups of residues clustered into : • unified sites • Rigid blocks (rotation and translation of blocks (RTB) model) • To examine larger biomolecular assemblies G Li, Q Cui - Biophysical Journal, 2002
4. Essential dynamics • Given a set of structures that reflect the flexibility of the molecule • Find the coordinates that contribute significantly to the fluctuations • time scale: >> residence time in a minimum
Essential Dynamics(2) <r> = R <(r − R) (r − R)'> =kBT inv(K) Angel E. García, Kevin Y. Sanbonmatsu Proteins. 2001 Feb 15;42(3):345-54.
Essential dynamics(3) • Cannot capture the fine level intricacies of the motion • Freezing the small dofs make small energy barriers insurmountable • Need to run MD for a long time in order to obtain sufficient samples 38, 150, 199 dofs
Applications of normal modes • Use all modes or a large subset • Analytical representation of a potential well • Limitations: • approximate nature of the harmonic approximation • Choice of a subset • Properties of individual modes • Must avoid overinterpretation of the data • E.g., discussing differences of modes equal in energy • No more meaningful than discussing differences between motion in an arbitrarily chosen Cartesian coord. system
Applications of normal modes (2) • Explaining which modes/frequencies are involved in a particular domain’s motion • Answered using projection methods: • Normal modes form a basis of the config. space of the protein • Given displacement d, pi= d · ei • Contribution of mode i to the motion under consideration • Cumulative contribution of modes to displacement
Cumulative projections of transmembrane helices in Ca-ATPase
Amplitude Time scale Starting structure Practical Vibrational Small Short By Minimization N Brownian Large Long By Minimization Y/N Coarse grained Large Long Given Y Essential Large Long Given N Comparison chart
Summary NMA: • no sampling problem • computational efficiency, especially for coarse-grained models • simplicity in application • Predicts experimental quantities related to flexibility, such as B-factors, well.
WebNM: (C-alpha based) http://www.bioinfo.no/tools/normalmodes
http://promode.socs.waseda.ac.jp/pages/jsp/index.jsp (all-atom)
http://lorentz.immstr.pasteur.fr/nomad-ref.php (all atomic or just C-alpha)
Serkan Apaydın Protein Flexibility Predictions Using Graph TheoryJacobs, Rader, Kuhn and ThorpeProteins: Structure, function and genetics 44:150-165 (2001)
Characterizing intrinsic flexibility and rigidity within a protein • Compares different conformational states Limited by the diversity of the conformational states
Characterizing intrinsic flexibility and rigidity within a protein • Simulates molecular motion using MD Limited by the computational time
Characterizing intrinsic flexibility and rigidity within a protein • Identifies rigid protein domains or flexible hinge joints based on a single conformation Can provide a starting point for more efficient MD or MCS
Outline • The main idea: constraint counting • Brute force algorithm • Rigidity theory • Pebble game analysis • Rigid cluster decomposition • Flexibility Index • Examples
Overview of FIRST • Floppy Inclusion and Rigid Substructure Topography • Given constraints: • Covalent bonds • hydrogen bonds • Salt bridges • Evaluate mechanical properties of the protein: Find regions that are: • rigid • move collectively • move independently of other regions Compute a relative degree of flexibility for each region
Rigidity in Networks – a history • 1788: Lagrange introduces constraints on the motions of mechanical systems • 1864: Maxwell determined whether structures are stable or deformable applications in engineering, such as the stability of truss configurations in bridges • 1970: Laman’s theorem: determines the degrees of freedom within 2D networks and allow rigid and flexible regions to be found extended to bond-bending networks in 3D http://unabridged.m-w.com
Brute force algorithm to test rigidity INDEPENDENT ORACLE REDUNDANT
Brute force algorithm to test rigidity INDEPENDENT ORACLE REDUNDANT • Compute normal modes w/ and w/o the constraint • If the number of zero eigenvalues remains constant, then the constraint is redundant. O(n2 .n3) O(n5) Complexity?
Laman’s theorem accelerates constraint counting • Constraint counting to all the subgraphs • Applying directly, complexity is O(exp(n)) • Applying recursively, pebble game algorithm. Complexity is O(n2), O(n) in practice.
Pebble Game • 3 pebbles per node • Each edge must be covered by a pebble if it is independent • Pebbles remaining with nodes are free and represent DOFs of the system • An edge once covered should stay covered but pebbles can be rearranged.
Mykyta Chubynsky and M. F. Thorpe Arizona State University The Pebble Game: A Demonstration