530 likes | 680 Views
Mechanics Inspired Bioinformatics: Predicting the Function of Eukaryotic Scaffold/Matrix Attachment Region (SMAR) by Single Molecule DNA Mechanics. International Workshop on Continuum Modeling of Biomolecules Sept. 14-16, 2009 in Beijing, China Zhong-can Ou-Yang
E N D
Mechanics Inspired Bioinformatics: Predicting the Function of Eukaryotic Scaffold/Matrix Attachment Region (SMAR) by Single Molecule DNA Mechanics International Workshop on Continuum Modeling of Biomolecules Sept. 14-16, 2009 in Beijing, China Zhong-can Ou-Yang Institute of Theoretical Physics Chinese Academy of Sciences Beijing 100080, oy@itp.ac.cn
Outline: I. Stretching single molecule DNA II. Mechanics-inspired Bioinformatics :An example S/MARson Eukaryotic Chromosome, predicting the location and function
I. Stretching single molecule DNA In the past decade Physical techniques such as hydrodynamic drag [4], magnetic beads [5], optical tweezers [6], glass needles [7] and AFM [8,9] offer the opportunity to study DNA/RNA and protein mechanics with single molecules. [4] J. T. Perkins, D. E. Smith, R. G. Larson, S. Chu, Science 268 (1995) 83-87 [5] S. B. Smith, L. Finzi, C. Bustamantl, Science 258 (1992) 1122-1126 [6] S. B. Smith, Y. Cui, C. Bustmantl, Science 271 (1996) 795-799 [7] P. Cluzel et al., Science 271 (1996) 792-794 [8] M . Rief, H. C.-Schauman, H. E. Gaub, Nat. Struct. Biol. 6 (1999) 346-349 [9] David J. Brockwell et al., Nat. struct. Biol. 10 (2003) 731
Stretching double-stranded DNA can be treated as a uniform polymer
f R b * Two classical models: ** Freely jointed chain (FJC) ** Worm-like chain (WLC) As a Hookian spring with Hooke's constant
• Our Model * Introduction of a new structural parameter, the folding angle . * Without consideration of force-induced melting and nick.
* Two backbones , * Central axis , • Mathematically • Bending energy * decomposed into the bending energy of central axis plus folding energy
• Our Model * Introduction of a new structural parameter, the folding angle . * Without consideration of force-induced melting and nick.
16 base-pair stacking potentials
* The asymmetric Lennard-Jones potential ensures relaxed DNA in B-form, a right-handed double-helix. * * is harmonic in low force/extension regimes. (FJC and WLC) * strongly prevent right-handed overtwist, weakly so for left-handed one and allow a torque-induced B-to-Z-form transition in dsDNA Biophys. J. 78(2000) 1979-1987
• The Potential of External Force • The Energy of External Torque T
• Total Elasticity Energy Particle moves in a field A and a potential V if look s as time t
* * Free energy: * • Polymer Dynamics and Path Integral Method * Partition function: * “Schrödinger equation”
• Extension/Force of is Hermitian, it is real Schrödinger Eq. Extension:
Extension of Torsionally Relaxed DNA Zhou, Zhang, Ou-Yang, PRL, 82, 4560(1999)
Above calculationss are interesting for pure theoretical physicists but not for biologists and IT scientists. Both they are interested in the information and function hided in their sequence (AGCT….). The Bioinformatics is based on pure statistic mathematics, our propose is a Mechanics-Inspired Bioinformatics.
II. Mechanics-inspired Bioinformatics :An example S/MARson Eukaryotic Chromosome, predicting the location and function 4 types of nucleotides: Adenine, Guanine, Thymine, Cytosine Watson-Crick base pair: A-T, G-C Intrinsic right-handed helix (torsional state) B-DNA: uniform, sequence-independent 4-lettertext: …ATTTTAATGTCATGATAAAGTTACTTCCTTTTTTTTTAAGTTACTTCTATAATATATGTAAATTACTTTTAATCTCTACTGAAATTACTTTTATATATCTAAGAAGTATTTAGTGAAATCTAAAAGTAATTTAGATATAATATAAAAGTAATTTGTATTTTTTTCATCAAAATATAATCATGTGAGACCTTGTTATAAAGATTTAA…
Elasticity Plays the Key Role… ! • DNA: ~ centimeters (human cell 2meters) • DNA in lily cell 30 meters. • Nucleus: ~ microns • compaction ratio: ~1/8000 • DNA must undergo significant mechanical force in the nucleus • The elastic response is vital for DNA
Chromosome Assembly Chromatin Loop Model • compaction ratio: ~ 1/8000 • considerable force exerted on DNA (stretching, bending and twisting) • S/MARs: topologically independent domains basement of chromatin loops S/MAR (Scaffold/Matrix Attachment Region)
H-Bond Broken Chirality Variable bubble cruciform Structure Heterogeneity Induced by Mechanical Force: Secondary Structures
How to predict SMAR location and function ? it’s difficult in the framework of conventional bioinformatics methods because there is very little similarity among SMAR sequences, thus sequence comparison cannot work well.
S/MARs have been observed to adopt noncanonical DNA structures, bubble configuration (stress-induced unwound elements *) *Bode J., et al., Science, 1992, 255: 195-197 Standard B-form DNA Local bubble
The unwinding stress can induce the formation of local bubbles Lk=Wr+Tw, writhing number—axis self-linking number, Tw—inter-winding number of two strands
topological parameters for ds-DNA • Lk: linking number, number of helical turns when DNA is imposed in planar conformation • Lk0: linking number of relaxed ds-DNA. Lk0= N/10.5 • Tw: twisting number, number of helical turns • Wr : writhing number, coiling times of the central axis (supercoiling). for planar conformation, Wr = 0 • σ: superhelical density, defined as (Lk – Lk0)/ Lk0 σ< 0, negative supercoiling ;σ> 0, positive supercoiling • For eukaryotes, σ~ - 0.06 • σ*Lk0 = Lk – Lk0 = △Tw (r, r’) + △Wr (r)
Can we make the prediction on bubbles (S/MARs) by taking account of the unwinding stress, i.e., the energy corresponding toσ(~ -0.06 ) ?
2N configurations {…10111111100…} = 0 … base paried local bubble = 1 … base unparied : base unparing energy : rewinding angle of the denatured region a: initiation energy of bubble formation A : 10.5 bp per helical turn of B-DNA σ : superhelical density Bubble Formation is Sequence Dependent Benham Model total change in twisting turns upon bubble formation Bauer WR, Benham CJ., J Mol Biol. 1993, 234(4):1184-96.
Benham Model • total energy • twisting energy of DNA • interwinding energy of the two strands in bubble regions • unpairing energy in bubble (sequence dependent ) • initiation energy of bubble formation from the intact helix, The boundary energy between bubble and B-form, interface energy
dE/dt=0 Base-stacking Energy form:
H(n), Hj(n) calculated by transfer matrix method (e.g., circular DNA) sj=0 sj=1 Constrains on specific sites can be realized as following: (sk= 0)
The following calculation is indeed insensitive to the parameters except the difference between bAT and bGC Different unpairing energy
Unpairing Probability Profile Benham Model M. Li, Z.C. Ou-Yang, Thin Solid Film, 499:207-212 (2006) Unpairing Probability for any base pair
M.Li, Z.C. Ou-Yang, J. Phys:Condens. Matter 17 S2853-S2860 (2005) Nucleosome: Core of 8 histone molecules:2(H3—H4—H2A—H2B)—link H1 Drosophila melanogaster: Real DNA Sequence: Histone Gene Cluster
Arrow: transcriptional direction 5- —H3—H4—H2A—H2B—H1— -3 MAR MAR Drosophila melanogaster: Real DNA Sequence: Histone Gene Cluster Experimentally find a SMAR for the cluster of the above five genes with known DNA sequence (X14215, NCBI), calculation with 2 repeats
Result shows nicely: Where Are They ? • The position of the two distinct peaks coincide with the identified S/MARt DB (SM0000037) • S/MAR identified between H1 and H3 • The two SMARs define a single structure unit
5—H3—H4—H2A—H2B—H1—3 Take out Flanking SMARs, find new bubbles: Why They Are There? Long Range Allosteric Effect (LRAE) play the role… • Flanking SMARs as barriers to retain the unwinding stress • Possible LRAE: SMARs fixation onto the matrix induces unpairing events elsewhere • Function Unit: the new unpairing events may play a role in • transcriptional termination between H4- • (weaker SMAR ?)
Summary • Unwinding stress induces strong bubbles (SMARs) • (strong) SMARs may inversely function in gene regulation by protecting the unwinding stress on the chromatin loop • chromatin loop as both structure and function unit • Mechanics analysis is hopefully a new approach complementary to sequence analysis, especially on the study of DNA function
topological parameters for ds-DNA • Lk: linking number, number of helical turns when DNA is imposed in planar conformation • Lk0: linking number of relaxed ds-DNA. Lk0= N/10.5 • Tw: twisting number, number of helical turns • Wr : writhing number, coiling times of the central axis (supercoiling). for planar conformation, Wr = 0 • σ: superhelical density, defined as (Lk – Lk0)/ Lk0 σ< 0, negative supercoiling ;σ> 0, positive supercoiling • For eukaryotes, σ~ - 0.06 • σ*Lk0 = Lk – Lk0 = △Tw (r, r’) + △Wr (r)
one strand Central axis of dsDNA local frame DNA Topology : Ribbon Model Circular dsDNA: topological invariant Lk (r, r’) = Tw (r, r’) + Wr (r) Ribbon (r, r’): central axis + one strand
Adapted from: Wang, J.C. 1991. DNA topoisomerases: why so many? Journal of Biological Chemistry 266:6659-6662.
Some geometrical parameters to characterize ds-DNA • The double-helical DNA taken as a flexible ladder with rigid rungs of fixed length 2R. • Central axis R0 (s), its arc length denoted as s. Thetangent vector of R0 (s) denoted ast • The two strands R1(s),R2 (s).The tangent vector of R1(s),R2(s) denoted ast1 , t2. • The distance between nearest rungs: along R1(s) or R2(s): r0 , fixed and along R0(s): U, variable • The folding angle between t and t1 (or t2):. ~ 57o for standard B-DNA
a word about twist: given the link shown below, the twist tells us basically which component ‘wraps around’ which.