150 likes | 175 Views
RNA Folding Kinetics. Bonnie Kirkpatrick bkirk@cns.montana.edu. Dr. Nancy Amato, Faculty Advisor Guang Song, Graduate Student Advisor. Overview. Problem RNA Folding Landscapes Approach: Roadmap Techniques Future Work. Problem. RNA is a sequence of nucleotides (A,U,G,C)
E N D
RNA Folding Kinetics Bonnie Kirkpatrick bkirk@cns.montana.edu Dr. Nancy Amato, Faculty Advisor Guang Song, Graduate Student Advisor
Overview • Problem • RNA Folding Landscapes • Approach: Roadmap Techniques • Future Work
Problem • RNA is a sequence of nucleotides (A,U,G,C) • After being manufactured, the primary sequence folds into some conformation of secondary structures called the nativestate. • This native state can be predicted quite well by finding the most stable conformation, which is the one with the lowest energy. • The specific path the sequence takes as it folds into the native state is not known.
RNA Secondary Structure • Described by base pair interactions (a.k.a. contact pairs) • Watson-Crick Pairs: A, U and G, C • Wobble Pair: G, U • Defn: Valid Secondary Structure Configurations • Given any two contact pairs [i, j] with i < j • and [i’, j’] with i’ < j’, then: • i = i’ if and only if j = j’ • Each nucleotide can appear in only one contact pair • If i’ < j, then i < i’ < j’ < j • No pseudo-knots are allowed
RNA Folding Landscapes • A folding landscape contains points for every possible valid secondary structure configuration. • Folding pathways can be obtained from the folding landscape. • Existing techniques for mapping landscapes are limited to relatively short sequences (~200 nucleotides). • A robotics motion planning technique called PRM has successfully been applied to protein folding. • Our goal: to apply PRM to RNA folding landscapes.
goal C-obst C-obst C-obst C-obst C-obst start Approach: Probabilistic Roadmap Methods • Purpose: to find a feasible path for movement through an environment (configuration space) while avoiding obstacles. 1. Generate roadmap nodes 2. Generate roadmap edges 3. Add start and goal to the map 4. Query path between start and goal
Node Generation • Let U be the power set of every possible contact pair. • Let C-space, C, be the sub-set of U containing valid combinations contact pairs. • C-space is very large. • Sequence: (ACGU)10 • Length: 40 nucleotides • C-Space: 1.6x108 structures • Purpose: generate nodes in C-space that describe the space without covering it
Node Generation (cont.) • Generation • Starting with an empty configuration, c, contacts are added to c one at a time. • Each step preserves the condition that c contains a valid set of base pair contacts. • Contacts are added until there are no more contacts that do not conflict with the contact set of c. • Evaluation • Energy determines how good a node is. • Energy function • Computed by examining the types and sizes of secondary structures • Reject high energy nodes
… C1 = S1 S2 Sn-1 Sn = C2 Node Connection • Given two nodes in C-Space, C1 and C2, find a path between them consisting of a sequence of nodes: { C1 = S1, S2, …, Sn-1, Sn = C2 } • The path must have the property that for each i, 1 < i<n, the set of contact pair of Si differs from that of Si-1 by the application of one transformation operation. • The allowed set of operations that perform one transformation is called the move-set. • Our move-set operates on configuration by (1) opening or (2) closing a single contact pair.
Node Connection (cont.) • Given any two nodes in C-Space, there exists a path connecting them. • Not just any path will do; we want a good one. • Bad paths have high energy nodes in them. • The best path will have connect the nodes along the lowest energy route. • How do we find a good path?
Node Connection (cont.) • The more contacts a node has, the lower its energy will be. • Choose a path that contains nodes with as many contacts as possible. • Heuristic: if a contact is opened by the transition from one node to another, try to close a contact in the next transition
Future Work • Analysis of the roadmap • Finding the low energy folding pathways with a shortest path algorithm • Validation • How do we know if our results agree with experimental results? • Compare a fully enumerated roadmap to experimental folding rates • Compare a more sparse roadmap with a fully enumerated roadmap