210 likes | 227 Views
Explore the potential of DNA implementation for fitness-based separation in evolutionary computation using the Royal Road model. Study the population dynamics and feasibility of implementing the MutS and MutY proteins. Investigate the enormous storage capacity of DNA and its potential for improving computing evolutionary algorithms.
E N D
DNA Implementation of a Royal Road Fitness Evaluation Elizabeth Goode, David Harlan Wood, and Junghuei Chen Ji Yoon Park Dept. of Biochem Hanyang University
Abstract 1. A model for DNA implementation of Royal Road evolutionary computation - for separation by fitness : 2-d DGGE, PAGE 2. Suggestion for possible use of the MutS and MutY - mismatch-binding proteins in combination with gel shift assays for separation by fitness
The Royal Road * A class of evolutionary computations - van Nimwegen et al ▪ The population dynamics of various Royal Road fitness functions ▪ Only a relatively few generation ▪ Don’t support theoretical results on the stasis ▪ Limitation of genetic variation - By implementing Royal Road problems using DNA, ◊ Use populations many others of magnitude larger than the populations available using conventional computers ◊Huge DNA storage capacity permits exploring populations with much greater genetic diversity ◊ Test precisely theoretical predictions van Nimwegen
Focus on… ☞ Fitness-based separation of individuals for a Royal Road problem * DNA model for simulating Royal Road computation - Potential of computing with very large population ! - Separation : 2-d DGGE, PAGE
Evolutionary Algorithms - Population(possibly random) - Selection → fitness function - Reproduction → reproduce the next generation of individuals according to some reproduction strategy which may include mutation and crossover ex)MaxOnes -Begins with a random set of individual bitstrings of 0 and 1, each of length n. -For a given initial population size, the goal is to generation such perfect individuals
Royal Road Fitness Function * Generation of the MaxOnes fitness function - The population : Strings which contain discreteblockswhich are subsequences of bits - Each block is evaluated for fitness ▶ Each block in a given individual bitstring which satisfies its predefined block fitness criterion contributes to the fitness rating of that individual ▶ Any deviations from the required specification fails to contributes the total fitness for the bitstring ▶The sum of the block contributions constitutes the total fitness for the bitstring ▶ Blocks are assigned fitness 1 if they are perfect, and fitness 0 otherwise.
1. Examine the population dynamics in instances of the Royal Road problem ▶The potential of generating previously unobtained information(1012 >) 2. Feasibility of the necessary laboratory stepsfor DNA implementation * The enormous storage capacityof DNA ▶The potential gain in computing evolutionary algorithms using DNA rather than silicon is unprecedented
The preliminary Example for Royal Road Fitness-Proportional Selection * Let A={C, T, G} be working set of symbols * The block is B={C, T} ▶The population of interest is a set of bitstrings of length 88 written over A, each containing 2 blocks written over B of length 6 in bit positions 25-30 and 57-62 ▶ The population contains at most 212 individuals ▶The individuals, once encoded in DNA, must be physically separable by fitness ▶Fitness 1 for each perfect block containing all Ts ▶ A perfect individual contains only T in each of its blocks: fitness 2 ▶Doing selection over the entire population of one generation in one day(possible to treat populations of size 1016)
Principle ◈ Fisher and Lerman(1983); Myers et al (1987); Sheffield et al (1989) ▶When ds DNA migrates through increasing concentrations of urea and formamide, the complementary strands will dissociate in a domain-dependent fashion. ▶The dissociation causes an abrupt decrease in the mobility of the fragment in polyacrylamide gels. ▶The presence of a mutation may change the stability of its local domain and hence alter its pattern of migration.
Experimental ◈ When preparing DNA for analysis by DGGE, ▶PCR is used to attach an ~40 bp G-C clamp to one end of the fragment. *Clamp: highly stable, denaturation-resistant domain ▶ Allow mutations in lower melting domains to be acertained ▶Heteroduplexes between a wild-type strand and a potential mutant strand will be destabilized by the single base-pair mismatch and will migrate more slowly than either homoduplex ▶ Heteroduplexes generated during PCR amplification of heterozygous genomic DNA can greatly assist in the detection of mutations
Strengths/Limitations ◈Advantage: ▶ Used to analyze PCR-amplified, G-C clamped segments of DNA < 500 bp in length. ▶ Best suited to scanning multiple samples for mutations in the same DNA fragment ▶ A change from A/TtoG/C usually increase the stability of the local domain ▶ A change from G/C to A/T usually has a destabilizing effect ◈ Disadvantage: ▶ The exact position and nature of the mutation must be confirmed by DNA sequencing ▶ Requires specialized equipment and a distinctly user-unfriendly computer program, which is needed to select sequences for oligonucleotide primers
Perpendicular 2-d DGGE * Separation by fitness ▶Denaturing gradient gel electrophoresis(DGGE) - ds DNA is moved through the gradient gel environment by electrophoresis - Partial dehybridization of ds DNA in a denaturing environment reduces the mobility of DNA - The different m.p of different seqs ▶differences between the movement of those seqs, even if those seqs are the same length - To determine an optimal denaturing gradient between candidates of different fitness - Separation is verified with PAGE
The Candidate Individuals * Candidate ▶ss DNA consisting of 88 bases each ▶ Each individual strand consists of 5 concatenated seqs of C, G and T ▶ All concatenates of the following five seqs : Clamp1 - Block1 - Clamp2 - Block2 - Clamp3 ▶Clamps are distinct, but constant for all candidates, and have lengths 24, 26 and 26 and G-C rich regions ▶Blocks have length 6, and contain a mixture of C and T, varying among different candidates. ▶ The ‘perfect candidate’ has only T in Block1 and Block2
The candidate strands can be divided - Physically divide candidate strands into equivalence classes * To separation ▶Anneal the various ‘imperfect’ candidatesto target ◊fitness = 0: at least one C in each of B1 and B2 ◊ fitness = 1: one perfect block containing only T, and one imperfect block containing at least one C ◊ fitness = 2(perfect candidate): only T in both B1 and B2 * Clamp: constant for all individuals ▶only one seq associated with a perfect individual
Candidate perfect: 5’ - - GGGCGGCCTCGCCTCCCCTGCTGGTTTTTTCCTTCTCCCTCTGTCGGGCTCGCGTTTTTTTTTTGTTGCTTCGTTTGTCCTTCCGTCC - - 3’ Candidate 2.1: 5’ - - GGGCGGCCTCGCCTCCCCTGCTGG TTTTTT CCTTCTCCCTCTGTCGGGCTCGCGTTCTTTTTTTGTTGCTTCGTTTGTCCTTCCGTCC - - 3’ Candidate 2.6: 5’ - - GGGCGGCCTCGCCTCCCCTGCTGG TTTTTT CCTTCTCCCTCTGTCGGGCTCGCGTT CCCCCCTTGTTGCTTCGTTTGTCCTCCTCC - - 3’ Candidate 1.6-2.6: 5’ - - GGGCGGCCTCGCCTCCCCTGCTGGCCCCCCCCTTCTCCCTCTGTCGGGCTCGCGTT CCCCCCTTGTTGCTTCGTTTGTCCTTCCGTCC - - 3’ Target strand:exact complement of Candidate Perfect 5’ - - CGACGGAAGGACAAACGAAGCAACAA AAAAAA AACGCGAGCCCGACAGAGGGAGAAGG AAAAAA CCAGCAGGGGAGGCGAGGCCGCCC - - 3’ Fitness = 1+ 1 = 2 = 1 + 0 = 1 = 1 + 0 = 1 = 0 + 0 = 0
Separation by Fitness ¶ 2-d DGGE in combination with PAGE ◊Separation of a subset of candidate strands by fitness class ◊ Different candidate strands annealed to the Target strand should run differently according to their fitness ◊Candidateshaving blocks which perfectly anneal to Target strand are predicted to run more quickly through a gel than candidate strands
PAGE Lane 1: 25bp ladder Lane 2: Candidate Perfect/Target Lane 3: Candidate2.1/Target Lane 4: Candidate 2.6/Target
Conclusion “ Can 2-d DGGE and PAGE be used for separating candidates according to fitness in a Royal Road evolutionary computation?” ▶Fitness-based separation - Clamp-block style encoding of individuals is useful for DNA implementation of a Royal Road problem - Verify a complete separation ability for the Royal Road fitness function - 2-d DGGE and PAGE separation : useful for implementing fitness separation for the Royal Road problem and for other evolutionary algorithm