290 likes | 420 Views
Biocomputation : Comparative Genomics. Tanya Talkar Lolly Kruse Colleen O’Rourke. DNA . Junk DNA. Conserved DNA. What is Biocomputation ?. Four Main Parts. Biomolecular computation Biological Computation Computational Biology Bioinformatics. Bioinformatics: . Sequence Analysis.
E N D
Biocomputation: Comparative Genomics Tanya Talkar Lolly Kruse Colleen O’Rourke
Junk DNA Conserved DNA
Four Main Parts • Biomolecular computation • Biological Computation • Computational Biology • Bioinformatics
Sequence Analysis • Very Functional! • Compare DNA between Species • Small Fragments • Return full sequence
Computational Genomics • Needleman – Wunsch • Not used much • More Mapped Genomes = Computational Genomics!
Global Alignment:Needleman - Wunsch • O(N3) • Fewest edit operations • Similar strings
Local AlignmentSmith - Waterman • O(N2) • Dissimilar strings • Find high similarity regions
B C A X A S X B A C S A
BLAST • Basic Local Alignment Search Tool • FASTA
Improvements • Increased Speed • Locate initial alignment hot spots • Statistical significance
Terminology • Segment Pairs • Locally maximal segment pairs • Maximal segment pairs
How it works • Query sentence, P • Database • Must have score over C! • Multiple segment pairs combined
How it works • Extends each hit • Done efficiently • Truncates • Doesn’t find all pairs
Proteins • Fixed length,W • Words above threshold • Each hit extended
DNA • Word List • Exact matches • NOT dynamic programming
Scoring • Blosum62 Matrix • Match (+2), Mismatch (-3), Gaps penalized
Substitution Matrix • Represents Scoring Functions
Methods of MSA • Progressive Alignment Construction • Iterative Methods • Hidden Markov Models • Genetic Algorithms and Simulated Annealing
Comparative Genomics • Compare Species • Find Evolutionary Significances! • Low Level • High Level • Importance of Non Coding DNA