An introduction to gene prediction. Topics: introduction; prokaryotes (start/stop, operons); eukaryotes (start/stop, promoter/polyA, introns/exons/UTR); problems (pseudogenes, alternative splicing, RNA genes, repeats/CpG islands); methods (HMMs, neural networks, compositional bias, synteny); programs.
Content • Introduction • Prokaryotes: start/stop, operons • Eukaryotes: start/stop, promoter/polyA, introns/exons/UTR • Problems: pseudogenes, alternative splicing, RNA genes, repeats/CpG islands • Methods: HMMs, neural networks, compositional bias, synteny • Programs
Signals in Prokaryotes • Transcription start/stop: -35 region, TATA box • Translation start/stop: ORFs, Shine-Dalgarno motif, start ATG/GTG, stop TAA/TAG/TGA • Stem-loops • Operons • Few special cases: introns, inteins, slippage
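The start and stop codons listed for prokaryotes are enough to sketch a naive ORF scan. The example below is only an illustration under stated assumptions: it reads the forward strand only, ignores the Shine-Dalgarno motif and every other signal, and the function name `find_orfs` and the `min_codons` threshold are hypothetical choices, not part of the original slides.

```python
# Start and stop codons as listed on the slide.
STARTS = {"ATG", "GTG"}
STOPS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=2):
    """Return (start, end, frame) tuples for forward-strand ORFs.

    `end` is exclusive and includes the stop codon. `min_codons` is a
    hypothetical length filter counting start and stop codons.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        # Walk the sequence codon by codon in this reading frame.
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon in STARTS:
                start = i  # first start codon opens the ORF
            elif start is not None and codon in STOPS:
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # look for the next ORF in this frame
    return orfs


if __name__ == "__main__":
    print(find_orfs("CCATGAAATAGGG"))
```

A real gene finder would also scan the reverse complement and score candidate ORFs rather than report every one.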
Signals in Eukaryotes • Transcription: promoter/enhancer/silencer, TATA box, introns/exons (donor/acceptor/branch), polyA, repeats (Alu, satellites, expansions), CpG islands, cap/CCAAT and GC boxes • Translation: 5’ and 3’ UTR, Kozak consensus, start ATG, stop TAA/TAG/TGA
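As one illustration of a translation signal, the context of a start ATG can be checked against the Kozak consensus. The slides do not spell the consensus out; the sketch below assumes the commonly cited form gccRccATGG and the usual rule of thumb that a purine at -3 and a G at +4 matter most. The function name `kozak_context` and the three labels are hypothetical.

```python
def kozak_context(seq):
    """Classify every ATG by its two key Kozak positions.

    Assumed rule of thumb (not from the slides): purine (A/G) at -3 and
    G at +4, counting the A of ATG as +1. Two hits are labeled "strong",
    one hit "adequate", none "weak".
    """
    seq = seq.upper()
    labels = ("weak", "adequate", "strong")
    results = []
    pos = seq.find("ATG")
    while pos != -1:
        # Positions outside the sequence simply count as no match.
        minus3 = seq[pos - 3] if pos >= 3 else None
        plus4 = seq[pos + 3] if pos + 3 < len(seq) else None
        hits = int(minus3 in ("A", "G")) + int(plus4 == "G")
        results.append((pos, labels[hits]))
        pos = seq.find("ATG", pos + 1)
    return results


if __name__ == "__main__":
    print(kozak_context("GCCACCATGGCG"))
```

This only ranks candidate start codons; it says nothing about whether the ATG is actually used.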
Intron/exon splicing • Consensus ("/" marks the exon/intron boundary) • Donor • (A,C)AG/GT(A,G)AGT • Acceptor • TTTTTNCAG/GCCCCC • Branch • CT(G,A)A(C,T)
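The consensus strings above can be turned into regular expressions for a first-pass scan. The sketch below is a minimal illustration: it requires an exact match to the consensus, which is far stricter than real splice sites, and the helper names `compile_consensus` and `find_sites` are hypothetical.

```python
import re

# Consensus strings copied from the slide; "/" is the splice boundary.
CONSENSUS = {
    "donor": "(A,C)AG/GT(A,G)AGT",
    "acceptor": "TTTTTNCAG/GCCCCC",
    "branch": "CT(G,A)A(C,T)",
}


def compile_consensus(pattern):
    """Translate slide notation into (compiled regex, boundary offset)."""
    regex, offset, length = "", None, 0
    for token in re.findall(r"\([ACGT,]+\)|[ACGTN/]", pattern):
        if token == "/":
            offset = length  # boundary sits before the next base
            continue
        if token == "N":
            regex += "[ACGT]"
        elif token.startswith("("):
            regex += "[" + token[1:-1].replace(",", "") + "]"
        else:
            regex += token
        length += 1
    # A lookahead lets overlapping candidate sites all be reported.
    return re.compile("(?=(" + regex + "))"), offset


def find_sites(seq, kind):
    """Return boundary positions (or match starts if there is no "/")."""
    regex, offset = compile_consensus(CONSENSUS[kind])
    return [m.start() + (offset or 0) for m in regex.finditer(seq.upper())]


if __name__ == "__main__":
    print(find_sites("GGCAGGTAAGTCC", "donor"))
```

For the donor, the reported position is the first intron base (the G of GT); for the acceptor, it is the first exon base.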
Pseudogenes • Promoter loss, stop codons, frameshifts • Translocation, duplication
RNA genes and other problems • rRNA (ribosome) • tRNA (transfer) • snRNA (splicing) • tmRNA (transfer-messenger) • Repeats (Alu, satellites, expansions, etc.) • CpG islands
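CpG islands are usually spotted with two simple statistics: GC content and the observed/expected CpG ratio. The sketch below computes both for one sequence window; the formula (CpG count × length) / (C count × G count) is the standard one, but it is not given in the slides, and the function name `cpg_stats` is hypothetical. Commonly quoted cut-offs (GC content above 50%, ratio above 0.6, length of at least 200 bp) are likewise an outside assumption.

```python
def cpg_stats(seq):
    """Return (gc_content, observed/expected CpG ratio) for a sequence.

    The ratio is (CpG count * length) / (C count * G count); it is
    reported as 0.0 when the sequence has no C or no G.
    """
    seq = seq.upper()
    n = len(seq)
    c = seq.count("C")
    g = seq.count("G")
    cpg = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    gc_content = (c + g) / n if n else 0.0
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return gc_content, obs_exp


if __name__ == "__main__":
    print(cpg_stats("CGCGCGCG"))
```

In practice the function would be applied to sliding windows along a chromosome rather than to a whole sequence at once.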
Methods • Signals • Statistics (compositional bias) • HMMs • Neural networks • Homology/Synteny
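To show how compositional bias and HMMs fit together, here is a toy two-state HMM decoded with the Viterbi algorithm. Every number in it is made up for illustration: the states, the transition probabilities and the GC-rich versus AT-rich emission tables are assumptions, not parameters from the slides or from any trained gene finder.

```python
import math

STATES = ("gene", "intergenic")

# Illustrative, made-up parameters (assumptions, not a trained model).
START = {"gene": 0.5, "intergenic": 0.5}
TRANS = {
    "gene": {"gene": 0.9, "intergenic": 0.1},
    "intergenic": {"gene": 0.1, "intergenic": 0.9},
}
EMIT = {
    "gene": {"A": 0.15, "C": 0.35, "G": 0.35, "T": 0.15},        # GC-rich
    "intergenic": {"A": 0.35, "C": 0.15, "G": 0.15, "T": 0.35},  # AT-rich
}


def viterbi(seq):
    """Return the most probable state path (log-space Viterbi).

    Assumes the sequence contains only A, C, G and T.
    """
    seq = seq.upper()
    if not seq:
        return []
    score = {s: math.log(START[s]) + math.log(EMIT[s][seq[0]]) for s in STATES}
    back = []  # one dict of back-pointers per position after the first
    for base in seq[1:]:
        new_score, pointers = {}, {}
        for s in STATES:
            # Best predecessor state for landing in state s.
            prev = max(STATES, key=lambda p: score[p] + math.log(TRANS[p][s]))
            pointers[s] = prev
            new_score[s] = (score[prev] + math.log(TRANS[prev][s])
                            + math.log(EMIT[s][base]))
        score = new_score
        back.append(pointers)
    # Trace back from the best final state.
    state = max(STATES, key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]


if __name__ == "__main__":
    print(viterbi("ATATATATGCGCGCGC"))
```

Real gene-finding HMMs use many more states (codon positions, splice sites, UTRs) and parameters estimated from annotated genomes.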
Programs • See Lorenzo Cerutti's presentation