20 likes | 121 Views
ACATGGAGGAGCTCAATGAGCTCATTTACGACAGAATCCACAACAAGA TGATCTTCTTCTCCCTTTTCGTCCTACTTATCACCAGTAGAAACTGGA TAGAAACATTTCCATTATATTACGTTATCAAGCTAGGCATTCTTCTCT AAACTGAAGTTCTGGTTCAAGAGGAAACACCAAAAGCTCC. EST sequence reads from chromatogram and base calling. ESTs from the database(dbEST).
E N D
ACATGGAGGAGCTCAATGAGCTCATTTACGACAGAATCCACAACAAGA TGATCTTCTTCTCCCTTTTCGTCCTACTTATCACCAGTAGAAACTGGA TAGAAACATTTCCATTATATTACGTTATCAAGCTAGGCATTCTTCTCT AAACTGAAGTTCTGGTTCAAGAGGAAACACCAAAAGCTCC EST sequence reads from chromatogram and base calling ESTs from the database(dbEST) ACATGGAGGAGCTCAATGAGCTCATTTACGACAGAATCCACAACAAGA TGATCTTCTTCTCCCTTTTCGTCCTACTTATCACCAGTAGAAACTGGA TAGAAACATTTCCATTATATTACGTTATCAAGCTAGGCATTCTTCTCT AAACTGAAGTTCTGGTTCAAGAGGAAACACCAAAAGCTCC Vector identification and removal (UniVec/EMVEC) ACATGGAGGAGCTCAATGAGCTCATTTACGACAGAATCCACAACAAGA TGATCTTCTTCTCCCTTTTCGTCCTACTTATCACCAGTAGAAACTGGA TAGAAACATTTCCATTATATTACGTTATCAAGCTAGGCATTCTTCTXX XXXXXXXXXXXXXXXXXXXXXXXXXX Masking of repeats (RepeatMasker) low complexity regions (DUST /nseg) ACATGGAGGAGCTCAATGAGCTCATTTACGACAGAATCCACAACANNN NNNNNNNNNNNNNNCTTTTCGTCCTACTTATCACCAGTAGAAAACTGGA TAGAAACATTTCCATTATATTACGTTATCAAGCTAGGCATTCTTCTXX XXXXXXXXXXXXXXXXXXXXXXXXXX High quality sequence for clustering and assembly
ACATGGAGGAGCTCAATGAGCTCATTTACGACAGAATCCACAACANNN NNNNNNNNNNNNNNCTTTTCGTCCTACTTATCACCAGTAGAAAACTGGA TAGAAACATTTCCATTATATTACGTTATCAAGCTAGGCATTCTTCTXX XXXXXXXXXXXXXXXXXXXXXXXXXX EST 1 ACATGGAGGAGCTCAATGAGCTCATTTACGACAGAATCCACAACANNN NNNNNNNNNNNNNNCTTTTCGTCCTACTTATCACCAGTAGAAAACTGGA TAGAAACATTTCCATTATATTACGTTATCAAGCTAGGCATTCTTCTXX XXXXXXXXXXXXXXXXXXXXXXXXXX EST 2 ACATGGAGGAGCTCAATGAGCTCATTTACGACAGAATCCACAACANNN NNNNNNNNNNNNNNCTTTTCGTCCTACTTATCACCAGTAGAAAACTGGA TAGAAACATTTCCATTATATTACGTTATCAAGCTAGGCATTCTTCTXX XXXXXXXXXXXXXXXXXXXXXXXXXX EST 3 EST clustering, assembly and consensus sequence generation Consensus sequence . : . : . : . : . : . : TVf01_G02+ ACAGAG-ATCGGTGGTGAGGAAAGTGGCAGCTCGAATAGCACCGAGATCGGTGGTGAAGA TVf02_E05+ ACCGAG-ATCGGTGGTGAA TVf01_H11+ ACCGAGCATCGGTGGTGAAGAAAGTG TVf04_F07- ACCGAA-ATCGGTGGTGAAGAAAGTGGCAGCTCGAATAGT TVf03_F07- ACAGAG-ATCGGTGGTGAAGAAAGTGGCAGCTCGAATAGCACCGAGATCGGTGGTGAAGA TVf10_A02+ ACCGAG-ATCGGTGGTGAAGAAAGTG TVf05_D09+ ACCGAA-ATCGGTGGTGAAGAAAGTGGCAGCTCGAATAGCACCGAGATCGGTGGTGAAGA TVf05_E01+ ACCGAN-ATCGGTGGTGAAGAAAGTGGNAGCTCGAA TVf05_C04+ ACCGAG-ATCGGTGGTGAAGAAAGTGGCAGCTCGAATAGCACCGAGATCGGTGGTGAAGA TVf11_F02+ ACCGAG-ATCGGTGGTGAAGAAAGTGGCAGCTCGAATAGCAC TVf03_D12- ACAGAG-ATCGGTGGTGAAGAAAGTGGCAGCTCGAATAGCACCGAGATCGGTGGTGAAGA TVf10_F09+ ACCGAGATCGGTGGTGAAGA TVf11_D02+ ACCGAGATCGGTGGTGAAGA ____________________________________________________________ consensus ACCGAG-ATCGGTGGTGAAGAAAGTGGCAGCTCGAATAGCACCGAGATCGGTGGTGAAGA