Survey of Misannotations and Pseudogenes in the Arabidopsis Genome
Tanmay Prakash
Objectives
• Find possible misannotations
• Find possible pseudogenes
Why
• Misannotation can hinder research
• Pseudogenes can be used to study natural selection
Misannotations
[Figure: gene model with labeled UTR, CDS, intron, CDS, and UTR regions]
Many misannotations arise when a gene prediction program labels a region as an intron because it contains a stop codon.
Pseudogenes
• Pseudogenes are DNA sequences that no longer function but resemble the functional genes they once were. There are two types:
  • Processed
  • Non-processed
• Common properties of pseudogenes
  • Stop codons
  • Frameshift mutations
  • Lack of selective pressure
Example (stop codons shown as "."): deleting a single base shifts the reading frame and introduces stop codons.
agtacatgcataggactcgatcgactc  translates to  STCIGLDRL
agtacatgataggactcgatcgactc   translates to  ST..DSID
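The two sequences on this slide can be checked with a few lines of Python. This is a minimal sketch, not part of the original pipeline: it assumes the standard genetic code and translation from the first base, and the names `translate`, `functional`, and `pseudogene` are illustrative only. Stop codons are printed as `*` rather than the `.` used on the slide.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA starting at the first base; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Sequences copied from the slide; the second lacks one 'c' (a frameshift).
functional = "agtacatgcataggactcgatcgactc"
pseudogene = "agtacatgataggactcgatcgactc"

print(translate(functional))  # STCIGLDRL
print(translate(pseudogene))  # ST**DSID
```

The single deletion changes every codon downstream of it, which is why a frameshift so often produces premature stop codons.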
Pipeline
• BLAST search (query: protein domains; subject: Arabidopsis introns) -> genes matching in introns
• HMMER search (query: protein domains; subject: Arabidopsis CDS) -> genes matching in CDS
• Genes matching in both -> possibly misannotated genes
• Check for stop codons and frameshifts, then check Ka/Ks -> possible pseudogenes
• BLAST search (query: protein domains; subject: Arabidopsis introns) -> genes matching in introns
• HMMER search (query: protein domains; subject: Arabidopsis CDS) -> genes matching in exons
Genes matching in both -> possibly misannotated genes
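The "matching in both" step amounts to a set intersection. The sketch below assumes the search results have already been parsed into (gene ID, domain name) pairs; that input format, the function name `genes_matching_in_both`, and the example hits are all hypothetical and are not data from the study.

```python
def genes_matching_in_both(intron_hits, exon_hits):
    """Return genes that hit the same domain in an intron and in an exon.

    Each argument is an iterable of (gene_id, domain_name) pairs.
    """
    shared = set(intron_hits) & set(exon_hits)
    return sorted({gene for gene, _domain in shared})


# Made-up hits for illustration only.
intron_hits = [("AT1G00001", "Pkinase"), ("AT1G00002", "LRR_1"), ("AT1G00003", "WD40")]
exon_hits = [("AT1G00001", "Pkinase"), ("AT1G00002", "F-box"), ("AT1G00004", "WD40")]

candidates = genes_matching_in_both(intron_hits, exon_hits)
print(candidates)  # ['AT1G00001']
```

Intersecting on the (gene, domain) pair rather than on the gene alone keeps only genes where the same domain appears on both sides, which is the condition the slide describes.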
Results
• 346 genes (alternative gene models not counted) had matches to the same domain in both introns and exons.
• 299 genes (alternative gene models not counted) had matches to the same domain in an intron and its flanking exons. These are most likely misannotations.
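One way the flanking-exon count could be computed is sketched below. Everything here is an assumption made for illustration: the hit format (gene model, domain, feature kind, feature index), the convention that intron i lies between exon i and exon i + 1, the reading of "flanking" as a hit in at least one adjacent exon, and the practice of collapsing models such as `AT1G00001.1` and `AT1G00001.2` to one gene by dropping the suffix. The example hits are invented.

```python
from collections import defaultdict


def flanking_candidates(hits):
    """Return genes with a domain hit in an intron and in an adjacent exon.

    hits: iterable of (gene_model, domain, kind, index), where kind is
    'intron' or 'exon'. Assumes intron i lies between exon i and exon i + 1.
    """
    introns = defaultdict(set)
    exons = defaultdict(set)
    for model, domain, kind, index in hits:
        target = introns if kind == "intron" else exons
        target[(model, domain)].add(index)

    genes = set()
    for key, intron_idx in introns.items():
        exon_idx = exons.get(key, set())
        if any(i in exon_idx or i + 1 in exon_idx for i in intron_idx):
            genes.add(key[0].split(".")[0])  # count alternative models once
    return sorted(genes)


# Made-up hits for illustration only.
hits = [
    ("AT1G00001.1", "Pkinase", "intron", 2),
    ("AT1G00001.1", "Pkinase", "exon", 3),
    ("AT1G00001.2", "Pkinase", "intron", 2),
    ("AT1G00001.2", "Pkinase", "exon", 2),
    ("AT1G00002.1", "WD40", "intron", 1),
    ("AT1G00002.1", "WD40", "exon", 4),
]

print(flanking_candidates(hits))  # ['AT1G00001']
```

The second gene is excluded because its exon hit is not adjacent to the intron hit, which mirrors the difference between the two counts reported above.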
Future Research
• Identify pseudogenes by looking for stop codons and frameshift mutations in the introns and by checking the Ka/Ks value
• Use a more recent database of domains
• Follow the same process for the rice genome
Acknowledgement
• Dr. Shin-Han Shiu
• Dr. Kosuke Hanada
• Dr. Melissa Lehti-Shiu
• Dr. Gail Richmond
• HSHSP