70 likes | 218 Views
Machine Learning group meeting. Yiming Zhang 01/27/2012. Biological questions. What is the retained/constitutive introns distribution in gene level? What is the number of retained/constitutive introns per gene?
E N D
Machine Learning group meeting Yiming Zhang 01/27/2012
Biological questions • What is the retained/constitutive introns distribution in gene level? • What is the number of retained/constitutive introns per gene? • How many retained /constitutive introns are near 5’ end or 3’ end of genes?
Biological questions • Which genes have all introns retained/ constitutively spliced? • What is the function of genes which are always alternatively/constitutively spliced? (GO analysis)
Dataset and model • Arabidopsis TAIR 10 genome sequence and TAIR 10 gff3 annotation. • After removing redundancy and the introns or their flanking exons which are shorter than 20 nt, 126,064 introns and 21,091 genes have been used. • AT specific model (trained by RF with 200 trees) has been used to make prediction.