Comparative Genomics in Basidiomycetes: Analyzing multigene families
Balaji Rajashekar, Anders Tunlid, Dag Ahrén, Jason Stajich
Basidiomycete genome data • 58,030 proteins in total
Sequence similarity & clustering
• BLASTP
[Figure: Gene 1 to Gene 10]
TribeMCL (Enright et al., NAR 2002)
• BLASTP: all against all for the basidiomycete genomes
• 58,000 versus 58,000 proteins
• Split the generated network into families
• Results depend on the data and settings
[Animation: TribeMCL]
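The clustering step can be illustrated with a minimal sketch of Markov clustering (MCL), the algorithm TribeMCL applies to the all-against-all BLASTP network. This is not the TribeMCL implementation: the function names, the use of raw hit scores as edge weights, the self-loop weighting, and the toy hits are all assumptions made for illustration.

```python
def normalize_columns(matrix):
    """Scale each column so that it sums to 1 (column-stochastic matrix)."""
    n = len(matrix)
    sums = [sum(matrix[i][j] for i in range(n)) for j in range(n)]
    return [[matrix[i][j] / sums[j] for j in range(n)] for i in range(n)]


def markov_cluster(weights, inflation=2.0, max_iter=100, tol=1e-9):
    """Cluster a symmetric weight matrix by alternating expansion and inflation."""
    n = len(weights)
    m = [[float(v) for v in row] for row in weights]
    for i in range(n):
        # Assumption: self-loop as strong as the node's best hit (1.0 if isolated).
        m[i][i] = max(m[i]) or 1.0
    m = normalize_columns(m)
    for _ in range(max_iter):
        # Expansion: one more step of the random walk (matrix squaring).
        expanded = [[sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)]
                    for i in range(n)]
        # Inflation: element-wise power, then renormalise; boosts strong flows.
        inflated = normalize_columns([[v ** inflation for v in row] for row in expanded])
        change = max(abs(inflated[i][j] - m[i][j]) for i in range(n) for j in range(n))
        m = inflated
        if change < tol:
            break
    # Rows that still carry flow are attractors; their non-zero columns are members.
    clusters = {frozenset(j for j in range(n) if m[i][j] > 1e-6) for i in range(n)}
    return sorted(sorted(c) for c in clusters if c)


def families_from_hits(hits, inflation=2.0):
    """Group proteins into families from (query, subject, score) hits."""
    names = sorted({name for a, b, _ in hits for name in (a, b)})
    index = {name: i for i, name in enumerate(names)}
    weights = [[0.0] * len(names) for _ in names]
    for a, b, score in hits:
        if a != b:  # self-hits are handled by the self-loops above
            i, j = index[a], index[b]
            weights[i][j] = weights[j][i] = max(weights[i][j], score)
    return [[names[i] for i in cluster] for cluster in markov_cluster(weights, inflation)]


# Hypothetical toy hits: two tight groups joined by one weak hit (g3-g4).
toy_hits = [("g1", "g2", 200.0), ("g2", "g3", 180.0), ("g1", "g3", 190.0),
            ("g4", "g5", 150.0), ("g3", "g4", 5.0)]
print(families_from_hits(toy_hits, inflation=2.0))
```

Whether the weak g3-g4 hit is cut depends on the scores and on the inflation value, which is one concrete sense in which the resulting families depend on the data and settings.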
Statistical analyses of gene families: CAFE (De Bie et al., Bioinformatics 2006)
• Models the evolution of gene family sizes
• Takes phylogeny into account
• Calculates birth and death of genes at all nodes
• Identifies families with accelerated gene gain/loss, including extinction
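To make the birth and death modelling concrete, here is a minimal sketch of the transition probability for a birth-death process along a single branch, in the closed form commonly used for gene family sizes. It assumes equal birth and death rates (a single rate `lam`). The function name, the rate, and the branch length are hypothetical, and the sketch does not reproduce how CAFE combines such probabilities over a whole phylogeny.

```python
from math import comb


def transition_prob(start, end, lam, t):
    """P(family has `end` genes after time t | it had `start` genes).

    Birth-death process with equal birth and death rate `lam` (assumption).
    """
    if start == 0:
        # An extinct family stays extinct: size 0 is an absorbing state.
        return 1.0 if end == 0 else 0.0
    alpha = lam * t / (1.0 + lam * t)
    total = 0.0
    for j in range(min(start, end) + 1):
        total += (comb(start, j)
                  * comb(start + end - j - 1, start - 1)
                  * alpha ** (start + end - 2 * j)
                  * (1.0 - 2.0 * alpha) ** j)
    return total


# Hypothetical values: a family of 5 genes, rate 0.01 per unit time, branch length 20.
for size in range(0, 11):
    print(size, round(transition_prob(5, size, lam=0.01, t=20.0), 4))
```

A family whose observed size change is very unlikely under the fitted rate is a candidate for accelerated gain or loss. The case `end == 0` corresponds to extinction of the family on that branch.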
Protein families in Laccaria
• Protein families analysed by CAFE
• 1969 unique protein families
• 7352 protein families in total
PCA of expression data
• 11 experiments
[Figure: PCA plot for protein family 2; sample groups: Mycorrhiza, Mycelia, Fruiting bodies; axis shown: Axis 1]
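As a rough illustration of how the first PCA axis of an expression matrix can be obtained, the sketch below centres the data, builds the covariance matrix, and finds its leading eigenvector by power iteration. The expression values and sample labels are hypothetical toy data, not values from the slides, and the function name is an assumption.

```python
import math
import random


def first_principal_axis(rows, iterations=200):
    """Return (axis, scores) for the first principal component.

    `rows` holds one sample (experiment) per row and one gene per column.
    """
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
           for a in range(d)]
    # Deterministic pseudo-random start vector, normalised to unit length.
    rng = random.Random(0)
    axis = [rng.random() + 0.1 for _ in range(d)]
    norm = math.sqrt(sum(v * v for v in axis))
    axis = [v / norm for v in axis]
    for _ in range(iterations):
        nxt = [sum(cov[a][b] * axis[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(v * v for v in nxt))
        if norm == 0.0:  # no variance along the current direction; stop early
            break
        axis = [v / norm for v in nxt]
    # Project each centred sample onto the axis to get its Axis 1 coordinate.
    scores = [sum(c[j] * axis[j] for j in range(d)) for c in centered]
    return axis, scores


# Hypothetical toy data: 6 samples x 3 genes from one protein family.
labels = ["mycorrhiza", "mycorrhiza", "mycelia", "mycelia", "fruiting", "fruiting"]
expression = [[8.1, 7.9, 2.0], [7.8, 8.2, 2.2], [3.0, 3.1, 2.1],
              [3.2, 2.9, 1.9], [5.0, 5.2, 6.0], [5.1, 4.9, 6.3]]
axis, scores = first_principal_axis(expression)
for label, score in zip(labels, scores):
    print(label, round(score, 2))
```

The sign of the axis is arbitrary, so only the relative positions of the samples along Axis 1 are meaningful.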