280 likes | 495 Views
Mimulus guttatus : a resource for comparative genomics in the asterids. UNC Chapel Hill: Todd Vision et al. (Toby Clarke, Amy Bouck, Eric Ganko*, Andrew Morgan, Stefanie Hartmann, Jason Phillips) Duke University: John Willis et al University of Washington: Toby Bradshaw et al
E N D
Mimulus guttatus: a resource for comparative genomics in the asterids UNC Chapel Hill: Todd Vision et al. (Toby Clarke, Amy Bouck, Eric Ganko*, Andrew Morgan, Stefanie Hartmann, Jason Phillips) Duke University: John Willis et al University of Washington: Toby Bradshaw et al University of Montana: Lila Fishman et al CUGI: Jeff Tomkins* et al JGI: Dan Rokhshar et al Stanford*: Jeremy Schmutz, Jane Grimwood Funded by: NSF FIBR, DOE CSP
Outline • Background on Mimulus and the asterids • Status of genomic resources • ESTs, BACs, physical and genetic maps • JGI whole-genome shotgun assembly • Status of annotation and high-density mapping • Comparative genomics • Polyploidy and synteny • Phylogenetic relationship of Solanaceae relative to Mimulus and coffee
106-119 Mya The lamiid (euasterid I) clade Coffea Solanaceae Antirrhinum Mimulus Striga Sesamum Bremer et al (2002) Mol Phyl Evol 24, 274 Bremer et al (2004) Syst Bio 53, 496
The genus Mimulus • ~160 species in the Phrymaceae • ex Scrophulariaceae • Multiple species complexes • Diversity of floral morphologies and mating systems • Habitat and life history diversity • from sea level to 10,000 ft • serpentine and mine tailings • aquatics, woody shrubs
M. guttatus as a lab organism • Easy to cultivate & clone • Small size • Generation time ~2 mos. • High seed yield • Can be easily selfed or crossed • Small genome (430 Mb) • High recombination (1 cM ~ 250 kb) • Rich in SNP/indel polymorphism
Chronology of an emerging model • Various investigators study: • the genetics of speciation • inbreeding depression • mating system evolution • ecological adaptation • cytological evolution Ecological genetic studies begun by Clausen, Keck & Heisey Permanent mapping populations ESTs, gene based markers BAC-based physical maps Molecular phylogeny of genus Genome sequence Biosystematic studies begun by Robert Vickery First markers & linkage maps 1940 1950 1960 1970 1980 1990 2000 ~4K records for “Mimulus + genetics” in Google Scholar
Some ongoing studies in M. guttatus • Dobzhansky-Muller factors • Cytoplasmic male sterility • Meiotic drive • Copper tolerance & reproductive isolation • Floral pigmentation • Inbreeding depression • Water use efficiency • Flowering time & life history
Candidate genes for floral diversification in Mimulus EMS mutants of M. lewisii Christina Pince, Bradshaw lab M. lewisii mutant M. bifidus M. lewisii wt M. lewisii mutant M. parishii M. inconspicuus M. lewisii mutant M. douglasii M. lewisii mutant M. lewisii mutant M. cardinalis
M. lewisii and M. cardinalisEvolution of prezygotic isolation Toby Bradshaw and Doug Schemske
Genome projects in Mimulus • NSF Frontiers in Biological Research (2003) • Integrated ecological and genomic analysis of speciation in Mimulus • To enable positional cloning of ecologically and evolutionarily interesting genes, particularly for speciation and reproductive isolation • Physical and genetic maps, markers, transformation protocols • DOE Joint Genome Institute • Community Sequencing Program (2005) • Whole genome shotgun of M. guttatus IM62 • A number of supporting sequencing projects
Physical maps • BAC Libraries • M. guttatus: two 11X libraries • M. lewisii: one 10X library • HICF contigs • End-sequences for all BACs • Overgos (~750) CUGI & JGI
Expressed sequence tags • >500K from M. guttatus IM62 • ~ half Sanger, half 454 • Mostly from floral buds and maturing fruits • Also from roots, seedlings, leaves • 30K from M. guttatus DUN • 30K from M. nasutus SF • 20K from M. lewisii
Other FIBR products • ~1000 genetic (EPIC) markers • Multiple RIL and IL populations • Baseline polymorphism and LD data • Agrobacterium-based transformation • Germplasm collection • Community database
JGI whole-genome shotgun • M. guttatus IM62 • inbred line from Iron Mountain, OR • ~7X coverage • Mostly 3 kb & 8 kb paired-end Sanger reads • >200K ~30K fosmid end-sequences • >100K BAC end-sequences • M. guttatus DUN • inbred line from a coastal population • ~20X coverage of 454 paired-ends
Assembly statistics Scaffolds span ~75% of estimated 430 Mb genome Gaps comprise ~6.5% of the scaffolds Number of scaffolds > 50 KB = 512 (which includes 95.7% of assembly) Jeremy Schmutz
Genetic mapping of scaffolds • To order and orient the >2K scaffolds on the genetic map • Procedure • Genome reduction to ~100K markers using restriction-associated DNA (RAD) - Miller et al (2007) Genome Res 7, 214 • Nextgen sequencing of ~100 IM62xDUN RIL lines at ~0.4X coverage each (in pools of 12) • Infer genotypes in 20-100kb sliding windows and analyze linkage map • Model predicts ~50% of scaffolds and >90% of bp in assembly will be mapped Toby Clarke
Structural annotation • Currently ~25,000 predictions (earlier assembly) • Planned JGI pipeline • Mask for repeats • Homology to Phytozome proteins, Genbank asterid proteins • M. guttatus and M. nasutus ESTs, translated M. lewisii ESTs • Genomescan, Fgenesh integrated with Pasa • Filter out TEs & low-confidence genes • Annotation planned to be available in February Theresa Mitros, JGI
Repeat composition • RepeatModeler/RepeatMasker masks ~60% of the assembly • Known LTR retrotransposons: 15% • Most active yet seen is a gypsy with 35 EST hits • Known centromere linked repeats: 8% • Cnt728: highly abundant, heterochromatic, tandem repeat interspersed with LTRs FISH of pachytene chromosomes with probes for DNA (blue), Cnt728 (green) and a gypsy retrotransposon (pink). Eric Ganko and Arpiar Saunders
BAC annotations • 22 fully-sequenced BACs (2.5 Mb) • 236 genes -> 40-45Kgenes in genome Eric Ganko
Mimulus-rice synteny Andrew Morgan
Microsynteny with Solanaceae(ovate region, Wang et al 2008 Genetics) May be broken into 3 regions in Mimulus Within each, overall order and orientation well conserved 2 Mimulus genes with no syntenolog in Solanaceae 9 Solanaceae genes with no syntenolog in Mimulus Eric Ganko
VISTA conservation plot (20/50nt windows) Mimulus-specific genes (blue) Mimulus genes with Solanceae syntenologs (orange) Eric Ganko
Unresolved relationships among lamiid genomes Solanum (Solanales) Mimulus (Lamiales) Coffee (Gentianales) All 3 possible relationships for these taxa have been published
Mimulus is an outgroup to the Solanaceae-coffee clade Solanum (Solanales) 100 Coffee (Gentianales) Mimulus (Lamiales) Maximum likelihood analysis of concatenated alignment for 13 gene families totalling 5,691 amino acids Amy Bouck and Stefanie Hartmann
Mimulus as a resource for tomato • Data should be available by mid-2009 • Assist with gene and repeat structural annotation • Early look at asterid-specific biology • Shared functional annotation • Evolutionary analysis • Potentially improve coverage of syntenic blocks • Could serve as outgoup to reconstruct Solanaceae ancestor and polarize evolutionary changes between Solanaceae and coffee