1 / 45

Sequencing gene rich regions of the Medicago truncatula genome

Sequencing gene rich regions of the Medicago truncatula genome. Molecular Breeding of Forage and Turf Noble Foundation May 21, 2003. Dr. Doris Kupfer Advanced Center for Genome Technology Department of Chemistry and Biochemistry University of Oklahoma

jerica
Download Presentation

Sequencing gene rich regions of the Medicago truncatula genome

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Sequencing gene rich regions of the Medicagotruncatula genome Molecular Breeding of Forage and Turf Noble Foundation May 21, 2003 Dr. Doris Kupfer Advanced Center for Genome Technology Department of Chemistry and Biochemistry University of Oklahoma dkupfer@ou.edu www.genome.ou.edu

  2. Why sequence the Medicago genome? • An important forage crop • A genetically tractable model legume • A relatively small (~500 Mbp) diploid genome • Active legume research community • Medicago Research Consortium • Large collection of ESTs • Excellent BAC library • Integrated physical and genetic map

  3. Sequence Pipeline at the University of Oklahoma Genome Center, OU-ACGT DNA GenBank Sequencing (ABI 3700) Growing subclones (HiGroTM) Subclone isolation II (VPrepTM) DNA shearing (HydroshearTM) Data assembly and Analysis Thermocycling (ABI 9700) Subclone Isolation I (Mini-StaccatoTM) Colony Piking (QPixIITM) Closure Miscelaneous liquid handling Primer Synthesis

  4. Functional Distribution Overview of M. truncatula Predicted ORFs No Hits 29% Metabolism 28% No Metabolic Classification 40% Genetic Information & Processing 2% Cellular Processes 1% Environmental Information Processing <1%

  5. Metabolism Biodegradation of Xenobiotics 17% Carbohydrate Metabolism 10% Energy Metabolism 5% Lipid Metabolism 1% Nucleotide Metabolism 3% Biosynthesis of Secondary Metabolites 13% Amino acid Metabolism 6% Metabolism of Other Amino Acids 2% Metabolism of Cofactors & Vitamins11% Metabolism of Complex Carbohydrates 13% Metabolism of Complex Lipids 19%

  6. Genetic Information & Processing Translation 54% Sorting and Degradation 23% Replication 3% Transcription 20%

  7. Data Analysis and Annotation Schema BAC Sequences Catenated Contig Sequences (>5 KB) Medic Repeats BLASTX (Arab.) BLASTN (GB-EST) Genscan FgeneSH BLASTX (GB- NR) BLASTN (TIGR_Plant gene Indices) tRNA/rRNA Analysis BLASTP (against KEGG-A. thaliana) KEGG Metabolic Reconstruction GBrowse

  8. Examples of Reconstructed Pathways in M. truncatula (Purine and Pyrimidine Metabolism)

  9. Examples of Reconstructed Pathways in M. truncatula (CO2 Fixation and Nitrogen Metabolism)

  10. Examples of Reconstructed Pathways in M. truncatula (Aminoacyl-tRNA synthesis and Globoside Metabolism)

  11. Hydroshear • GeneMachines, Inc. San Carlos, CA • Precision-drilled ruby orifice • Pump retraction speed range 0 – 40 • A 100 to 300 ul sample sheared at a retraction speed • setting of 10 produces DNA 1- 4 Kbp fragments

  12. Genetix QPixII Colony Picker Digitizes colonies and picks in batches of 96 into 384-well plates Pins are sterilized after each set of 96 colonies are picked

  13. Cell Growth in 384 Well Plates in a HiGro • Capacity: 48 shallow, 384 well plates or 24 deep well plates. • Cells are grown in supplemented TB medium • Cells are shaken at 520 rpm for 22 hours at 370C. • After 3.5 hours, oxygen is added @ 0.5 ft3/min for 0.5 second every 30 seconds.

  14. 4 built in shakers Robotic 386 well plate loader and stacker 384 tip pipettor Zymark SciClone with Twister II

  15. Primer synthesis (Mermade IV) for PCR-based closure and finishing • Standard phosphoramidite chemistry in an argon- filled reaction chamber. • 192 primers synthesized at 2.5 nmole scale twice each day. • 2.5 nanomole synthesis (50 cents/oligo) typically is used for either PCR or DNA sequencing primers, but can be scaled to 10 nanomole.

  16. Medicago truncatula Mapped BAC Approach in collaboration with Doug Cook and DJ Kim at U.C. Davis • Focus is on gene rich euchromatic regions • Initial sequencing of 1000 BACs with known biological markers and covering regions of biological interest as supplied to us by the UC Davis group. • Once the BACs are received, we create the shotgun libraries, isolate the sequencing templates and obtain the working draft sequence followed by closure and finishing.

  17. UC Davis -------- Oklahoma University

  18. Pachytene FISH From D. Cook, et al (2001)

  19. UC Davis BAC Tiling Path

  20. May12,2003

  21. Blast Homology with M.truncatula ESTs and Arabidopsis thaliana

  22. Medicago GC Content for Regions Sequenced to Date

  23. Exon Size Distribution (All Sequence Data) (FgenesH vs. Genscan) 12000 10000 FgenesH = 7838 genes Genscan = 6693 genes 8000 Number of Exons 6000 4000 2000 0 1-50 401-500 101-200 51-100 701-800 301-400 501-600 601-700 201-300 801-900 901-1000 Exon Size Range

  24. Intron Size Distribution (All Sequence Data) (FgenesH vs. Genscan) 8000 7000 FgenesH = 7838 genes Genscan = 6693 genes 6000 5000 Numberof Introns 4000 3000 2000 1000 0 1-50 51-100 401-500 501-600 701-800 601-700 301-400 201-300 801-900 101-200 901-1000 Intron Size Range

  25. PrintrepeatAnalysis of M. truncatula BAC AC121240 vs. A. thaliana Chr.2 Expansion, Duplication, Repeat Elements

  26. PIP of M. truncatula BAC AC121240 vs. A. thaliana Chr.2

  27. Medicago truncatula Summary and Conclusions • At present we have received six (6) sets of 96 well characterized BACs from the UC Davis group. • Of these, all 576 BACs have been isolated, have shotgun libraries constructed, and are being sequenced. • Data for almost all of the first five 96 sets and several of the sixth 96 set of BACs (>61 million bp) have been submitted to GenBank and assigned accession numbers as of May 14, 2003 . • We have scaled up our sequencing and are close to our goal of obtaining a working draft sequence (5-6 fold coverage) of 96 BACs/month.

  28. Medicago truncatula Summary and Conclusions • Average Gene Density of 140 to 160 genes per million bp in the euchromatic, gene rich regions of the 8 Medicago truncatula chromosomes based on 55 finished (phase 3) BACs covering ~6.5 Mbp. • Very close to Doug Cook’s 1 gene per 6.5 Kbp. • Genome characteristics such as %GC, intron/exon size and conserved cis sequences reveal Medicago characteristics • The sequence of the Medicago truncatula genome shows homology to the sequenced Arabidopsis thalianagenome but expansion, rearrangements and duplications are evident.

  29. Data Release and Preliminary Annotation • All our sequence data is available through links on our web site to GenBank and on our ftp site at URL: ftp.genome.ou.edu/medicago • keyword and blast searches can be done on our web site at URL: http://www.genome.ou.edu/medicago.html • Additional annotation via Genome Browser database are available on our web site at URL: http://www.genome.ou.edu/medicago_table.html • E-mail suggestions for additional annotation to Bruce Roe at: broe@ou.edu

  30. Future Plans • Complete working draft sequence of ~1000 mapped Medicago truncatula BACs over the next 6 months with funding from the Noble Foundation • Finish a significant number of these BACs with additional • funding from the DOE • Pending NSF application to: • complete the genomic sequence of unfinished BACs to fewer than one uncertain base in 10,000-The Bermuda rules • and • sequence to working draft and finish an additional ~750 mapped BACs • Obtain the contiguous sequenceof the Gene Rich regions of four of the 8 Medicago truncatula genome at OU, with the remaining four being completed by our international partners at TIGR, Sanger, and Genoscope.

  31. Laboratory Organization Bruce Roe, PI Support Teams Reagents & Equip. Maint. Informatics Production DNA Synthesis Administration Jim White Steve Kenton Hongshing Lai Sean Qian Rose Morales-Diaz* Mounir Elharam* Yonas Tesfai Steve Shaull** Doug White Work-study Undergraduates** Phoebe Loh* Sulan Qi Bart Ford* Mounir Elharam* Doug White Kay Lynn Hale Dixie Wishnuck Tami Womack Mary Catherine Williams Research Teams Limei Yang Angie Prescott* Audra Wendt** Mandi Aycock** Doris Kupfer Julia Kim* Sun So Graham Wiley** Lauren Ritterhouse** Axin Hua Weihong Xu Fares Najar Chunmei Qu Keqin Wang Carson Qu Shuling Li ShaoPing Lin Honggui Jia Hongming Wu Baifang Qin Peng Zhang Ziyun Yao Steve Shaull* Youngju Yoon Jami Milam Sara Downard** Trang Do Anh Do Lily Fu Yang Ye James Yu Tessa Manning** Stephan Deschamps Shelly Oommen Christopher Lau Yanhong Li Fu Ying Liping Zhou Ruihua Shi Junjie Wu Pheobe Loh * Sulan Qi Bart Ford* Lin Song Ying Ni Huarong Jiang Funding from the Noble Foundation, DOE, (pending NSF) Collaborators at UC Davis and the Noble Foundation * Previous undergraduate research student ** Present undergraduate research student

  32. TheACGTTeam

  33. Future Plans Sdf sdfkjh

  34. Medicago truncatula Summary and Conclusions • Average Gene Density of 140 to 160 genes per million bp in the euchromatic, gene rich regions of the 8 Medicago truncatula chromosomes based on 55 finished (phase 3) BACs covering ~6.5 Mbp. • Close to Doug’s 1 gene per 6.5 Kbp. • We have observed numerous unique repeated sequences in heterochromatic and euchromatic regions of the Medicago truncatula genome. • The sequence of the Medicago truncatula chloroplast genome is complete and shows a high degree of homology to the sequenced Arabidopsis thalianachloroplast genome.

  35. Medicago truncatula chloroplast genome

  36. Arabidopsis thalianachloroplast Medicago truncatula chloroplast

More Related