390 likes | 488 Views
Tiling arrays for genetic, epigentic, and environmental variation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago http://naturalvariation.org/. Widely Distributed. Olivier Loudet. http://www.inra.fr/qtlat/NaturalVar/NewCollection.htm.
E N D
Tiling arrays for genetic, epigentic, and environmental variation in Arabidopsis thalianaJustin BorevitzEcology & EvolutionUniversity of Chicagohttp://naturalvariation.org/
Widely Distributed Olivier Loudet http://www.inra.fr/qtlat/NaturalVar/NewCollection.htm
Local Population Variation Ivan Baxter Scott Hodges
Seasonal Variation Matt Horton Megan Dunning
Seasons in the Growth Chamber Seasons in the Growth Chamber Developmental Plasticity == Behavior Developmental Plasticity == Behavior Sweden Spain • Changing Day length • Cycle Light Intensity • Cycle Light Colors • Cycle Temperature • Changing Day length • Cycle Light Intensity • Cycle Light Colors • Cycle Temperature
Which arrays should be used? BAC array cDNA array Long oligo array
Which 25mer arrays should be used? Gene array Exon array Tiling array 35bp tile, 25mers 10bp gaps
Which 25mer arrays should be used? SNP array Ressequencing array Tiling/SNP array
Universal Whole Genome Array DNA RNA Gene/Exon Discovery Gene model correction Non-coding/ micro-RNA Chromatin Immunoprecipitation ChIP chip Alternative Splicing Methylation Antisense transcription Polymorphism SFPs Discovery/Genotyping Transcriptome Atlas Expression levels Tissues specificity Comparative Genome Hybridization (CGH) Insertion/Deletions Copy Number Polymorphisms RNA Immunoprecipitation RIP chip Allele Specific Expression Control for hybridization/genetic polymorphisms to understand true EXPRESSION polymorphisms
Improved Genome Annotation ORFa Transcriptome Atlas ORFb start AAAAA deletion M M M M M M M M M M M M SFP SNP SNP SFP SFP conservation Chromosome (bp)
Talk Outline • Whole Genome Tiling Arrays • Spatial Correction, grid alignment • Alternative splicing • Methylation • Single Feature Polymorphisms (SFPs) • Genetic Mapping • Potential deletions/ Copy Number Variants • Allele Specific Expression • Resequencing/ Haplotypes • Variation Scanning • Whole Genome Tiling Arrays • Spatial Correction, grid alignment • Alternative splicing • Methylation • Single Feature Polymorphisms (SFPs) • Genetic Mapping • Potential deletions/ Copy Number Variants • Allele Specific Expression • Resequencing/ Haplotypes • Variation Scanning
Tiling Array Re annotation • 6.25Million probes • 3.125Million PM probes • 1.67Million unique PM probes 17bp (blast) • 736k PM features in TUs (exon array) • 130k TUs • 28k genes
Spatial Correction, grid Alignment Background correction for RNA, ! For DNA
Transcription subUnits (TUs) Intron1 Exon1 Exon2 Tu1 Tu2 Tu3
Alternative Splicing Van Col V V V C C C Xu Zhang
ChIP chip treatment effect! Experimental Design same protocol/antibody dynamic binding model treatment effect Actual biological signal
Methods for labeling • Extract genomic 100ng DNA (single leaf) • Digest with either msp1 or hpa2 CCGG • Label with biotin random primers • Hybridize to array • Fit model Y = m + E * G + e
SFP detection on tiling arrays Delta p0 FALSE Called FDR 1.00 0.95 18865 160145 11.2% 1.25 0.95 10477 132390 7.5% 1.50 0.95 6545 115042 5.4% 1.75 0.95 4484 102385 4.2% 2.00 0.95 3298 92027 3.4%
methylated features and mSFPs Enzyme effect, on CCGG features GxE mQTL? >10,000 of 100,000 at 5% FDR 276 at 15% FDR
Chip genotyping of a Recombinant Inbred Line 29kb interval
100bibb mutant plants Mapbibb 100wt mutant plants
Array Mapping Hazen et al Plant Physiology 2005
Potential Deletions (wild lines) >500 potential deletions 45 confirmed by Ler sequence 23 (of 114) transposons Disease Resistance (R) gene clusters Single R gene deletions Genes involved in Secondary metabolism Unknown genes
Het Fast Neutron deletions FKF1 80kb deletion CHR1 cry2 10kb deletion CHR1
FLM natural deletion FLM Potential Deletions Suggest Candidate Genes FLOWERING1 QTL Chr1 (bp) Flowering Time QTL caused by a natural deletion in FLM (Werner et al PNAS 2005)
Col Female imprint Col allele expressed cis regulatory variation Col/Col Col/Van Van allele expressed Van/Col Van/Van
Array Haplotyping • What about Diversity/selection across the genome? • A genome wide estimate of population genetics parameters, θw, π, Tajima’D, ρ • LD decay, Haplotype block size • Deep population structure? • Col, Lz, Bur, Ler, Bay, Shah, Cvi, Kas, C24, Est, Kin, Mt, Nd, Sorbo, Van, Ws2 Fl-1, Ita-0, Mr-0, St-0, Sah-0
Chromosome1 ~500kb Col Ler Cvi Kas Bay Shah Lz Nd Array Haplotyping Inbred lines Low effective recombination due to partial selfing Extensive LD blocks
SFPs for reverse genetics 14 Accessions 30,950 SFPs` http://naturalvariation.org/sfp
Tajima’s D like 50kb windows RPS4 unknown
NaturalVariation.org NaturalVariation.org University of Chicago Xu Zhang Evadne Smith Ken Okamoto Yan Li Michigan State Shinhan Shui Purdue Ivan Baxter Sainsbury Laboratory Jonathan Jones University of Chicago Xu Zhang Evadne Smith Ken Okamoto Yan Li Michigan State Shinhan Shui Purdue Ivan Baxter Sainsbury Laboratory Jonathan Jones USC Magnus Nordborg Paul Marjoram Max Planck Detlef Weigel Scripps Sam Hazen University of Michigan Sebastian Zollner USC Magnus Nordborg Paul Marjoram Max Planck Detlef Weigel Scripps Sam Hazen University of Michigan Sebastian Zollner