1 / 35

30 Sept. 2010

Genome Sciences Centre. BC Cancer Agency, Vancouver, BC, Canada. ALEXA-Seq analysis reveals breast cell type specific mRNA isoforms www.AlexaPlatform.org. Malachi Griffith. 30 Sept. 2010. In most genes, transcript diversity is generated by alternative expression.

zalman
Download Presentation

30 Sept. 2010

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Genome Sciences Centre BC Cancer Agency, Vancouver, BC, Canada ALEXA-Seq analysis reveals breast cell type specific mRNA isoforms www.AlexaPlatform.org Malachi Griffith 30 Sept. 2010

  2. In most genes, transcript diversity is generated by alternative expression Types of alternative expression Gene expression

  3. Transcript variation is important to the study of human disease • Alternative expression generates multiple distinct transcript variants from most human loci • Specific transcript variants may represent useful therapeutic targets or diagnostic markers (Venables, 2006)

  4. Massively parallel RNA sequencing Tissues/Cell Lines Generate cDNA, fragment, size select, add linkers Isolate RNAs Luminal Myoepithelial vHMECs hESCs Sequence ends Map to genome, transcriptome, and predicted exon junctions 263 million paired reads 21 billion bases of sequence Discover isoforms and measure abundance

  5. Pipeline overview

  6. Summary of features for human: ~4 million total (14% ‘known’) 37k Genes 62k Transcripts 278k exons 2,210k exon junctions 407k alternative exon boundaries 560k intron regions 227k intergenic regions What is an ALEXA-Seq sequence ‘feature’

  7. ALEXA-Seq processing: 19 projects REMC + 18 others 105 libraries (200+ lanes) 3.9 billion paired-end reads 36-mers to 75-mers Data analyzed to date

  8. Expression, differential expression and alternative expression values for 3.8 million features for each library processed Library quality analysis Number of features expressed (above background) Genes, transcripts, exon regions, junctions, etc. Differential gene expression Ranked lists Alternative expression Ranked lists Alternative isoforms involving exon skipping, alternative transcript initiation sites, etc. Known or predicted novel isoforms Candidate peptides Ranked lists Output

  9. Goals Visualization, interpretation, design of validation experiments, distribute results to internal/external collaborators What kinds of questions does ALEXA-Seq allow us to ask/answer? http://www.alexaplatform.org/alexa_seq/Breast/Summary.htm ALEXA-Seq data browser(using REMC analysis as an example)

  10. Library summary Read quality Tag redundancy End bias Mapping rates Signal-to-noise hnRNA & gDNA contamination Features detected Is the RNA-Seq library suitable for alternative expression analysis?

  11. Is my favorite gene expressed? alternatively expressed?

  12. Expression Differential expression Alternative expression Provided for each feature type (gene, exon, junction, etc.) Ranked lists of events What are the most highly expressed genes, exons, etc. in each library?

  13. e.g. most highly expressed genes

  14. Candidate genes Each comparison DE or AE events Gains or Losses What are the top DE and AE genes for each tissue comparison?

  15. Summary page for vHMECs vs. Luminal

  16. Candidate features gained in vHMECs vHMECs vs. Luminal CD10

  17. Which exons/junctions and corresponding peptides might be suitable for antibody design?

  18. Candidate peptides gained in vHMECs vHMECs vs. Luminal

  19. Example housekeeping gene(Actin; no change)

  20. CD10 (used to sort myoepithelial cells) Myoepithelial & vHMECs Luminal 422-fold higher in Myoepithelial than Luminal

  21. CD227 (used to sort luminal epithelial cells) CD227 Luminal Myoepithelial CD227

  22. Differential gene expression of CASP14(Caspase 14 gained in vHMECs)

  23. Novel skipping of PTEN exon 6

  24. Exon 12 skipping of DDX5 (p68)

  25. Tissue specific isoforms of CA12 vHMECs Myoepithelial Luminal

  26. Alternative first exons of INPP4B

  27. Alternative first exons of SERPINB7

  28. FERM domain containing proteins are alternatively expressed * * (FRM6, FRM4A, FRMD4B are AE) (FRMD3, FRMD8 are DE)

  29. Novel isoforms observed only in vHMECs E7-E10 E6-E10

  30. Are novel junctions real? What proportion validate by RT-PCR and Sanger sequencing? Are differential/alternative expression changes observed between tissues accurate? How well do DE values correlate with qPCR? To answer these questions we performed ~400 validations of ALEXA-Seq predictions from a comparison of two cell lines… How reliable are predictions from ALEXA-Seq?

  31. Validation (qualitative) 33 of 189 assays shown. Overall validation rate = 85%

  32. Validation (quantitative) qPCR of 192 exons identified as alternatively expressed by ALEXA-Seq Validation rate = 88%

  33. ALEXA-Seq approach provides comprehensive global transcriptome profile Input: paired-end RNA sequence data Output: expression, differential expression, alternative expression, candidate peptides, etc. Detection of both known and novel isoforms Subset that differ between conditions Predictions are highly accurate 86% validation rate by RT-PCR, qPCR and Sanger sequencing www.AlexaPlatform.org Conclusions

  34. Acknowledgements Griffith M, Griffith OL, Morin RD, Tang MJ, Pugh TJ, Ally A, Asano JK, Chan SY, Li I, McDonald H, Teague K, Zhao Y, Zeng T, Delaney AD, Hirst M, Morin GB, Jones SJM, Tai IT, Marra MA. Alternative expression analysis by RNA sequencing. In review (Nature Methods). Supervisor Marco Marra Committee Joseph Connors Stephane Flibotte Steve Jones Gregg Morin Bioinformatics Obi Griffith Ryan Morin Rodrigo Goya Allen Delaney Gordon Robertson Richard Corbett Sequencing Martin Hirst Thomas Zeng Yongjun Zhao Helen McDonald Laboratory Trevor Pugh Tesa Severson 5-FU resistance Michelle Tang Isabella Tai Marco Marra Multiple Myeloma Rodrigo Goya Marco Marra Neuroblastoma Olena Morozova Marco Marra Morgen Pamela Hoodless Jacquie Schein Inanc Birol Gordon Robertson Shaun Jackman Iressa and Sutent Obi Griffith Steven Jones Lymphoma Ryan Morin Marco Marra

More Related