Next Generation Sequencing

Next Generation Sequencing. Itai Sharon, November 11th, 2009, Introduction to Bioinformatics. 2001: Human Genome Project, $2.7 billion, 11 years. 2001: Celera, $100M, 3 years. 2007: 454, $1M, 3 months. 2008: ABI SOLiD, $60K, 2 weeks. 2009: Illumina, Helicos, $40-50K. 2010: $5K, a few days?

Presentation Transcript


  1. Next Generation Sequencing. Itai Sharon, November 11th, 2009. Introduction to Bioinformatics

  2. Sequencing the Human Genome • 2001: Human Genome Project, $2.7 billion, 11 years • 2001: Celera, $100M, 3 years • 2007: 454, $1M, 3 months • 2008: ABI SOLiD, $60K, 2 weeks • 2009: Illumina, Helicos, $40-50K • 2010: $5K, a few days? • 2012: $100, <24 hrs? [Figure: log10(price) versus year, 2000-2010]
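
The plot on this slide puts price on a log10 scale. As a rough sketch, the y-axis values can be recomputed from the listed prices. The 2009 entry uses the midpoint of the quoted range, which is an assumption made here, and the 2010 and 2012 entries are the slide's own projections.

```python
import math

# Cost milestones as listed on the slide: (year, label, price in US dollars).
# The 2010 and 2012 rows are projections (marked with "?" on the slide).
MILESTONES = [
    (2001, "Human Genome Project", 2.7e9),
    (2001, "Celera", 100e6),
    (2007, "454", 1e6),
    (2008, "ABI SOLiD", 60e3),
    (2009, "Illumina, Helicos", 45e3),  # assumed midpoint of the 40-50K range
    (2010, "projected", 5e3),
    (2012, "projected", 100.0),
]


def log10_prices(milestones):
    """Return (year, label, log10(price)) tuples, i.e. the plot's y values."""
    return [(year, label, math.log10(price)) for year, label, price in milestones]


for year, label, logp in log10_prices(MILESTONES):
    print(f"{year}  {label:<22} log10(price) = {logp:.2f}")
```

On this scale the drop from the Human Genome Project to the projected 2012 price spans more than seven orders of magnitude.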

  3. In this Talk: • Sequencing 1.0: Sanger • Assembly • Next generation sequencing (NGS) • NGS applications • Future directions

  4. Genome Sequencing • Goal • Determine the order of nucleotides across a genome • Problem • Current DNA sequencing methods can read only short stretches of DNA at a time (under 1-2 Kbp) • Solution • Sequence many short fragments, then use computers to assemble the small pieces

  5. Genome Sequencing [Figure: a genome is broken into short fragments of DNA; the fragments are sequenced to give short DNA sequences, which are assembled into the sequenced genome]
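
The "assemble the small pieces" step can be illustrated with a toy greedy overlap assembler. This is only a sketch under simplifying assumptions (error-free reads, a single strand, no long repeats), not the method used by any real assembler. The function names and the `min_len` parameter are made up for illustration.

```python
def overlap(a, b, min_len=3):
    """Length of the longest suffix of a that equals a prefix of b.

    Returns 0 if the longest such overlap is shorter than min_len.
    """
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(reads, min_len=3):
    """Repeatedly merge the pair of sequences with the largest overlap."""
    reads = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    # Drop reads that are fully contained in another read.
    reads = [r for r in reads if not any(r != s and r in s for s in reads)]
    while len(reads) > 1:
        best_k, best_i, best_j = 0, None, None
        for i, a in enumerate(reads):
            for j, b in enumerate(reads):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            break  # no overlaps left: the remaining sequences are separate contigs
        merged = reads[best_i] + reads[best_j][best_k:]
        reads = [r for idx, r in enumerate(reads) if idx not in (best_i, best_j)]
        reads.append(merged)
    return reads


# Toy genome and four overlapping 12 bp "reads" cut from it.
genome = "ACGTGGTAATGGCGTATACACCCTTAGGCCATA"
reads = [genome[0:12], genome[7:19], genome[14:26], genome[21:33]]
contigs = greedy_assemble(reads)
print(contigs)
```

With repeats longer than the overlaps, a greedy merge like this can join the wrong pieces, which is one reason mate pairs are used later in the talk.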

  6. Sanger Sequencing • Mix DNA with dNTPs and ddNTPs • Amplify • Run in gel • Fragments migrate a distance that depends on their size: shorter fragments travel farther
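
The chain-termination idea behind these steps can be sketched as a small simulation: many copies of a strand are elongated, each copy stops when a ddNTP happens to be incorporated, and ordering the resulting fragments by length spells out the sequence. The parameter values (`n_molecules`, `dd_fraction`) are arbitrary choices for illustration, not laboratory values.

```python
import random


def chain_termination_fragments(synth_strand, n_molecules=2000,
                                dd_fraction=0.05, seed=0):
    """Simulate elongation of many copies of the synthesized strand.

    At each position a ddNTP is incorporated with probability dd_fraction,
    which terminates that molecule. Returns the set of observed fragments
    as (length, terminal base) pairs.
    """
    rng = random.Random(seed)
    fragments = set()
    for _ in range(n_molecules):
        for pos, base in enumerate(synth_strand, start=1):
            if rng.random() < dd_fraction:
                fragments.add((pos, base))
                break  # a ddNTP ends elongation of this molecule
    return fragments


def read_gel(fragments):
    """Read terminal bases in order of fragment length, shortest first."""
    return "".join(base for _, base in sorted(fragments))


strand = "ACGTGGTAACGTATACAC"
fragments = chain_termination_fragments(strand)
print(read_gel(fragments))
```

With enough molecules every fragment length is represented, so reading the gel from the shortest fragment to the longest recovers the strand.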

  7. Sanger Sequencing

  8. Sanger Sequencing • Advantages • Long reads (~900 bp) • Suitable for small projects • Disadvantages • Low throughput • Expensive

  9. Assembly • Cut DNA into larger pieces (2 Kbp, 15 Kbp) and sequence both ends of each piece (Fleischmann et al., 1994) [Figure: mate pairs of ~500 bp end reads linking contig 1 and contig 2, with ~(length - 1,000) bp unsequenced between the two reads; labels: 2 Kbp mates, 15 Kbp mates, resolving repeats, better assembly of contigs, gap length estimation]
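
The arithmetic behind mate pairs can be written out as a short sketch. The coordinate convention (where each end read starts and ends within its contig) and the function names are assumptions made for this example.

```python
def unsequenced_middle(insert_len, read_len=500):
    """Bases between the two end reads of one piece: length - 2 * read_len."""
    return insert_len - 2 * read_len


def estimate_gap(insert_len, contig1_len, start1, end2):
    """Estimate the gap between two contigs linked by one mate pair.

    start1: position in contig 1 where the piece begins (0-based).
    end2:   position in contig 2 where the piece ends (exclusive).
    The piece covers contig1[start1:], the gap, and contig2[:end2].
    """
    return insert_len - (contig1_len - start1) - end2


print(unsequenced_middle(2000))    # 2 Kbp piece, ~500 bp read at each end
print(unsequenced_middle(15000))   # 15 Kbp piece
print(estimate_gap(insert_len=2000, contig1_len=10000, start1=9200, end2=700))
```

In practice piece lengths vary around their nominal size, so a gap estimate would normally be averaged over many mate pairs that link the same two contigs.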

  10. Assembly: How Much DNA? (Lander and Waterman, 1988) • Low coverage • Input: a few pieces to assemble • Output: many contigs, many gaps • High coverage • Input: many pieces to assemble • Output: a few contigs, a few gaps
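
The trade-off can be made concrete with the idealized form of the Lander-Waterman estimates: coverage c = N * L / G, expected uncovered fraction e^(-c), and roughly N * e^(-c) contigs. This sketch ignores the minimum overlap needed to join two reads, and the read counts below are made-up inputs.

```python
import math


def coverage(n_reads, read_len, genome_len):
    """Average number of reads covering each base: c = N * L / G."""
    return n_reads * read_len / genome_len


def fraction_uncovered(c):
    """Expected fraction of the genome not covered by any read."""
    return math.exp(-c)


def expected_contigs(n_reads, c):
    """Expected number of contigs, ignoring the minimum overlap length."""
    return n_reads * math.exp(-c)


GENOME_LEN = 1_800_000  # a 1.8 Mbp genome
READ_LEN = 500          # ~500 bp reads

for n_reads in (3_600, 36_000):  # hypothetical read counts
    c = coverage(n_reads, READ_LEN, GENOME_LEN)
    print(f"N={n_reads}: coverage={c:.1f}, "
          f"uncovered={fraction_uncovered(c):.2e}, "
          f"contigs~{expected_contigs(n_reads, c):.1f}")
```

Under these assumptions, going from 1x to 10x coverage takes the expected contig count from over a thousand to only a few.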

  11. Sanger Sequencing • 1982: lambda virus, DNA stretches up to 30-40 Kbp (Sanger et al.) • 1994: H. influenzae, 1.8 Mbp (Fleischmann et al.) • 2001: H. sapiens, D. melanogaster, 3 Gbp (Venter et al.) • 2007: Global Ocean Sampling Expedition, ~3,000 organisms, 7 Gbp (Venter et al.) [Figure: timeline, 1980-2000]

  12. Next Generation Sequencing: Why Now? • Motivation: HGP and its derivatives, personalized medicine • Short-read applications: (re-)sequencing, other methods (e.g. gene expression) • Advances in technology

  13. High Parallelism is Achieved in Polony Sequencing [Figure: Sanger versus polony sequencing]

  14. Generation of Polony array: DNA Beads (454, SOLiD) DNA Beads are generated using Emulsion PCR

  15. Generation of Polony array: DNA Beads (454, SOLiD) DNA Beads are placed in wells

  16. Generation of Polony array: Bridge-PCR (Solexa) DNA fragments are attached to array and used as PCR templates

  17. Sequencing: Pyrosequencing (454) Complementary strand elongation: DNA Polymerase

  18. Sequencing: Fluorescently labeled Nucleotides (Solexa) Complementary strand elongation: DNA Polymerase

  19. Sequencing: Fluorescently Labeled Nucleotides (ABI SOLiD) Complementary strand elongation: DNA Ligase

  20. Sequencing: Fluorescently Labeled Nucleotides (ABI SOLiD) 5 reading frames, each position is read twice
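
The slide does not spell out how each position ends up being read twice. One way to picture it is the two-base "color space" encoding usually described for SOLiD, where each color reports on a pair of adjacent bases, so every interior base contributes to two colors. The sketch below assumes that encoding (color = XOR of 2-bit base codes); the function names are illustrative.

```python
# Assumed 2-bit codes; the color for a dinucleotide is the XOR of its two codes.
BASE_TO_BITS = {"A": 0, "C": 1, "G": 2, "T": 3}
BITS_TO_BASE = "ACGT"


def encode_colors(seq):
    """One color (0-3) per pair of adjacent bases."""
    return [BASE_TO_BITS[a] ^ BASE_TO_BITS[b] for a, b in zip(seq, seq[1:])]


def decode_colors(first_base, colors):
    """Recover the bases from a known first base and the color calls."""
    bases = [first_base]
    current = BASE_TO_BITS[first_base]
    for color in colors:
        current ^= color  # the next base is determined by the previous base and the color
        bases.append(BITS_TO_BASE[current])
    return "".join(bases)


colors = encode_colors("ATGGC")
print(colors)
print(decode_colors("A", colors))
```

Because each base is interrogated by two overlapping colors, a true single-base variant changes two adjacent colors, while an isolated measurement error changes only one.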

  21. Single Molecule Sequencing: HeliScope • Direct sequencing of DNA molecules: no amplification stage • DNA fragments are attached to an array • Potential benefits: higher throughput, fewer errors

  22. Technology Summary *Source: Shendure & Ji, Nat Biotech, 2008

  23. What, When and Why • Sanger: Small projects (less than 1 Mbp) • 454: De novo sequencing, metagenomics • Solexa, SOLiD, HeliScope: • Gene expression, protein-DNA interactions • Resequencing

  24. Applications

  25. Applications

  26. Where Do We Go from Here? • Higher throughput, longer reads (Pacific BioSciences) • Computational bottleneck • Shift to sequencing-based technologies • Will it help to cure cancer?
