1 / 12

Detecting Copy Number Variation With Short Paired Reads

Detecting Copy Number Variation With Short Paired Reads. Department of Computer Science University of Toronto Genome Informatics 2009. Paul Medvedev , Marc Fiume, Misko Dzamba, Tim Smith, Adrian Dalca, Mike Brudno. Copy Number Variants (CNVs).

niran
Download Presentation

Detecting Copy Number Variation With Short Paired Reads

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Detecting Copy Number Variation With Short Paired Reads Department of Computer Science University of Toronto Genome Informatics 2009 Paul Medvedev, Marc Fiume, Misko Dzamba, Tim Smith, Adrian Dalca, Mike Brudno

  2. Copy Number Variants (CNVs) • Large regions that appear a different number of times within different indiv. • CNVs are associated with • a number of diseases • Input • reference human genome • sequenced donor genome • Output • CNV annotations in ref

  3. Previous Approach Using depth of coverage: Campbell et al 2008 Chiang et al 2009 Yoon et al 2009 Ref DOC Ref CNV CNV • Our Approach: • Capture adjacency information about the donor genome in a graph. • Use these adjacencies together with DOC

  4. Donor Graph Step 1: represent referenceadjacencies

  5. Donor Graph Step 1: represent reference adjacencies

  6. Donor Graph Step 2: represent donor adjacencies Donor Ref

  7. Donor Graph Step 2: represent donor adjacencies Donor Ref

  8. Which walk is the donor? Path Use depth-of-coverage: Ref DOC Ref 1 2 2 1 1 1 1 CNV • We find a path that is “most faithful” to the DOC • using probabilistic model to score “faithfulness” • use network flow to find traversal counts of walk with max score

  9. Preliminary Results • NA18507 individual sampled with Illumina, hg18 reference • Total of 3730 CNV calls • 2165 losses, 1565 gains Size Distribution

  10. Preliminary Results Sensitivity: Kidd et al.’s (2008) LOSS calls (141 calls) Percentage of Kidd’s calls that overlap one of ours: After randomly shuffling our calls: Specificity: Database of Genomic Variants (DGV) Percent of our calls that overlap with DGV: • After randomly shuffling • our calls:

  11. Conclusion • Presented a method for detecting CNVs • Combines • depth-of-coverage • paired-end mapping • Improves • compared to paired-end mapping: • Increased sensitivity in repeating regions • segmental duplications • compared to depth-of-coverage methods: • better resolution (1Kb vs. 30Kb) • Global optimization approach

  12. Detecting Copy Number Variation Paul Medvedev Marc Fiume Misko Dzamba Tim Smith Adrian Dalca Mike Brudno Genome Informatics 2009

More Related