1 / 37

TCGA: Past, Present & Future

TCGA: Past, Present & Future. July 11, 2013 Kenna Shaw, Ph.D. Director TCGA Program Office National Cancer Institute. TCGA: Core Objectives. Launched in 2006 as a pilot and expanded in 2009, the goals of TCGA are to: Establish infrastructure for effective team science

yered
Download Presentation

TCGA: Past, Present & Future

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. TCGA: Past, Present & Future July 11, 2013 Kenna Shaw, Ph.D. Director TCGA Program Office National Cancer Institute

  2. TCGA: Core Objectives Launched in 2006 as a pilot and expanded in 2009, the goals of TCGA are to: • Establish infrastructure for effective team science • Develop a scalable “pipeline” • Demonstrate the feasibility of a large-scale, high throughput approach to identifying the molecular ‘parts-list’ • Make the data publicly and broadly available to the cancer community while protecting patient privacy

  3. TCGA: Timeline Analysis Completion Pilot Project Expansion 2005 2006-2009 2015-2016 2010-2014 GBM Report Ovarian Report ARRA Funding Rare Projects Initiated 9 tumor types closed 10,000 cases complete NCAB Report • Pilot Projects: GBM and Ovarian carcinoma (~500 cases ea.) • Establish infrastructure for effective team science • Develop a scalable “pipeline” • Demonstrate the feasibility of a large-scale, high throughput approach to identifying the molecular ‘parts-list’ • Make the data publicly and broadly available to the cancer community while protecting patient privacy • Expansion 2010 to 2014: • Add 25-35 tumor types • Enhancement of sample acquisition & program staff • Add Genome Data Analysis Centers • Publish “Benchmark Marker Papers” • Established FFPE protocols • Completely characterize 10,000th case • Analysis Completion 2015-2016: • Finish marker papers on rare & “challenging-to-accrue” tumors • Complete Pan-Cancer Analysis • Broader sharing of tools, analytical methods

  4. TCGA: “No Platform Left Behind” • Multiple data types • Clinical diagnosis • Treatment history • Histologic diagnosis • Pathologic report/images • Tissue anatomic site • Surgical history • Gene expression/RNA sequence • Chromosomal copy number • Loss of heterozygosity • Methylation patterns • miRNA expression • DNA sequence • RPPA (protein) • Subset for Mass Spec 25* forms of cancer glioblastoma multiforme(brain) squamous carcinoma(lung) serouscystadenocarcinoma(ovarian) Etc. Etc. Etc. Biospecimen CoreResource with more than 150 Tissue Source Sites 6 Cancer GenomicCharacterization Centers 3 GenomeSequencingCenters 7 Genome Data Analysis Centers Data Coordinating Center

  5. TCGA Tumor Types • Hepatocellular • Lower Grade Glioma • Lung adenocarcinoma* • Lung squamous* • Melanoma • Ovarian serous cystadenocarcinoma* • Papillary kidney • Pancreas • Prostate • Sarcoma (dedifflipo, UPS, leiomyosarcoma) • Papillary Thyroid* • AML • Breast Ductal* • Breast Lobular/Breast Other • Bladder (pap and non-pap) • Cervical adeno & squamous • Colorectal* • Clear cell kidney* • DLBCL • Endometrial carcinoma* • Esophageal adeno & squamous • Gastric adenocarcinoma • GBM* • Head and Neck Squamous* Red- Pilot tumors; *- Reached 500

  6. Rare Tumor Project (Approved March 2012) • Adrenocortical Carcinoma* • Chromophobe kidney* • Mesothelioma • Paraganglioma/Pheochromocytoma • Uterine Carcinosarcoma* • Thymoma • Uveal Melanoma • Testicular Germ Cell • Cholangiocarcinoma • Diffuse Large B Cell Lymphoma

  7. Tumor Project Progress * Manuscript submitted or published Analysis underway * * Sample acquisition phase * * * * * * * Rare tumor project * Only accepting AA cases/500 target reached

  8. Sample Criteria Limit ‘Askable’ Questions • Primary,adult tumors (except for melanoma and triplets) • Malignant (no in situ cases) • Snap frozen, <60min from clamp to LN2 • ~ 50 mg (biopsies starting to be feasible) • Pathology review of tissue sent to TCGA • No more than 20% necrosis ; ≥ 60%* tumor cells • No prior treatment • Matchedsource of germline: Blood (buffy coat/white cells)/saliva or skin for liquid tumors • Clinical annotation; but not pre-analytic variables • IRB approval for use in TCGA; proactive consenting for genomic studies • MTA w/out retention of IP 10,000 10

  9. TCGA: What’s in a Core Data Set? Data from Tissue Source Sites Data Generated by GCCs &GSCs Core Data Set • Synopic path report • Histology images • Required clinical data • Whole exome • SNP 6.0 array • mRNAseq • miRNAseq • Methylation array • Complete path report • Paired metastatic samples • Double normals • Treatment data • 50X WGS • 8X WGS • Methylseq • RPPA (May not apply to GBM/Ovarian cases collected during the pilot phase)

  10. TCGA: The Pipeline for Comprehensive Characterization Tissue Sample GDAC Pathology QC Sequencing Data Storage at DCC & CGHub Integrative Analysis DNA & RNAIsolation, QC Expression,CNA & LOH,Epigenetics Comprehensive Characterization of a Cancer Genome 3 months – 2 years SNP 6.0 ~45d ~90d Methylation ~60d miRNAseq ~105d mRNAseq ~120d ~12-24 months DNAseq Exome ~180d

  11. TCGA: Lessons Learned from the Data Pathway Analyses #1: It’s About Pathways and Integrated Analyses

  12. TCGA: Lessons Learned from the Data Survival Analyses

  13. TCGA: Lessons Across CancersLearned from the Data #2: Generating the comprehensive parts list will require many tumor types.

  14. TCGA: Lessons Learned from the Data ultramutators Female GBM 2.5y Alive and Well M. Lawrence & G. Getz/Broad Institute

  15. TCGA: Lessons Learned from the Data M. Lawrence & G. Getz/Broad Institute

  16. TCGA: Lessons Learned from the Data M. Lawrence & G. Getz/Broad Institute

  17. Genomic Data Complements Histopathologic Dx 1q amplification 24% of high-grade endometrioid tumors cluster with serous tumors (serous-like) • Andrew Cherniack, Broad

  18. TCGA: Lessons Learned from the Data #3: Publicly accessible, high-quality data is necessary but not sufficient to build a community of thousands of users.

  19. Access TCGA through PortalDownload Data Your Favorite Way

  20. Where to find TCGA Sequence Data • Moved from Short Read Archive (SRA at NCBI) to UCSC • Open for downloads as of January 2012 • Ready for 2 petabytes now, 5Pb (5 x 1015) total by 2014 • General Parallel File System, Dual RAID 6 subsystems, Redundant I/O paths • Currently holds >30,000 files • Co-location opportunities in same data center for groups who want to compute on the data • User support: support@cghub.ucsc.edu https://cghub.ucsc.edu

  21. Ongoing Pipeline Analysis Available http://gdac.broadinstitute.org

  22. Analysis Portal Examples: MSK’s cBio Phenotype & Clinical tracks Cancer tracks http://www.cbioportal.org/public-portal/ Are there differences in survival? I’m interested in “X” gene pathway in colorectal cancer… I’m interested in Patient TCGA-XX-YYYY “I’m curious about this specific region in detail” LBL’s BioSig Image Analysis Tool http://tcga.lbl.gov/biosig/tcgadownload.do UCSC’s Cancer Genome Browser http://genome.ucsc.edu/ MSKCC’s cBIO Genomics Portal http://www.cbioportal.org/public-portal/ MSKCC’s cBIO Genomics Portal http://www.cbioportal.org/public-portal/

  23. TCGA: The Pipeline for Comprehensive Characterization Tissue Sample GDAC Pathology QC Sequencing Data Storage at DCC & CGHub Integrative Analysis DNA & RNAIsolation, QC Expression,CNA & LOH,Epigenetics Comprehensive Characterization of a Cancer Genome • Model Informed Consents • Novel contracting methods to incentivize quality • FFPE sequencing/array protocols • Benchmarking datasets • Innovating cancer genomes analysis • Cross-tumor discovery • Innovating Data storage/release policies • New data standards (e.g. cancer .vcf) • FFPE isolation protocols • Remote pathology review • Reduction of batch effects

  24. Genomics-Driven Cancer Medicine: Guiding Principles for the Center for Cancer Genomics Principle #1: Molecular pathways involved in tumor survival and progression are often activated by genetic alterations. Garraway, J. Clin. Oncol., 2013

  25. Genomics-Driven Cancer Medicine: Guiding Principles for the Center for Cancer Genomics Principle #1: Molecular pathways involved in tumor survival and progression are often activated by genetic alterations. Principle #2: Anticancer agents targeting many oncogenic pathways have entered clinical trials. Garraway, J. Clin. Oncol., 2013

  26. Genomics-Driven Cancer Medicine: Guiding Principles for the Center for Cancer Genomics Principle #1: Molecular pathways involved in tumor survival and progression are often activated by genetic alterations. Principle #2: Anticancer agents targeting many oncogenic pathways have entered clinical trials. Principle #3: Genomics technologies enable robust tumor genomic profiling in the clinical arena.

  27. Center for Cancer Genomics (CCG)Portfolio • Signature CCG projects • Driven by CCG • Address large questions by coordinating across studies/divisions • e.g. 10,000 cases from CCG, DCTD, DCEG, CCR, DCCPS…. • Strategic partnerships with other NCI Divisions • Share bio-informatic tools/pipelines • Share SOPs for genomic analysis • Provide genome data quality assurance

  28. Strategic CCR partnerships with other NCI Divisions FFPE Samples from Clinical Trials Biospecimen Core Repository (BCR) • Tumor Pathology QC • % Tumor Nuclei • % Necrosis • Dx Confirmation via histology and pathology report • Molecular Analyte QC • Spectrophotometry • RNA Bioanalyzer • Electrophoresis • Genotyping TCGA-like Genome Characterization Pipeline Exome seq Whole genome seq RNA seq MicroRNA seq Affy 6.0 SNP DNA Methylation Genome Data Analysis Centers (GDACs) Genetic aberrations:Mutations Copy number Translocations Data analysis: GEP / miRNA / methylation subgroups Co-occurrence / exclusion Comparison to prior TCGA samples Data integration: Functional vs. structural Master regulator analysis Pathway analysis

  29. Shifting the NCI Paradigm for Clinical Studies • Collect, store and annotate biospecimens from all/most patients on NCI clinical trials • FULL genomic characterization • Competition for pivotal trials with central funds • Encourage competition from grant pool • Incorporate essential epidemiological data Ex: ALChEMIST questionnaire

  30. CCG Role in Precision Medicine • Enable NCI clinical trials to use genomic data optimally: • Understand response/resistance to Rx • Comparison of cases on different trials • Unify data and computational resources

  31. Acknowledgements Kenna Liming Roy Zhining Emma Margi John Julia Martin Greg Pritty Shaw Yang Tarnuzzer Wang Spaulding Sheth Demchok Zhang Ferguson Eley Joshi

  32. TCGA Program Office Responsibilities 25+ >500 1 3 Sequencing Centers Disease Working Groups Tissue Source Sites 5 Characterization Centers Biospecimen Core Resource 2 Data Portals • MTA/IRB review • Path review • Molecular co-isolation • Analyte QC including RIN • Door to door tracking of all samples • Collection of clinical data • Distribution of all molecular and clinical data to DCC • Coordination of expert pathology committees • MSI, SNP, HPV testing • Shipments to centers • MTA • IRB • Contract • Specimen Collection • Pre screening & path review • Deliver • Pathology Reports • Images • Clinical Data • Follow up • Inclusion specifications • CRFs • Identification of Tissue Source Sites • Often more than one group for each of the 25+ tumor projects 6 GDACs

  33. Biospecimen Core Resource Review Whole Genome Amplification (Qiagen) BCR Characterization Centers Collection of Clinical Data Elements Tissue Source Sites (TSS) Preliminary Pathology Review Qualified • Tumor Pathology QC • % Tumor Nuclei • % Necrosis • Dx Confirmation via histology and pathology report Data Coordinating Center • Molecular Analyte QC • Spectrophotometery • RNA Bioanalyzer • Electrophoresis • Genotyping Sequencing Centers Diagram courtesy R. Penny, International Genomics Consortium

  34. TCGA: Platforms- Then and Now *- Not a core platform; Not all samples will have data file for this platform More information on platforms and data available at: http:/tcga-data.nci.nih.gov/tcga/tcgaPlatformDesign.jsp

  35. Minimum Clinical Data Set (100% required) • TSS name/identifier/patient #  • Primary site of disease • Histological type • Tumor grade • Gender • Date of birth (month/year)  • Prior diagnosis of prior neoplasm (if yes, another form required) • Date of initial pathologic diagnosis (month/year) • Neoadjuvant therapy • Tumor stage • Date of last contact (month/year) • Vital status (month/year) • Date of death, if applicable (month/year) • Tumor-type specific elements developed by Tumor Project Group

  36. Examples of Tumor Specific Questions • Smoking status (lung/bladder) • ER/PR/Her2+ (breast) • Biochemical recurrence (prostate) • Residual tumor after surgery (ovarian/sarcoma) • Prior malignancy details (melanoma) • International Prognostic Index (lymphoma)

  37. TCGA Data Portal https://tcga-data.nci.nih.gov/tcga/

More Related