Cell, Volume 173, Issue 2, p400-416.e11, 5 April 2018. DOI: 10.1016/j.cell.2018.02.052
For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of >11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear consistent with cancer genomics studies independent of the TCGA effort, and provide new opportunities for investigating cancer biology using clinical correlates at an unprecedented scale.
Data in the GDC
- GDC Manifests
- Open-Access Data - Download Manifest (1 File)
Supplemental Data
- Miscellaneous Files
- TCGA-CDR - TCGA-CDR-SupplementalTableS1.xlsx
Additional Resources
- Broad Institute FireCloud (The Broad Institute)
- cBioPortal for Cancer Genomics (Memorial Sloan-Kettering Cancer Center)
- PanCanAtlas Additional Files
Instructions for Data Download
Open Access Data
- Download the appropriate manifest file from the publication page
- Use the manifest file to download data using the GDC Data Transfer Tool (DTT) or the GDC API
- GDC DTT (Download, User’s Guide)
- GDC API (User’s Guide)
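The open-access steps above can be sketched in Python. This is a minimal illustration, not the GDC's own tooling. It assumes the manifest is a tab-separated file with an `id` column of file UUIDs, and that the public endpoint `https://api.gdc.cancer.gov/data` accepts a POST with a JSON body of the form `{"ids": [...]}`. The manifest rows and the function names are hypothetical; consult the GDC API User’s Guide for the authoritative request format.

```python
import csv
import io
import json
import urllib.request

# Assumed public GDC data endpoint; verify against the GDC API User's Guide.
GDC_DATA_ENDPOINT = "https://api.gdc.cancer.gov/data"


def read_manifest_ids(manifest_text):
    """Return the file UUIDs from the 'id' column of a tab-separated manifest."""
    reader = csv.DictReader(io.StringIO(manifest_text), delimiter="\t")
    return [row["id"] for row in reader if row.get("id")]


def build_download_request(file_ids):
    """Build (but do not send) a POST request for the given file UUIDs."""
    body = json.dumps({"ids": file_ids}).encode("utf-8")
    return urllib.request.Request(
        GDC_DATA_ENDPOINT,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical two-row manifest, for illustration only.
EXAMPLE_MANIFEST = (
    "id\tfilename\tmd5\tsize\tstate\n"
    "00000000-0000-0000-0000-000000000001\texample_a.txt\taaaa\t10\treleased\n"
    "00000000-0000-0000-0000-000000000002\texample_b.txt\tbbbb\t20\treleased\n"
)

ids = read_manifest_ids(EXAMPLE_MANIFEST)
request = build_download_request(ids)
```

To actually download, pass `request` to `urllib.request.urlopen` and write the response bytes to disk. The request is only constructed here so that the sketch runs without network access.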
Controlled Access Data
- Download the appropriate manifest file from the publication page
- Download a token from the GDC Data Portal
- GDC Data Portal (Launch, User’s Guide)
- Use the manifest file and token to download data using the GDC DTT or the GDC API
- GDC DTT (Download, User’s Guide)
- GDC API (User’s Guide)
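For controlled-access data, the same two routes apply with the token added. The sketch below is a hedged illustration under stated assumptions: that the GDC API reads the token from an `X-Auth-Token` request header, and that the Data Transfer Tool is invoked as `gdc-client download` with `-m` for the manifest and `-t` for the token file. The function names, file paths, and placeholder token are hypothetical; check both User’s Guides before relying on the details.

```python
import json
import urllib.request


def build_controlled_request(file_ids, token,
                             endpoint="https://api.gdc.cancer.gov/data"):
    """Build (but do not send) a POST request that carries the GDC token.

    Assumes the API expects the token in an 'X-Auth-Token' header.
    """
    body = json.dumps({"ids": file_ids}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={
            "Content-Type": "application/json",
            "X-Auth-Token": token.strip(),  # drop a trailing newline from the token file
        },
        method="POST",
    )


def dtt_command(manifest_path, token_path=None):
    """Return the argument list for a Data Transfer Tool download.

    Assumes the 'gdc-client download -m <manifest> -t <token>' form.
    """
    command = ["gdc-client", "download", "-m", manifest_path]
    if token_path is not None:
        command += ["-t", token_path]
    return command


# Hypothetical values, for illustration only.
controlled_request = build_controlled_request(
    ["00000000-0000-0000-0000-000000000001"], "PLACEHOLDER-TOKEN\n"
)
controlled_command = dtt_command("gdc_manifest.txt", "gdc-user-token.txt")
```

The argument list can be passed to `subprocess.run` once `gdc-client` is installed and on the `PATH`. Treat the token file like a password: keep it out of version control and shared directories.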
For assistance, please contact the GDC Help Desk: support@nci-gdc.datacommons.io.