Nature. Volume 543: p378-384.
Cervical cancer remains one of the leading causes of cancer-related deaths worldwide. Here we report the extensive molecular characterization of 228 primary cervical cancers, the largest comprehensive genomic study of cervical cancer to date. We observed striking APOBEC mutagenesis patterns and identified SHKBP1, ERBB3, CASP8, HLA-A, and TGFBR2 as novel significantly mutated genes in cervical cancer. We also discovered novel amplifications in immune targets CD274/PD-L1 and PDCD1LG2/PD-L2, and the BCAR4 lncRNA that has been associated with response to lapatinib. HPV integration was observed in all HPV18-related cases and 76% of HPV16-related cases, and was associated with structural aberrations and increased target gene expression. We identified a unique set of endometrial-like cervical cancers, comprised predominantly of HPV-negative tumors with high frequencies of KRAS, ARID1A, and PTEN mutations. Integrative clustering of 178 samples identified Keratin-low Squamous, Keratin-high Squamous, and Adenocarcinoma-rich subgroups. These molecular analyses reveal new potential therapeutic targets for cervical cancers.
Data in the GDC
- GDC Manifests
- Open-Access Data - Download Manifest (25 Files)
- Controlled-Access Data - Download Manifest (6 Files) | WGS Files (10 Files)
Supplemental Data
- APOBEC Mutagenesis Analysis - APOBEC_output.zip
- Supplementary RPPA data: a cleaned-up version used for CESC analysis - RPPA_Data_Processed_155_Samples_4-10-15.xlsx
- Participant List, Sample List, BAM File List, and Full Listing of TCGA Archives for Data Freeze
- Participant List - CESC_case_freeze.txt
- Sample List - CESC_sample_freeze.txt
- Data Freeze List - CESC_data_freeze.tsv
- BAM File Freeze List - TCGA_CESC_GRCh37_Exome_BAMs_UUID.tsv
- Extended Case List: Used in analysis that have RPPA and/or mutation data - SupplementalTable2-CESC_Extended_Set_Data_Summary_Table.xlsx
- Clinical
- XML Files Containing Biospecimen Processing and Clinical Data - nationwidechildrens.org_CESC.bio.Level_1.tar.gz
- Mutations
- Publication MAF file - genome.wustl.edu_CESC.IlluminaGA_DNASeq_curated.Level_2.1.0.0.somatic.maf
- Reverse Phase Protein Array (RPPA) Expression
- Level 3 Data Archives - CESC.MDA_RPPA_Core.Level_3.tar.gz
- Level 2 Data Archives - CESC.MDA_RPPA_Core.Level_2.tar.gz
- Level 1 Data Archives - CESC.MDA_RPPA_Core.Level_1.tar.gz
- MAGE-TAB Data Archives - CESC.MDA_RPPA_Core.mage-tab.tar.gz
- RNA Expression from IlluminaHiseq RNASeq
- Level 3 Data Archives - unc.edu_CESC.IlluminaHiSeq_RNASeqV2.Level_3.1.11.0.tar.gz
- MAGE-TAB Data Archives - unc.edu_CESC.IlluminaHiSeq_RNASeqV2.mage-tab.1.12.0.tar.gz
- SNP and Copy Number variation from Affymetrix SNP6
- Level 3 Data Archives - CESC.Genome_Wide_SNP_6.Level_3.tar
- Level 2 Data Archives - CESC.Genome_Wide_SNP_6.Level_2.tar
- Level 1 Data Archives - CESC.Genome_Wide_SNP_6.Level_1.tar
- MAGE-TAB Data Archives - CESC.Genome_Wide_SNP_6.mage-tab.1.2016.0.tar.gz
- IlluminaHiSeq Low Pass Whole Genome Sequencing
- Level 3 Data Archives - CESC.LowPass_CN.Level_3.tar.gz
- MAGE-TAB Data Archives - CESC.LowPass_CN_mage-tab.tar
- Structural Variants - lowpassWGS.50.fusions.txt
- Purity and Ploidy Estimates - purity_ploidy_CESC.txt
- Copy Number Variation Summary - cnv_summary_conf_99_CESC_Core_List.zip
- Somatic Single nucleotide variation in mtDNA - CESC.20140618.somatic.maf.tab
- miRNA from Illumina HiSeq 2000
- Level 3 Data Archives - CESC.IlluminaHiSeq_miRNASeq.Level_3.tar.gz
- MAGE-TAB Data Archives - CESC.IlluminaHiSeq_miRNASeq.mage-tab.tar.gz
- Methylation from Illumina Infinium Human Methylation450
- Level 3 Data Archives - jhu-usc.edu_CESC.HumanMethylation450.Level_3.tar.gz
- Level 2 Data Archives - jhu-usc.edu_CESC.HumanMethylation450.Level_2.tar.gz
- Level 1 Data Archives - jhu-usc.edu_CESC.HumanMethylation450.Level_1.tar.gz
- MAGE-TAB Data Archives - jhu-usc.edu_CESC.HumanMethylation450.mage-tab.tar.gz
Additional Resources
- Broad Institute FireBrowse Portal (link is external) The Broad Institute
- cBioPortal for Cancer Genomics (link is external) Memorial Sloan-Kettering Cancer Center
- TCGA Batch Effects (link is external) MD Anderson Cancer Center
- Regulome Explorer (link is external) Institute for Systems Biology
- Next-Generation Clustered Heat Maps (link is external) MD Anderson Cancer Center
Instructions for Data Download
Open Access Data
- Download the appropriate manifest file from the publication page
- Use the manifest file to download data using the GDC Data Transfer Tool (DTT) or the GDC API
- GDC DTT (Download, User's Guide)
- GDC API (User’s Guide)
Controlled Access Data
- Download the appropriate manifest file from the publication page
- Download a token from the GDC Data Portal
- GDC Data Portal (Launch, User’s Guide)
- Use the manifest file and token to download data using the GDC DTT or the GDC API
- GDC DTT (Download, User’s Guide)
- GDC API (User’s Guide)
For assistance, please contact the GDC Help Desk: support@nci-gdc.datacommons.io.