3 August 2020; 10.1038/s41588-020-0673-7
Cervical cancer is the most common cancer affecting sub-Saharan African women. Persistent human papillomavirus (HPV) infection is necessary but not sufficient for the development of cervical cancer. Cervical cancer is considered to be an AIDS-defining malignancy as it is 5 times more likely to develop in HIV-positive (HIV+) women than in those that are HIV-negative (HIV-). The Human Tumor Molecular Characterization Project performed comprehensive profiling of cancer genomes, transcriptomes, or epigenomes to fill the knowledge gap in this population. We characterized 118 tumors from East-African patients, of which 72 were HIV+, and performed extended mutation analysis on an additional 89 cases, 16 of which were HIV+. Of 12 significantly mutated genes in our cohort, PIK3CA was the most recurrent as reported in other studies. Some genes which were mutated in other cohorts were not in ours. Analysis of copy number landscapes revealed that broad copy number alterations were comparable between HIV+ and HIV- samples. We detected HPV clade-specific differences in tumor DNA methylation, promoter- and enhancer-associated histone marks, gene expression and pathway dysregulation. Histone modification changes at HPV integration events were correlated with upregulation of nearby genes and endogenous retroviruses.
To probe the impact of viral gene expression on tumor gene expression, we performed unsupervised clustering of viral E1,E2, E6, and E7 transcripts which had annotations associated with them in GenBank (download date: December 2019). A list of these references can be accessed via the “CGCI HTMCP-CC HPV Transcript References (December 2019)” link in the “Supplemental Data” section below.
This study characterized CGCI cases as described below:
118 Discovery Cases | 89 Validation Cases | 5 Excluded Cases* |
---|---|---|
118 cases - mRNA-seq | 89 cases - Targeted Capture Sequencing | 5 cases - mRNA-seq |
113 cases - miRNA-seq | 5 cases - miRNA-seq | |
115 cases - Methylation Array | 5 cases - Methylation Array | |
52 cases - ChIP-Seq | 5 cases - WGS | |
118 cases - WGS |
* 5 cases were excluded from the study because it was determined that they did not have cervical cancer, but instead had uterine or endometrial cancer. Data related to these excluded cases was still submitted as part of this publication.
In order to access controlled CGCI HTMCP-CC data, users must submit an application via dbGaP. To begin the application process, please view the information provided on the dbGaP Authorized Access Login Page under “dbGaP Data Download”.
Supplemental Data
- GDC Manifests
- Controlled-Access Data Download Manifest (5 Files)
- Open-Access Data Download Manifest (7 Files)
- CGCI HTMCP-CC mRNA-Seq
- CGCI HTMCP-CC mRNA-Seq Level 3 Data
- CGCI HTMCP-CC mRNA-Seq Level 3 Data (Controlled)
- CGCI HTMCP-CC miRNA-Seq
- CGCI HTMCP-CC ChIP-Seq
- CGCI HTMCP-CC ChIP-Seq Level 2 Data (Controlled)
- CGCI HTMCP-CC ChIP-Seq Level 3 Data (Controlled)
- CGCI HTMCP-CC ChIP-Seq Metadata
- CGCI HTMCP-CC Methylation Array
- CGCI HTMCP-CC DNA-Seq
- CGCI HTMCP-CC WGS Level 3 Data (Controlled)
- CGCI HTMCP-CC Targeted Capture Sequencing Level 3 Data (Controlled)
- CGCI HTMCP-CC Targeted Capture Sequencing Metadata
- CGCI HTMCP-CC Sample Matrix
- CGCI HTMCP-CC Transcript References
Additional Resources
- Office of Cancer Genomics (link is external)
- OCG CGCI Data Matrix (link is external)
Instructions for Data Download
Open Access Data
- Download the appropriate manifest file from the publication page
- Use the manifest file to download data using the GDC Data Transfer Tool (DTT) or the GDC API
- GDC DTT ( Download, User's Guide)
- GDC API ( User’s Guide)
Controlled Access Data
- Download the appropriate manifest file from the publication page
- Download a token from the GDC Data Portal
- GDC Data Portal ( Launch, User’s Guide)
- Use the manifest file and token to download data using the GDC DTT or the GDC API
- GDC DTT ( Download, User’s Guide)
- GDC API ( User’s Guide)
For assistance, please contact the GDC Help Desk: support@nci-gdc.datacommons.io.