About the GDC

The Genomic Data Commons (GDC) is a research program of the National Cancer Institute (NCI). The mission of the GDC is to provide the cancer research community with a unified data repository that enables data sharing across cancer genomic studies in support of precision medicine.

The National Cancer Institute, part of the National Institutes of Health (NIH), is the federal government's principal agency for cancer research and training. NCI’s mission is to lead, conduct, and support cancer research across the nation to advance scientific knowledge and help all people to live longer, healthier lives. NCI’s scope of work spans a broad spectrum of cancer research across a variety of disciplines and supports research training opportunities at career stages across the academic continuum.

GDC Promoting Precision Medicine in Oncology

About the GDC: Promoting Precision Medicine in Oncology
June 1, 2016

The National Cancer Institute’s (NCI’s) Genomic Data Commons (GDC) is a data sharing platform that promotes precision medicine in oncology. It is not just a database or a tool; it is an expandable knowledge network supporting the import and standardization of genomic and clinical data from cancer research programs.

The GDC contains NCI-generated data from some of the largest and most comprehensive cancer genomic datasets, including The Cancer Genome Atlas (TCGA) and Therapeutically Applicable Research to Generate Effective Therapies (TARGET). For the first time, these datasets have been harmonized using a common set of bioinformatics pipelines, so that the data can be directly compared.

As a growing knowledge system for cancer, the GDC also enables researchers to submit data, and harmonizes these data for import into the GDC. As more researchers add clinical and genomic data to the GDC, it will become an even more powerful tool for making discoveries about the molecular basis of cancer that may lead to better care for patients.

The GDC provides the Research Community with the Following Benefits

  • Access to high-quality standardized biospecimen, clinical, and molecular data
  • Web-based tools supporting fine-grained queries, advanced visualization, smart search technologies, and personalized download facilities
  • Data harmonization pipelines supporting DNA and RNA sequence harmonization against a common reference genome
  • Programmatic interfaces supporting data retrieval, download, and submission by third party applications
  • Resources supporting the high performance retrieval, download, and submission of GDC data
  • Data submission tools for validating and submitting data into GDC
  • Data generation pipelines supporting the high level data generation of DNA sequence variants, mutation analyses, SNP chip genotypes, and expression analyses
  • Interfaces to eRA Commons and dbGaP for secure access to controlled data sets

GDC Involvement with Cancer Research

Learn more about how the GDC supports the translation of  biomedical research into clinical medicine:

Find cancer research highlights and publications:

GDC Team

The GDC is managed by a project team comprised of scientists and specialists from NCI, contracting organizations, and members of several focused external advisory groups.

From the GDC FAQ

What is the NCI Genomic Data Commons (GDC)?

The NCI Genomic Data Commons (GDC) is the next generation cancer knowledge network supporting the import and standardization of genomic and clinical data from cancer research programs (e.g. TCGATARGETCGCI), the harmonization of sequence data to the genome / transcriptome, and the application of state-of-the art methods for derived data (e.g. mutation calls, structural variants, etc.). The NCI part of the National Institutes of Health (NIH) and the U.S. Department of Health and Human Services (DHHS) established the GDC to provide the cancer research community with a data service supporting the receipt, quality control, integration, storage, and redistribution of standardized cancer genomic data sets derived from cancer studies. Please visit About the GDC for additional information.

Stay Informed

The latest news about the Genomic Data Commons (GDC):