Submit Data

Submitting Data to the GDC

Submitting data from your projects to the GDC is beneficial because the GDC:

  • Provides easy-to-use tools and guides that support best practices for submitting high-quality datasets. The tools validate data against GDC standard data types and file formats to enable cross-study comparisons.
  • Implements pipelines that harmonize DNA and RNA sequence data against the latest reference genome (GRCh38).
  • Interfaces with NIH eRA Commons and the database of Genotypes and Phenotypes (dbGaP) to provide secure access to controlled datasets and submissions.

Get Started: 

Documentation

Data Submission Tools

The GDC provides web-based tools for submitting small volumes of data as well as client tools for submitting high volumes of molecular data.

Data Submission Policy

Organizations interested in submitting data to the GDC must first apply for data submission through the NCBI database of Genotypes and Phenotypes (dbGaP).

Collaborating with GDC

The GDC encourages data sharing in support of precision medicine. Tools are provided to guide data submissions. For more information, please contact the GDC Help Desk:

Contact the GDC Help Desk

What’s New with GDC Data Submissions

The GDC publishes announcements of new data submitted to the GDC.

News and Announcements

From the GDC FAQ

Why do the metadata files I am trying to submit fail to validate?

The GDC Data Submission Portal checks XML, JSON, and TSV metadata files for validity when they are submitted. If your files fail to validate, check the error report and consult the GDC Data Dictionary to troubleshoot the errors. Additional information on supported files and formats can be found on the GDC Data Model and File Formats pages and in the GDC Data Submission Portal User's Guide.
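
Some validation failures can be caught locally before upload. The following is a minimal sketch, not the portal's actual validation logic: it checks a TSV metadata file for required columns and empty required values. The function name check_tsv_metadata and the REQUIRED_COLUMNS list are hypothetical and chosen for illustration; the real required fields for each entity type are defined in the GDC Data Dictionary.

```python
import csv
import io

# Hypothetical required columns, for illustration only. The authoritative
# list for each entity type comes from the GDC Data Dictionary.
REQUIRED_COLUMNS = ["type", "submitter_id"]


def check_tsv_metadata(text, required=REQUIRED_COLUMNS):
    """Return a list of human-readable problems found in TSV metadata text."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    header = reader.fieldnames or []
    problems = [f"missing column: {name}" for name in required if name not in header]
    # Line numbers count the header as line 1, so data rows start at line 2.
    for line_no, row in enumerate(reader, start=2):
        for name in required:
            # Short rows yield None for missing trailing fields.
            if name in header and not (row[name] or "").strip():
                problems.append(f"line {line_no}: empty value for {name}")
    return problems


sample = "type\tsubmitter_id\ncase\tCASE-001\ncase\t\n"
for problem in check_tsv_metadata(sample):
    print(problem)
```

A check like this only covers structural problems. The portal's error report remains the authoritative source for what failed and why.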

Need Assistance?

Need help with data retrieval, download, or submission?

Visit the GDC Support Page