Submit Data

Submitting Data to the GDC

Submitting data from your projects to the GDC is beneficial because the GDC:

  • Provides easy-to-use tools and guides that support best practices for submitting high-quality datasets. The tools validate data against GDC standard data types and file formats to enable cross-study comparisons.
  • Implements data harmonization pipelines that harmonize DNA and RNA sequence data against the latest reference genome (GRCh38).
  • Interfaces with NIH eRA Commons and the database of Genotypes and Phenotypes (dbGaP) for secure access to controlled data sets and submissions.

Get Started: 

Documentation

Data Submission Tools

The GDC provides web-based tools for submitting small volumes of data as well as client tools for submitting high volumes of molecular data.

Data Submission Policy

Organizations interested in submitting data to the GDC must first apply for data submission through the NCBI database of Genotypes and Phenotypes (dbGaP).

Collaborating with GDC

The GDC encourages data sharing in support of precision medicine. Tools are provided to guide data submissions. For more information, please contact the GDC Help Desk:

Contact the GDC Help Desk

What’s New with GDC Data Submissions

The GDC provides announcements of new data submitted to the GDC.

News and Announcements

From the GDC FAQ

How many bytes are there in a megabyte or gigabyte?

There has been long-standing debate about prefixes for multiples of bytes. We have chosen to use the standard supported by the International System of Units (SI), where 1 gigabyte (GB) = 10⁹ bytes and 1 megabyte (MB) = 10⁶ bytes. This convention is also supported by the IEEE, EU, NIST, and the International System of Quantities. Where appropriate, we use the IEEE 1541 recommendations for binary representation, where 1024³ bytes = 1 gibibyte (GiB) and 1024² bytes = 1 mebibyte (MiB).
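
The difference between the two conventions can be illustrated with a short Python sketch. It is not GDC code: the constant names and the format_size helper are hypothetical and introduced only for this example. It shows how the same byte count reads in SI gigabytes and in binary gibibytes.

```python
# SI (decimal) units, as used for MB and GB in the answer above.
MB = 10**6
GB = 10**9

# IEEE 1541 binary units.
MiB = 1024**2
GiB = 1024**3


def format_size(num_bytes: int, binary: bool = False) -> str:
    """Format a byte count in GB (SI) or GiB (binary), to two decimals.

    Hypothetical helper for illustration only; not part of any GDC tool.
    """
    if binary:
        return f"{num_bytes / GiB:.2f} GiB"
    return f"{num_bytes / GB:.2f} GB"


size = 5 * GB  # a file of exactly 5 gigabytes (SI)
print(format_size(size))               # 5.00 GB
print(format_size(size, binary=True))  # 4.66 GiB
```

A file reported as 5 GB under the SI convention is therefore about 4.66 GiB, which is why a size can appear smaller in a tool that reports binary units.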

Need Assistance?

Need help with data retrieval, download, or submission?

Visit the GDC Support Page