Submit Data

Submitting Data to the GDC

Submitting data from your projects to the GDC is beneficial because the GDC:

  • Provides easy-to-use tools and guides supporting best practices in the submission of high-quality datasets. Tools validate data against GDC standard data types and file formats to enable cross study comparisons.
  • Implements data harmonization pipelines supporting DNA and RNA sequence harmonization against the latest reference genome (GRCh38)
  • Interfaces to NIH eRA Commons and the database of Genotypes and Phenotypes (dbGaP) for secure access to controlled data sets and submissions
Illustration: Submit Data User Friendly Submission Tools Validated Data Types and Formats Extensive Support

Data Submission Tools

The GDC provides web-based tools for submitting small volumes of data as well as client tools for submitting high volumes of molecular data.

Data Submission Policy

Organizations interested in submitting data into the GDC must first apply for data submission through the NCBI database of Genotypes and Phenotypes (dbGaP).

Collaborating with GDC

The GDC encourages data sharing in support of precision medicine. Tools are provided to guide data submissions. For more information, please contact the GDC Help Desk:

Contact the GDC Help Desk

What’s New with GDC Data Submissions

The GDC provides announcements of new data submitted to GDC.

News and Announcements

From the GDC FAQ

Does the GDC Data Transfer Tool use random or sequential read/write? Does the choice of protocol make a difference?

The GDC Data Transfer Tool uses sequential read/write for each file segment that is being transferred. By default, the tool executes multipart transfers, which results in multiple parallel, sequential read or write operations. To turn off multipart transfers, users can set the number of processes to 1.

Need Assistance?

Need help with data retrieval, download, or submission?