Raw sequence files, typically stored as BAM or FASTQ, make up the bulk of data in the GDC. The size for a single file can vary greatly depending on the specific analysis; However, some of the whole genome BAM files in The Cancer Genome Atlas (TCGA) reach sizes of 200-300 GB. In such cases, a high-performance data download and submission tool, such as the GDC Data Transfer Tool, is essential.
The GDC Data Transfer Tool provides an optimized method of transferring data to and from the GDC and enables resumption of interrupted transfers. The GDC Data Transfer Tool consists of:
Below are basic instructions and links for downloading the GDC Data Transfer Tool Client and UI. For additional instructions, please visit the GDC Data Transfer Tool User's Guide.
Platform | GDC Data Transfer Tool Client | GDC Data Transfer Tool UI |
---|---|---|
MAC OSX x64 | ![]() |
![]() |
Windows x64 | ![]() |
![]() |
Ubuntu x64 | ![]() |
![]() |
If you are a user of RedHat Enterprise Release 6 and wish to use the Data Transfer Tool Client, please contact the GDC Help Desk for assistance. For building the binaries from the source code, please refer to the GDC DTT Client GitHub Repository.
NIH… Turning Discovery Into Health ®