The raw sequence files, typically stored as BAM or FASTQ, make up the bulk of data. The size for a single file can vary greatly depending on the specific analysis; However, some of the whole genome BAM files in The Cancer Genome Atlas (TCGA) reach sizes of 200-300 GB. In such cases, a high-performance data download and submission tool is essential.
Below are basic instructions and links for downloading the GDC Data Transfer Tool client-based interface for data downloads and submission and user interface (beta version) for data downloads. For additional instructions, please visit the GDC Data Transfer Tool User's Guide.
The GDC Data Transfer Tool Client provides a command-line interface supporting both GDC data downloads and submissions.
The system recommendations for using the GDC Data Transfer Tool Client are as follows:
Links to the binary distributions for supported platforms are provided below.
Latest Version of Data Transfer Tool
If you are a user of RedHat Enterprise Release 6 and wish to use the Data Transfer Tool Client, contact the GDC Help Desk for assistance.
Release Notes are available on the GDC Data Transfer Tool Client Release Notes page.
Please visit the GDC Help Desk
The GDC Data Transfer Tool User Interface provides a user-friendly interface to the GDC Data Transfer Tool Client for downloading data from the GDC. The GDC Data Transfer Tool User Interface is a Beta version only that is undergoing testing by the research community.
The system recommendations for using the GDC Data Transfer Tool User Interface are as follows:
Links to the binary distributions for supported platforms are provided below.
Release Notes are available on the GDC Data Transfer Tool UI Release Notes page.
Please visit the GDC Help Desk
NIH… Turning Discovery Into Health ®