GDC Data Transfer Tool

GDC Data Submission Client Tool icon The GDC provides a standard client-based mechanism in support of high-performance data downloads and submission.

The raw sequence files, typically stored as BAM or FASTQ, make up the bulk of data. The size for a single file can vary greatly depending on the specific analysis; However, some of the whole genome BAM files in The Cancer Genome Atlas (TCGA) reach sizes of 200-300 GB. In such cases, a high-performance data download and submission tool is essential.

Below are basic instructions and links for downloading the GDC Data Transfer Tool client-based interface for data downloads and submission and user interface (beta version) for data downloads. For additional instructions, please visit the GDC Data Transfer Tool User's Guide.

Downloading the GDC Data Transfer Tool Client

Description

The GDC Data Transfer Tool Client provides a command-line interface supporting both GDC data downloads and submissions.

System Recommendations

The system recommendations for using the GDC Data Transfer Tool Client are as follows:

  • OS: Linux (Ubuntu 14.x or later, CentOS 7), OS X (10.9 Mavericks or later), or Windows (7 or later)
  • CPU: Eight 64-bit cores, Intel or AMD
  • RAM: 8 GiB or more
  • Storage: Enterprise-class storage system capable of ≥1 Gb/s (Gigabit per second) write throughput and sufficient free space for BAM files, most of which are in the 50 MB - 40 GB size range, with some reaching sizes of 200-300 GB.

Binary Distributions

Links to the binary distributions for supported platforms are provided below.

If you are a user of RedHat Enterprise Release 6 and wish to use the Data Transfer Tool Client, contact the GDC Help Desk for assistance.

Source Code

Access GitHub Repository

Release Notes

Release Notes are available on the GDC Data Transfer Tool Client Release Notes page.

Support

Please visit the GDC Help Desk

Downloading the GDC Data Transfer Tool User Interface (Beta)

Description

The GDC Data Transfer Tool User Interface provides a user-friendly interface to the GDC Data Transfer Tool Client for downloading data from the GDC. The GDC Data Transfer Tool User Interface is a Beta version only that is undergoing testing by the research community.

System Recommendations

The system recommendations for using the GDC Data Transfer Tool User Interface are as follows:

  • OS: Linux (Ubuntu 14.x or later), OS X (10.9 Mavericks or later), or Windows (7 or later)
  • CPU: Eight 64-bit cores, Intel or AMD
  • RAM: 8 GiB or more
  • Storage: Enterprise-class storage system capable of ≥1 Gb/s (Gigabit per second) write throughput and sufficient free space for BAM files, most of which are in the 50 MB - 40 GB size range, with some reaching sizes of 200-300 GB.

Binary Distributions

Links to the binary distributions for supported platforms are provided below.

Source Code

Access GitHub Repository

Release Notes

Release Notes are available on the GDC Data Transfer Tool UI Release Notes page.

Support

Please visit the GDC Help Desk