Main Content

GDC Data Portal

Data Portal

The GDC Data Portal is a web-based platform that allows users to search, analyze, and download data from cancer genomic studies. 

GDC Data Portal Key Features

Create Custom Cohorts

The GDC Data Portal centers around the idea of building cohorts, or groups of cases, before analyzing or downloading data. With the Cohort Builder, users filter and select cases according to clinical, biospecimen, and other available data elements to create custom cohorts for analysis.

Analyze Custom Cohorts

Users can find a variety of interative analysis tools centrally located with the Analysis Center. Mutation frequency, gene expression clustering, clinical data analysis, and a variety of other gene- and variant-level analysis tools are available.

Browse Harmonized Data

The Repository is where data files associated with each case in the current cohort can be browsed and downloaded. It also offers file filters for identifying files of interest.

View Project Information

Data in the GDC is organized by projects. Typically, projects are comprised of a particular type of cancer and are undertaken as part of a larger cancer research program. The GDC Data Portal allows users to access aggregate project-level information via the Projects tool and Project Summary Pages.

 

Create Personalized Cart

As users navigate throughout the Repository, they may add files to their personalized cart. The cart provides detailed statistics about the added files including total volume, associated projects, and access level.

Download Data Quickly and Securely

The GDC Data Portal provides two primary channels for downloading data:

When downloading data via the GDC Data Transfer Tool, the GDC Data Portal generates a file manifest that can be imported into the GDC Data Transfer Tool to initiate the download. Note that certain files require users to obtain controlled access authorization, which is managed by the NIH database of Genotypes and Phenotypes (dbGaP). Please visit Obtaining Access to GDC Data and Resources for information about how to obtain permission to access controlled data sets through dbGaP.

View Slide Images

The GDC Data Portal provides an image viewing option allowing researchers to view, zoom, and pan tissue slide images associated with a case. Slide images can also be downloaded in the original format (SVS) and are accessible via the GDC API.

Detailed instructions for the above features are available in the GDC Data Portal User's Guide. Information on GDC Data Portal Releases is available in the GDC Data Portal Release Notes.

Accessing the GDC Data Portal

Supported Internet Browsers

The GDC Data Portal is compatible with most modern web browsers, including:

  • Most recent supported stable version of Microsoft Edge on Microsoft Windows
  • Most recent stable version of Google Chrome
  • Most recent stable version of Mozilla Firefox

Release Notes

Release Notes are available on the GDC Data Portal Release Notes page.