The GDC provides access to multiple contributed datasets, including data from The Cancer Genome Atlas (TCGA), a landmark cancer genomics program that molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types.
TCGA data uses TCGA Codes and TCGA Study Abbreviations to help users interpret it. TCGA Codes provide information on data batches, types, and levels, as well as sample-level and center-level information. TCGA Barcodes in particular are the primary identifiers of TCGA biospecimen data. TCGA Study Abbreviations identify individual TCGA studies, which are typically organized by cancer type.
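As an illustration of how a TCGA Barcode carries biospecimen information, the sketch below splits an aliquot-level barcode into its fields. It assumes the commonly documented seven-part layout (project, tissue source site, participant, sample type plus vial, portion plus analyte, plate, center); the names `TcgaBarcode`, `parse_aliquot_barcode`, and `is_tumor` are hypothetical helpers introduced here, not part of any GDC tool, and shorter barcodes (for example participant-level or sample-level ones) are not handled.

```python
from typing import NamedTuple


class TcgaBarcode(NamedTuple):
    """Fields of an aliquot-level TCGA barcode (hypothetical helper type)."""
    project: str
    tss: str          # tissue source site
    participant: str
    sample_type: str  # two-digit sample type code
    vial: str
    portion: str
    analyte: str
    plate: str
    center: str


def parse_aliquot_barcode(barcode: str) -> TcgaBarcode:
    """Split a full aliquot barcode such as TCGA-02-0001-01C-01D-0182-01.

    Assumes the seven hyphen-separated parts described above; raises
    ValueError for anything that does not match that shape.
    """
    parts = barcode.strip().split("-")
    if len(parts) != 7:
        raise ValueError(f"expected 7 hyphen-separated parts, got {len(parts)}")
    project, tss, participant, sample_vial, portion_analyte, plate, center = parts
    if len(sample_vial) != 3 or len(portion_analyte) != 3:
        raise ValueError("sample/vial and portion/analyte parts must be 3 characters")
    return TcgaBarcode(
        project=project,
        tss=tss,
        participant=participant,
        sample_type=sample_vial[:2],
        vial=sample_vial[2],
        portion=portion_analyte[:2],
        analyte=portion_analyte[2],
        plate=plate,
        center=center,
    )


def is_tumor(barcode: TcgaBarcode) -> bool:
    """Sample type codes 01-09 denote tumor samples; 10-19 denote normal samples."""
    return 1 <= int(barcode.sample_type) <= 9


parsed = parse_aliquot_barcode("TCGA-02-0001-01C-01D-0182-01")
print(parsed.participant, parsed.sample_type, is_tumor(parsed))
```

The authoritative field definitions and code values are those in the TCGA code tables; this sketch only shows the general shape of the identifier.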
TCGA data also includes annotations that contain important information about TCGA cases and samples needed for complete and accurate analysis and interpretation. See TCGA Annotations Overview for a description of each annotation.
Additional information is available in the TCGA FAQs.