An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics.

Cell
Authors
Keywords
Abstract

For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale.

Year of Publication
2018
Journal
Cell
Volume
173
Issue
2
Pages
400-416.e11
Date Published
2018 04 05
ISSN
1097-4172
DOI
10.1016/j.cell.2018.02.052
PubMed ID
29625055
PubMed Central ID
PMC6066282
Links
Grant list
U24 CA143882 / CA / NCI NIH HHS / United States
U24 CA143866 / CA / NCI NIH HHS / United States
P30 CA016086 / CA / NCI NIH HHS / United States
U54 HG003273 / HG / NHGRI NIH HHS / United States
U24 CA144025 / CA / NCI NIH HHS / United States
U24 CA143840 / CA / NCI NIH HHS / United States
U24 CA143843 / CA / NCI NIH HHS / United States
U24 CA143858 / CA / NCI NIH HHS / United States
U24 CA143848 / CA / NCI NIH HHS / United States
U24 CA210949 / CA / NCI NIH HHS / United States
R01 CA163722 / CA / NCI NIH HHS / United States
U24 CA143867 / CA / NCI NIH HHS / United States
U24 CA210990 / CA / NCI NIH HHS / United States
P30 ES010126 / ES / NIEHS NIH HHS / United States
P30 CA016672 / CA / NCI NIH HHS / United States
U54 HG003067 / HG / NHGRI NIH HHS / United States
U24 CA143835 / CA / NCI NIH HHS / United States
U24 CA210950 / CA / NCI NIH HHS / United States
U24 CA143845 / CA / NCI NIH HHS / United States
U24 CA143799 / CA / NCI NIH HHS / United States
U24 CA210957 / CA / NCI NIH HHS / United States
U54 HG003079 / HG / NHGRI NIH HHS / United States
U24 CA210988 / CA / NCI NIH HHS / United States
U24 CA143883 / CA / NCI NIH HHS / United States