An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer.

Mol Cell Proteomics
Authors
Keywords
Abstract

Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis.

Year of Publication
2016
Journal
Mol Cell Proteomics
Volume
15
Issue
3
Pages
1060-71
Date Published
2016 Mar
ISSN
1535-9484
URL
DOI
10.1074/mcp.M115.056226
PubMed ID
26631509
PubMed Central ID
PMC4813688
Links
Grant list
U24 CA160034 / CA / NCI NIH HHS / United States
U24 CA159988 / CA / NCI NIH HHS / United States
U24CA159988 / CA / NCI NIH HHS / United States
P50 CA058223 / CA / NCI NIH HHS / United States
U24CA160036 / CA / NCI NIH HHS / United States
U24 CA160036 / CA / NCI NIH HHS / United States
P30 CA091842 / CA / NCI NIH HHS / United States
U24 CA160019 / CA / NCI NIH HHS / United States
U24 CA160035 / CA / NCI NIH HHS / United States
U24CA160034 / CA / NCI NIH HHS / United States
U24CA160035 / CA / NCI NIH HHS / United States
U24CA160019 / CA / NCI NIH HHS / United States
T32 GM088118 / GM / NIGMS NIH HHS / United States