Avant-garde: an automated data-driven DIA data curation tool.
Authors | |
Abstract | Several challenges remain in data-independent acquisition (DIA) data analysis, such as to confidently identify peptides, define integration boundaries, remove interferences, and control false discovery rates. In practice, a visual inspection of the signals is still required, which is impractical with large datasets. We present Avant-garde as a tool to refine DIA (and parallel reaction monitoring) data. Avant-garde uses a novel data-driven scoring strategy: signals are refined by learning from the dataset itself, using all measurements in all samples to achieve the best optimization. We evaluate the performance of Avant-garde using benchmark DIA datasets and show that it can determine the quantitative suitability of a peptide peak, and reach the same levels of selectivity, accuracy, and reproducibility as manual validation. Avant-garde is complementary to existing DIA analysis engines and aims to establish a strong foundation for subsequent analysis of quantitative mass spectrometry data. |
Year of Publication | 2020
|
Journal | Nat Methods
|
Volume | 17
|
Issue | 12
|
Pages | 1237-1244
|
Date Published | 2020 Dec
|
ISSN | 1548-7105
|
DOI | 10.1038/s41592-020-00986-4
|
PubMed ID | 33199889
|
PubMed Central ID | PMC7723322
|
Links | |
Grant list | U24-CA210979 / U.S. Department of Health & Human Services | NIH | NCI | Division of Cancer Epidemiology and Genetics, National Cancer Institute (National Cancer Institute Division of Cancer Epidemiology and Genetics)
U24-CA210986 / U.S. Department of Health & Human Services | NIH | NCI | Division of Cancer Epidemiology and Genetics, National Cancer Institute (National Cancer Institute Division of Cancer Epidemiology and Genetics)
U24 CA210986 / CA / NCI NIH HHS / United States
U54 HG008097 / HG / NHGRI NIH HHS / United States
U24 CA210979 / CA / NCI NIH HHS / United States
U01 CA214125 / CA / NCI NIH HHS / United States
|