Tutorial: guidelines for quality filtering of whole-exome and whole-genome sequencing data for population-scale association analyses.
Authors | |
Abstract | Genetic sequencing technologies are powerful tools for identifying rare variants and genes associated with Mendelian and complex traits; indeed, whole-exome and whole-genome sequencing are increasingly popular methods for population-scale genetic studies. However, careful quality control steps should be taken to ensure study accuracy and reproducibility, and sequencing data require extensive quality filtering to delineate true variants from technical artifacts. Although processing standards are harmonized across pipelines to call variants from sequencing reads, there currently exists no standardized pipeline for conducting quality filtering on variant-level datasets for the purpose of population-scale association analysis. In this Tutorial, we discuss key quality control parameters, provide guidelines for conducting quality filtering of samples and variants, and compare commonly used software programs for quality control of samples, variants and genotypes from sequencing data. As sequencing data continue to gain popularity in genetic research, establishing standardized quality control practices is crucial to ensure consistent, reliable and reproducible results across studies. |
Year of Publication | 2025
|
Journal | Nature protocols
|
Date Published | 03/2025
|
ISSN | 1750-2799
|
DOI | 10.1038/s41596-025-01169-1
|
PubMed ID | 40155705
|
Links |