The variant call format and VCFtools.

Bioinformatics
Authors
Keywords
Abstract

SUMMARY: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.

AVAILABILITY:

Year of Publication
2011
Journal
Bioinformatics
Volume
27
Issue
15
Pages
2156-8
Date Published
2011 Aug 1
ISSN
1367-4811
DOI
10.1093/bioinformatics/btr330
PubMed ID
21653522
PubMed Central ID
PMC3137218
Links
Grant list
075491/Z/04 / Wellcome Trust / United Kingdom
086084 / Wellcome Trust / United Kingdom
090532 / Wellcome Trust / United Kingdom
54 HG003067 / HG / NHGRI NIH HHS / United States
R01 HG004719 / HG / NHGRI NIH HHS / United States
RG/09/012/28096 / British Heart Foundation / United Kingdom
RG/09/012/28096 / Wellcome Trust / United Kingdom
U01 HG005208 / HG / NHGRI NIH HHS / United States
Intramural NIH HHS / United States