CAUSEL: an epigenome- and genome-editing pipeline for establishing function of noncoding GWAS variants.

Nat Med
Authors
Keywords
Abstract

The vast majority of disease-associated single-nucleotide polymorphisms (SNPs) mapped by genome-wide association studies (GWASs) are located in the non-protein-coding genome, but establishing the functional and mechanistic roles of these sequence variants has proven challenging. Here we describe a general pipeline in which candidate functional SNPs are first evaluated by fine mapping, epigenomic profiling, and epigenome editing, and then interrogated for causal function by using genome editing to create isogenic cell lines followed by phenotypic characterization. To validate this approach, we analyzed the 6q22.1 prostate cancer risk locus and identified rs339331 as the top-scoring SNP. Epigenome editing confirmed that the rs339331 region possessed regulatory potential. By using transcription activator-like effector nuclease (TALEN)-mediated genome editing, we created a panel of isogenic 22Rv1 prostate cancer cell lines representing all three genotypes (TT, TC, CC) at rs339331. Introduction of the 'T' risk allele increased transcription of the regulatory factor 6 (RFX6) gene, increased homeobox B13 (HOXB13) binding at the rs339331 region, and increased deposition of the enhancer-associated H3K4me2 histone mark at the rs339331 region compared to lines homozygous for the 'C' protective allele. The cell lines also differed in cellular morphology and adhesion, and pathway analysis of differentially expressed genes suggested an influence of androgens. In summary, we have developed and validated a widely accessible approach that can be used to establish functional causality for noncoding sequence variants identified by GWASs.

Year of Publication
2015
Journal
Nat Med
Volume
21
Issue
11
Pages
1357-63
Date Published
2015 Nov
ISSN
1546-170X
URL
DOI
10.1038/nm.3975
PubMed ID
26398868
PubMed Central ID
PMC4746056
Links
Grant list
DP1 GM105378 / GM / NIGMS NIH HHS / United States
1K99CA184415-01 / CA / NCI NIH HHS / United States
P50 CA090381 / CA / NCI NIH HHS / United States
R01 CA129435 / CA / NCI NIH HHS / United States
K99 CA184415 / CA / NCI NIH HHS / United States
R01 GM107427 / GM / NIGMS NIH HHS / United States
R01CA193910 / CA / NCI NIH HHS / United States
R01 CA193910 / CA / NCI NIH HHS / United States
U19CA148112 / CA / NCI NIH HHS / United States
U19 CA148537 / CA / NCI NIH HHS / United States
DP1 OD006862 / OD / NIH HHS / United States
DP1 GM105378 / DP / NCCDPHP CDC HHS / United States
U19 CA148112 / CA / NCI NIH HHS / United States
K08 DK002883 / DK / NIDDK NIH HHS / United States
U19CA148537 / CA / NCI NIH HHS / United States