QTLtools-fenrich(1)

Functional enrichment of molecular QTLs

Section 1 qtltools bookworm source

Description

QTLtools-fenrich

NAME

QTLtools fenrich - Functional enrichment of molecular QTLs

SYNOPSIS

QTLtools fenrich --qtl significanty_genes.bed --tss gene_tss.bed --bed TFs.encode.bed.gz --out output.txt [OPTIONS]

DESCRIPTION

This mode allows assessing whether a set of QTLs fall within some functional annotations more often than what is expected by chance. The method is detailed in <https://www.nature.com/articles/ncomms15452>. Here, we mean by chance is what is expected given the non-uniform distributions of molQTLs and functional annotations around the genomic positions of the molecular phenotypes. To do so, we first enumerate all the functional annotations located nearby a given molecular phenotype. In practice, for X phenotypes being quantified, we have X lists of annotations. And, for the subset Y of those having a significant molQTL, we count how often the Y molQTLs overlap the annotations in the corresponding lists: this gives the observed overlap frequency fobs(Y) between molQTLs and functional annotations. Then, we permute the lists of functional annotations across the phenotypes (e.g, phenotype A may be assigned the list of annotations coming from phenotype B) and for each permuted data set, we count how often the Y molQTLs do overlap the newly assigned functional annotations: this gives the expected overlap frequency fexp(Y) between molQTLs and functional annotations. By doing this permutation scheme, we keep the distribution of functional annotations and molQTLs around molecular phenotypes unchanged. Now that we have the observed and expected overlap frequencies, we use a fisher test to assess how fobs(Y) and fexp(Y) differ. This gives an odd ratio estimate and a two-sided p-value which basically tells us first if there is and enrichment or depletion, and second how significant this is.

OPTIONS

--qtl in.bed

List of QTLs of interest in BED format. REQUIRED.

--bed functional_annotation.bed.gz

Functional annotations in BED format. REQUIRED.

--tss genes.bed

List of positions of all phenotypes you mapped QTLs for, in BED format. REQUIRED.

--out output.txt

Output file. REQUIRED.

--permute integer

Number of permutation to run. DEFAULT=1000

INPUT FILES

--qtl file

List of QTLs of interest. An example:

1

15210

15211

1_15211

ENSG00000227232.4

-

1

735984

735985

1_735985

ENSG00000177757.1

+

1

735984

735985

1_735985

ENSG00000240453.1

-

1

739527

739528

1_739528

ENSG00000237491.4

+

The column definitions are:

Image grohtml-80937-1.png

--bed file

List of annotations in BED format. An example:

1

254874

265487

1

730984

735985

1

734984

736585

1

739527

748528

The column definitions are:

Image grohtml-80937-2.png

--tss file

List of positions of all phenotypes you mapped QTLs for. An example:

1

29369

29370

ENSG00000227232.4

1_15211

-

1

135894

135895

ENSG00000268903.1

1_985446

-

1

137964

137965

ENSG00000269981.1

1_1118728

-

1

317719

317720

ENSG00000237094.7

1_15211

+

The column definitions are:

Image grohtml-80937-3.png

OUTPUT FILE

--out file

Space separated results output file detailing the enrichment with the following columns:

Image grohtml-80937-4.png

EXAMPLE

1

You need to prepare a BED file containing the positions of the QTLs of interest. To do so, extract all significant hits at a given FDR threshold (e.g. 5%), and then transform the significant QTL list into a BED file:

Rscript ./script/qtltools_runFDR_cis.R results.genes.full.txt.gz 0.05 results.genes

cat results.genes.significant.txt | awk ’{ print $9, $10-1, $11, $8, $1, $5 }’ | tr ’ ’ ’\t’ | sort -k1,1V -k2,2g > results.genes.significant.bed

2

Prepare a BED file containing the positions of all phenotypes you mapped QTLs for:

zcat results.genes.full.txt.gz | awk ’{ print $2, $3-1, $4, $1, $8, $5 }’ | tr ’ ’ ’\t’ | sort -k1,1V -k2,2g > results.genes.quantified.bed

3

Run the enrichment analysis:

QTLtools fenrich --qtl results.genes.significant.bed --tss results.genes.quantified.bed --bed TFs.encode.bed.gz --out enrichment.QTL.in.TF.txt

SEE ALSO

QTLtools(1)

QTLtools website: <https://qtltools.github.io/qtltools>

BUGS

o

Please submit bugs to <https://github.com/qtltools/qtltools>

CITATION

Delaneau, O., Ongen, H., Brown, A. et al. A complete tool set for molecular QTL discovery and analysis. Nat Commun 8, 15452 (2017). <https://doi.org/10.1038/ncomms15452>

AUTHORS

Olivier Delaneau (olivier.delaneau@gmail.com), Halit Ongen (halitongen@gmail.com)