-
Notifications
You must be signed in to change notification settings - Fork 3
/
Copy pathREADME
36 lines (29 loc) · 2.07 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
Q is a fast saturation-based ChIP-seq peak caller.
Q works well in conjunction with the irreproducible discovery rate (IDR) procedure.
Q was extensively tested on publicly available datasets from ENCODE
and shown to perform well with respect to reproducibility of the called peak set,
consistency of the peak sets with respect to predicted transcription factor binding motifs contained in them,
as well as overall run time.
Q is implemented in C++ making use of the Boost and SeqAn library.
There are a number of useful features for the primary analysis of ChIP-seq data.
Q can be run with or without data from a control experiment.
Duplicate reads are removed on the fly without altering the original BAM file,
and the number of duplicated reads is then shown in Q's output.
The average fragment length of the sequencing library,
which is an essential parameter for peak calling and for downstream analysis,
is estimated automatically from the data.
This is done by examing the vector of read start positions along individual chromosomes
and calcuting the shift that is associated with the smallest Hamming distance.
This procedure yields an equivalent estimation of the average fragment length
as the cross-correlation plot of SPP but is approximately three times faster.
As a part of this procedure, Q also calculates the relative strand cross-correlation coefficient (RSC),
which allows a global quality assessment of the enrichment.
In addition Q offers its own quality metrics, which can be used for trouble-shooting and quality control of the results.
If desired, Q also generates fragment coverage profiles which can be uploaded to UCSC's genome browser,
where they can be displayed in the context of other related data for example ChIP-seq data for histone modifications
and cofactors or expression data.
Please see http://charite.github.io/Q/ for more information.
Q uses the SeqAn library revision 14083 (http://www.seqan.de/).
All SeqAn files necessary for compilation are included within
the distribution of Q (src/seqan).
Q uses the Boost library (tested version 1.49) (http://www.boost.org/).