Hello
I have been using DiffBind to perform differential enrichment analysis on my ChIP-seq dataset where I have 2 sample groups, WT and KO, with 4 replicates in each sample group.
The analysis reports only 2 significantly differential enriched peaks at FDR < 0.05, and the situation does not change even if, hypothetically, I choose as threshold FDR < 0.99. Basically, all the remaining peaks in the consensus peakset report FDR=1.
I used a consensus peakset generated by myself, basically merging all peaks called (by MACS2, FDR<0.05) for WT and KO against input. This way, I could create a peakset including all significant peaks among all samples. However, peak calling has been performed on merged samples, where all WTs have been merged, and the same for the KO samples.
When I performed some exploratory analysis, such as PCA, I noticed some variation between replicates of each sample group, but nothing too dramatic.
I am still unsure if my decision to feed into the analysis my own consensus peakset, generated as explained above, is correct.
In the Genome Browser, I can clearly notice how some loci show different enrichment for this mark, but I am looking to find a quick way to identify loci that significantly differ in enrichment between WT and KO. Also in the peak calling results, from merged samples, there is a clear difference in the number of called peaks between the two sample groups. But anyway, in DiffBind I do not really get any significant differentially enriched peaks since almost all of them have FDR=1.
Any comment or suggestion would be highly appreciated, either about my approach with DiffBind or recommending other tools.
Thanks in advance!
Did you find the reason? I meet a similar problem and I got 0 significantly differential enriched peaks at FDR < 0.05, nothing change even with a threshold FDR<0.5. And my result of plotMA() is just a flat line with no point upper or lowwer than log2fc0.
Ri and Marco, did you solve the problem? If yes, How?
I am facing the sample issue excatly the same as Ri described. In addition the p-values looks reasonable (a range from <0.05 to >0.05), however, they all have FDR=1.
I am using
DiffBind 3.10.0
.Thanks!
Hmmn, this has now been reported three times, which is concerning.
If someone can send me their DBA object after analysis (or a link to where I can download it) I can have a look at what is happening.
Dear Dr. Strack,
Sorry for my late reply! I have just email the DiffBind outputs to you. I will post what we come up here.
Thank you very much for helping! :)
Hi, Has this been figured out? I have duplicate CHIPseq data, which have obvious difference in IGV and significant FDR, logFC by usual edgeR or DESEQ analysis using consensus peak sets.
However, using diffbind,
shows clear difference but
shows just a flat line at 0 (all the logFC difference seems to be normalized to 0)
I guess this is a normalization problem.
Any solution for this? I'm using DiffBind 3.2.
Thanks,