How to select top significant ChiPseq peaks from MACS2 output file
2
1
Entering edit mode
8.1 years ago
Mike ★ 1.9k

Hi all,

I have Macs2 output results in xls format , there are 10 columns in results files (including -log10(pvalue), fold_enrichment & -log10(qvalue), How to select top significant peaks from this file.

Thanks in advance

ChIP-Seq macs2 peaks • 7.0k views
ADD COMMENT
4
Entering edit mode
8.1 years ago
igor 13k

Since p- and q- values are measures of significance, you can sort by those values. Since they are -log10, the most significant are the one with the highest values.

ADD COMMENT
0
Entering edit mode

Great, Thanks a lot,

And what about fold_enrichment ?

ADD REPLY
3
Entering edit mode
8.1 years ago

In practice I would suggest to look at some peaks with different fold changes to get a feel for what a sensible threshold should be for a peak to be convincing. Often I find that peaks with fold change ~2 don't look "peaky" at all so I discard anything below that threshold. Then for ranking peaks, it shouldn't matter very much if you use fold change, p-value or qvalue since these are very well correlated.

Also, take care that peaks with very high fold change (or very low p/q value) can be suspicious.

ADD COMMENT
0
Entering edit mode

True. You can also use something like IDR to refine your peak list.

But it really depends on what your goal is. I've seen papers where they select the top 500,000 peaks. Clearly, at those numbers, many of them are not going to be very good at all.

ADD REPLY

Login before adding your answer.

Traffic: 1525 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6