Question

Adjusting Peak Calling For Broad Enrichment Of Histone Modifications.

8

Entering edit mode

14.6 years ago

Dave Gerrard ▴ 190

We are working on some ChIP-seq data targeting histone modifications which may be widely dispersed across the genome. We expect the enrichment to be much less peaky and more 'foothills vs. plains'. Does anyone have any tips or tricks for getting the most out of peak calling software in this situation or any alternative strategies? We are using Macs14 and QuEST but are open to trying other methods.

chip-seq peak-calling • 7.2k views

ADD COMMENT • link updated 14.6 years ago by Mikael Huss 4.8k • written 14.6 years ago by Dave Gerrard ▴ 190

Ram · Answer 1 · 2011-01-07

7

Entering edit mode

14.6 years ago

Alastair Kerr 5.3k

We have looked at this type of data in the following papers:

We ended up writing our own set of perl scripts and changed variables based on the type of data examined (full details and rationales in the methods section of these papers)

Our 'peak finder' defines a peak as an area where values are above the height threshold, less than the specified gap apart and the whole area falls within the length thresholds. It outputs the peaks as a Bed file with statistics. I'll put our peak finding script here but please be warned that is was not intended for release and it is still largely undocumented. If there is a need, we can tidy it up and put it on the galaxy toolshed.

ADD COMMENT • link updated 5.9 years ago by Ram 45k • written 14.6 years ago by Alastair Kerr 5.3k

1

Entering edit mode

Can you please update the link to your perl script!

ADD REPLY • link 8.5 years ago by Deepak Tanwar ★ 4.2k

0

Entering edit mode

I removed this script as there now exists a range of decent pack callers.. See this wikipedia page

ADD REPLY • link 8.5 years ago by Alastair Kerr 5.3k

score 2 · Answer 2 · 2011-01-07

Nice discussion topic -- I've also struggled with large histone modification peaks. One approach which we had some success with is using MACS 1.4 with the call-subpeaks option to subdivide the larger peaks using PeakSplitter:

http://www.ebi.ac.uk/bertone/software.html

We then overlapped these with nucleosome position calls from NPS:

http://liulab.dfci.harvard.edu/NPS/

This was helpful to get a more refined set of reference nucleosome regions that could then be used for comparisons between experiments. Here's the python code used to combine the NPS and MACS calls:

https://github.com/chapmanb/mgh_projects/blob/master/cy_histone_chipseq/merge_nps_macs.py

score 2 · Answer 3 · 2011-01-07

2

Entering edit mode

14.6 years ago

Mikael Huss 4.8k

I've used SICER and CCAT with reasonable success for this kind of problem. The former is specifically designed for diffuse enrichment regions (but not TFs) and the latter has a "peak mode" for TFs and similar cases and a "region mode" for cases where you expect a more spread-out enrichment.

ADD COMMENT • link 14.6 years ago by Mikael Huss 4.8k