Question

call differential peaks H3K27ac

0

Entering edit mode

8.1 years ago

sophialovechan ▴ 80

Hi guys, I have H3K27ac chip-seq data from two different samples and attempted to call differential peaks in these two dataset. I have finished the analysis by MACS2 but when I looked back to the peaks by IGV, there were a lot of significantly differential peaks that have been missed. I checked all scripts step by step but did not find anything wrong. The only issue that I am concerned is the actual effective depths. The actual effective depth that I got from different samples have three times differences: one is 10 million, the other is 29 million. Do you guys think that is the reason that I have this trouble? And how can I fix it? Thank you.

ChIP-Seq • 4.7k views

ADD COMMENT • link updated 8.1 years ago by Ian 6.1k • written 8.1 years ago by sophialovechan ▴ 80

score 2 · Answer 1 · 2016-11-14

2

Entering edit mode

8.1 years ago

Ian 6.1k

I would suggest using specific tools for this as they handle differences in coverage.

diffReps moves a window of sequence along the genome and calculates differential binding. It has the advantage of not requiring peak calling results (which is arbitrary based on the chosen cut offs), and does not require duplicates (although use of duplicates can give more statistically rigorous results).

DiffBind requires regions from peak calling, and requires duplicates. However, in my experience it can give more specific/stronger results when multiple peaks are in close vicinity to each other.

ADD COMMENT • link 8.1 years ago by Ian 6.1k

0

Entering edit mode

Unfortunately, diffReps is unavailable in our computer resource center. I have to use MACS2 to call differential peaks. I got the results but there was a problem bothering me much. The final result missed some important peaks. For example, gene A is a marker gene for cell type A. I have seen the peaks (fold enrichment: 3-4; the peak length 200-1200) in the peak file from cell type A after broad peak calling by MACS2, which were absent in cell type B. However, after call differential binding events for cell type A and B, those peaks were not in the final results. I have tried used different parameters to call differential binding events, like -g 60 or 100 or 200. But the problem was still unsolved. Any ideas?

ADD REPLY • link 8.1 years ago by sophialovechan ▴ 80

0

Entering edit mode

1) What is the exact command that you used?

2) How did you calculate the effective depths (--d1 and --d2 values for macs2 bdgdiff)?

ADD REPLY • link 8.1 years ago by harold.smith.tarheel ★ 5.0k

0

Entering edit mode

macs2 bdgdiff --t1 A_treat_pileup.bdg --c1 A_control_lambda.bdg --t2 B_treat_pileup.bdg --c2 B_control_lambda.bdg --d1 17119467 --d2 17119467 -g 60 -l 120 --o-prefix diff_A_vs_B

I used the command from the tutorial of MACS2 to get the effective depths (egrep). There was some problems with the input data from one sample so I used the same control for both samples.

ADD REPLY • link 8.1 years ago by sophialovechan ▴ 80

0

Entering edit mode

You stated in the original message that one sample had a read depth of 10M, so how can the effective depth be ~17M?

ADD REPLY • link 8.1 years ago by harold.smith.tarheel ★ 5.0k

0

Entering edit mode

I changed the control file (input), that's why the effective depth has changed.

ADD REPLY • link 8.1 years ago by sophialovechan ▴ 80

0

Entering edit mode

Effective depth ≠ control depth. From the tutorial:

the effective sequencing depth is the smaller number of treatment and control.

And it's unclear how changing the control data set would change the effective depth of both treatment data sets to 17M (from 10M and 29M in the OP).

ADD REPLY • link 8.1 years ago by harold.smith.tarheel ★ 5.0k

0

Entering edit mode

$ egrep "tags after filtering in treatment|tags after filtering in control" cond2_peaks.xls

here is the script that I used to evaluate the effective depth. I am sure that the peak calling is right because all important peaks were there. I just got problems with differential peaks calling. I have checked other unregulated genes and they have corresponding peaks. I don't know whether I need to stick on this one gene, which is very important gene though.

ADD REPLY • link 8.1 years ago by sophialovechan ▴ 80

0

Entering edit mode

You need to determine the effective depth for both conditions, and specify the appropriate depth for each when running bdgdiff.

ADD REPLY • link 8.1 years ago by harold.smith.tarheel ★ 5.0k

0

Entering edit mode

Before running bdgDiff did you use the --SPMR setting as a basic normalisation method.

ADD REPLY • link 8.1 years ago by Ian 6.1k

0

Entering edit mode

From the tutorial:

--SPMR is not compatible with bdgdiff, so avoid using it.

ADD REPLY • link 8.1 years ago by harold.smith.tarheel ★ 5.0k

0

Entering edit mode

No. I used MACS2 call peaks (-broad) as my peak calling method. What will that make difference?

ADD REPLY • link 8.1 years ago by sophialovechan ▴ 80

0

Entering edit mode

Try using the exact commands described in this tutorial.

ADD REPLY • link 8.1 years ago by harold.smith.tarheel ★ 5.0k

score 0 · Answer 2 · 2016-11-09

0

Entering edit mode

8.1 years ago

Bioradical ▴ 60

You can normalize your samples for sequencing depth using Deeptools bamCoverage and choosing the appropriate options. You can find differentially expressed peaks using the command line tool diffReps.

ADD COMMENT • link 8.1 years ago by Bioradical ▴ 60