Question

How to process spike-in samples with bamCoverage and plotHeatmap (deepTools)

2

8.3 years ago

alessandro.pastore ▴ 40

Hi, I have a question: I have a Drosophila genome spike-in in a ChIP-seq experiment. Does anybody have experience with the best way to normalise and process those samples to generate bigWig (bw) files and a count matrix? (I am using deepTools.)

I am following the vendor's instructions:

1. Map reads to the sample genome (human, mouse, etc.).
2. Map reads to the DM genome.
3. Count unique DM sequence tags.
4. Identify the sample with the least amount of tags.
5. Count DM tags from the other samples.
6. Create a normalization factor = lowest_sample / sample_of_interest.
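The last step of the vendor recipe can be sketched in a few lines of shell. This is only an illustration: the function name `spike_factors`, the tab-separated input layout (sample name, unique DM tag count) and the counts themselves are assumptions made for the example, not part of the vendor protocol.

```shell
#!/usr/bin/env bash
# Hypothetical helper: reads "sample<TAB>dm_tag_count" lines on stdin and
# prints "sample<TAB>factor", where factor = lowest_count / sample_count.
spike_factors() {
    awk -F'\t' '
        NF >= 2 && $2 + 0 > 0 {
            name[++n] = $1
            count[n] = $2 + 0
            # Track the smallest DM tag count seen so far
            if (n == 1 || count[n] < min) min = count[n]
        }
        END {
            for (i = 1; i <= n; i++)
                printf "%s\t%.4f\n", name[i], min / count[i]
        }'
}

# Made-up counts, for illustration only
printf 'sampleA\t200000\nsampleB\t100000\nsampleC\t400000\n' | spike_factors
```

With these made-up counts, the sample with the fewest DM tags (sampleB) gets a factor of 1 and every other sample gets a factor below 1, so applying the factor can only scale signal down relative to that sample.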

Then I use the normalisation factor ${SPIKE} in deepTools as follows:

    bamCoverage -b ${SAMPLE}/${SAMPLE}.nodup.norm.bam -o /scratch/${PBS_JOBID}/${SAMPLE}/${SAMPLE}.bw \
      -p 1 --normalizeTo1x ${genomesize} \
      --scaleFactor ${SPIKE} \
      --ignoreForNormalization chrX chrY \
      -bl ${ENCODEBED} \
      --smoothLength 40 \
      --binSize 10 \
      --centerReads --extendReads 150

I am getting a modest increase in signal, whereas the vendor's results show a modest decrease. I am wondering whether I am doing something wrong with the bamCoverage --scaleFactor argument, or whether the discrepancy is due to the different processing of the bw files between my pipeline (deepTools-based) and the vendor's (custom-made).

Thanks a lot for your support and advice

ChIP-Seq spike deeptools normalization • 6.8k views

ADD COMMENT • link updated 3.6 years ago by Friederike 9.0k • written 8.3 years ago by alessandro.pastore ▴ 40

0

I actually have a question back for you: which vendor did you obtain the Drosophila genome spike-in from? I am looking for it myself as well. Thanks.

ADD REPLY • link 8.0 years ago by RogelioG • 0

0

Active Motif.

You can also try it yourself if you have access to Drosophila S2 cells. Just crosslink the Drosophila cells independently, mix them with your target cells at a ratio of 1:10 (we do 1:100), and then proceed to sonication and all downstream ChIP-seq steps as usual.

Align the reads to both the target genome and the Drosophila genome, and deduplicate.

Use the number of uniquely mapped Drosophila reads (those not mapping to the target genome) for normalization.

ADD REPLY • link 8.0 years ago by alessandro.pastore ▴ 40

score 3 · Accepted Answer · 2017-04-28

3

8.3 years ago

mmmmcandrew ▴ 200

Did the vendor instructions mention that they also normalized to 1x? That could be the problem...

It's possible that you should be applying only the scaleFactor and not also normalizing to 1x. If you apply only the scaleFactor (i.e. run that command without the --normalizeTo1x argument), do you see a modest decrease of signal as the vendor does?

The deepTools Google group is another great resource for questions like this one, and they may be able to help more.

ADD COMMENT • link 8.3 years ago by mmmmcandrew ▴ 200

2

Just to confirm for future readers: normalization should be turned off with deepTools if you only want the manually supplied scale factor to be used. I.e. --normalizeUsing None. Another informative and related thread is https://www.biostars.org/p/452905/
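As a minimal sketch of what that looks like in practice, the wrapper below calls bamCoverage with built-in normalization switched off so that only the spike-in factor scales the signal. The function name `spikein_bigwig`, the file names and the factor are placeholders made up for the example. The bin size, smoothing and read-extension values are copied from the command in the question, and `--normalizeUsing` assumes deepTools 3.0 or later (in 2.x the equivalent is simply leaving out `--normalizeTo1x`).

```shell
#!/usr/bin/env bash
# Hypothetical wrapper: write a bigWig scaled only by a spike-in factor.
#   $1 = input BAM, $2 = output bigWig, $3 = spike-in scale factor
spikein_bigwig() {
    local bam=$1 out=$2 factor=$3
    # --normalizeUsing None: no RPGC/CPM/etc., so --scaleFactor is the
    # only scaling applied to the coverage.
    bamCoverage -b "$bam" -o "$out" \
        --normalizeUsing None \
        --scaleFactor "$factor" \
        --binSize 10 \
        --smoothLength 40 \
        --centerReads --extendReads 150
}

# Example call (file names and the factor are placeholders):
# spikein_bigwig sample.nodup.norm.bam sample.spikein.bw 0.5
```

Options such as the blacklist file or the number of processors can be added back as needed. `--ignoreForNormalization` is left out of the sketch on the assumption that it matters only when a normalization method is selected.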

ADD REPLY • link 3.6 years ago by Friederike 9.0k

0

Yes! Sorry I missed your answer! The 1x normalization was indeed the difference between my settings and the vendor's.

ADD REPLY • link 8.0 years ago by alessandro.pastore ▴ 40