Question

chip-seq data for histone, tag count

1

Entering edit mode

9.9 years ago

tonja.r ▴ 600

Average ChIP-Seq tag counts were calculated in windows of 50 bp for a region of 5 kb up- and down-stream of the orientated transcription start sites (TSS). Tag counts were normalized globally, as a fold increase over the genome average tag count in a window of 50 bp for the following modifications:...

I am trying to reproduce it. So, they took a window of 50bp, calculated the number of reads falling into these region and then devided by 50. But they still plot the coverage per base not per window, right? So, they mapped the average chip-seq counts calculated in 50 bp region back to each base, didn't they? I am also not quite sure how they normalized the counts.

image from paper

ChIP-Seq • 4.6k views

ADD COMMENT • link updated 2.9 years ago by Ram 45k • written 9.9 years ago by tonja.r ▴ 600

Ram · Answer 1 · 2015-09-21

2

Entering edit mode

9.9 years ago

Devon Ryan 105k

No, they took regions of 50bp, counted the number of reads mapping there and continued with that number. We'll call this result (A). They then additionally calculated this number in regions not around the TSS and took the average of that, which we'll call (B). What is then plotted is the mean(Ai/B), where i denotes all of the bins/windows some given distance from the TSS (e.g., -5kb away, or +200bp).

Edit: Apparently I can't use subscripts. Imagine "i" as a subscript if that helps.

ADD COMMENT • link 9.9 years ago by Devon Ryan 105k

0

Entering edit mode

If i is a number of bins/windows and size of each bin/window is 50bp then -5kb away from TSS will have only 100 windows. The first window will start 5kb away from TSS and end 4950 away from TSS. Next window will start at 4950 from TSS away and end at 4900 etc. If they have calculated coverage for 100 windows, how could they plot the coverage for each base pair (-5kb away)?

ADD REPLY • link updated 2.9 years ago by Ram 45k • written 9.9 years ago by tonja.r ▴ 600

0

Entering edit mode

Ai is all windows 4950 bases away from the TSS (to choose a random distance). What's plotted at that position is the average of the ratio (or maybe the ratio of the average value, they probably specify that somewhere).