Question

How Reads Are Distributed Over My Targeted Regions

0

Entering edit mode

13.3 years ago

Vikas Bansal ★ 2.4k

Hello everyone,

We have sequenced some targeted regions and mapped reads (36bp) to whole reference genome (hg18). I have a bed file which have coordinates of captured regions. I want to see (plot), what is the distribution of reads mapped to my captured regions relative to 100bp up and 100bp down to captured regions. Example - I have 10,000 regions captured but I mapped reads to whole genome. All captured regions have different length and I want to calculate relative value for all the regions so that I can make a nice plot. If I have a region, say,

chr1    153     400

Now I want to see, how many reads are mapped to 153-100 i:e from 53 to 153 and 400+100 i:e from 400 to 500 and of course from 153 to 400. So I have 3 read depth values. I want to do it for all captured regions. I can calculate this for all regions using bedtools coverageBed but in the end, I have to plot the relative values to show how reads are distributed from 100 bp upstream of captured regions, then captured regions and then 100bp downstream of captured regions.

Any suggestions? How would you do it?

Thanks and Best regards,

Vikas

read plot • 3.7k views

ADD COMMENT • link updated 13.3 years ago by Jorge Amigo 14k • written 13.3 years ago by Vikas Bansal ★ 2.4k

1

Entering edit mode

Do you want to generate a plot summarizing all regions, or do you want to generate one plot per region of interest?

One way to generate nice per-region graph would be to take your SAM/BAM files, convert them to wiggle files, then visualize them as a custom track in the UCSC genome browser.

ADD REPLY • link updated 13.3 years ago by Istvan Albert 102k • written 13.3 years ago by Eric Fournier ★ 1.4k

0

Entering edit mode

Thanks for your reply. I want to generate a plot summarizing all regions. I think, I have to apply some statistics to generate relative values overall.

ADD REPLY • link 13.3 years ago by Vikas Bansal ★ 2.4k

score 1 · Answer 1 · 2012-04-10

1

Entering edit mode

13.3 years ago

Jorge Amigo 14k

since you are interested in those 3 region sets (upstream, proper region, downstream), I would generate 2 or 3 different bed files containing such regions (2 in case you want to merge upstream and downstream regions, 3 if not), then get the coverages through bedtools' coverageBed, and summarize those numbers "manually" at the end.