Question

Is strand important when it comes to DNAse and ChIP-Seq peak calling?

1

Entering edit mode

9.2 years ago

ashah90 ▴ 30

Hello everyone,

I have recently been using data from the Epigenomics Roadmap Project and am a little confused. I have been downloading the data from http://egg2.wustl.edu/roadmap/web_portal/processed_data.html#ChipSeq_DNaseSeq under section "c. Peak Calling" for NarrowPeak data. The data will have the following format:

chrom    chromStart    chromEnd    rank    score    strand    signal_value    pvalue    qvalue    peak
chr20    30298908    30300550    Rank_1    423    .    11.54602    42.39392    34.03034    1197

As you can see the "strand" column is left blank. What I am trying to do is check whether peaks in different cell lines and histone marks map to regulatory regions within the genome. I often will use the 2000 bp and 500 bp downstream criteria when defining a promoter region, but taking strand into account and with the way UCSC defines their txstart and txend fields, for negative strand genes, I have to go 2000 bp downstream the TSS rather than upstream. This brings up the question as to whether these peaks are being mapped on the positive or the negative strand, or does it not matter? Are the coordinates such that it is the positive strand orientation?

Thanks you for your help

gene dnase peak-calling ChIP-Seq genome • 4.1k views

ADD COMMENT • link updated 2.2 years ago by Ram 44k • written 9.2 years ago by ashah90 ▴ 30

Ram · Answer 1 · 2015-08-28

The concept of strand on a gene comes from transcription. You could have a gene transcribed from the positive strand or negative strand of DNA. You can tell from the mRNA which strand the gene was transcribed from. For DNAse-seq and ChIP-seq you are assaying a position in the genome (thus no strand). This assay is determines where the DNA is less compact or where a protein is binding to DNA (no concept of +/- strand)

score 2 · Answer 2 · 2015-09-04

2

Entering edit mode

9.2 years ago

Ming Tommy Tang ★ 4.5k

There is no strandness for ChIP-seq and Dnase-seq peaks. When counting reads in the peaks, both + reads and - reads will be counted.

ADD COMMENT • link 9.2 years ago by Ming Tommy Tang ★ 4.5k

0

Entering edit mode

If DNAse-seq peaks are called by MACS, is there any standard threshold for annotating genes as accessible or not accessible (based on the peak tag density score)?

ADD REPLY • link 8.6 years ago by Bioinformatist Newbie ▴ 270

0

Entering edit mode

There is no standard threshold. You need to set it as long as it makes biological sense.

ADD REPLY • link 8.6 years ago by Ming Tommy Tang ★ 4.5k

0

Entering edit mode

Can you point me out towards a publication mentioning the reason behind the threshold they selected?

ADD REPLY • link 8.6 years ago by Bioinformatist Newbie ▴ 270