Why does DiffBind use the count of reads that fall within peaks?
2
0
Entering edit mode
2.7 years ago
Aspire ▴ 370

DiffBind starts with peaks (per sample), and then counts the reads that fall within those peaks. Differential expression of the counts of the reads is calculated using DESeq2 / edgeR.

What is the reason for the move from peaks to reads? What benefit does it yield that would not be possible otherwise?

DiffBind • 751 views
ADD COMMENT
2
Entering edit mode
2.7 years ago
Rory Stark ★ 2.1k

Modelling the distribution of reads across all replicates in the sample groups (regardless of whether a peak was identified in any specific sample replicate) enables a robust quantitative assessment of the evidence for differential binding, including calculation of useful statistics such as p-value, FDR, and fold change (using the underlying statistical analysis packages).

The DiffBind vignette has a section that compares the results of an occupancy analysis using only peak calls to an affinity analysis that models read counts.

ADD COMMENT
3
Entering edit mode
2.7 years ago

It allows you to quantitate differences in instances where a peak is called in both groups, but the magnitude of said peak is not the same. So rather than a binary question of "is the peak shared between groups", the question is "does the signal under the peak differ between groups?", which is often informative.

ADD COMMENT

Login before adding your answer.

Traffic: 3086 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6