I noticed that when I first read in my sample sheet, I get a DBA object with 1226 sites (consensus peaks), yet after running dba.count()
on this same object, it is down to 1188 sites. I did not apply any blacklists/greylists.
Also, the FRiP column is missing after I run dba.count()
. Does anyone have ideas on what could have gone wrong?
There are 5 biological replicates of wildtype and 6 of mutant, but only one sequencing run was done per biological replicate (so no technical replicates). Peaks were called with MACS2 --broad
. I made .bed
files from the first 6 columns of the .broadPeak
output.
> WS
11 Samples, 1226 sites in matrix (13233 total):
ID Condition Replicate Intervals
1 705 Mut 1 1284
2 709 WT 1 319
3 719 Mut 1 8909
4 720 Mut 1 1994
5 721 WT 1 6
6 722 WT 1 15
7 728 WT 1 49
8 729 Mut 1 38
9 739 Mut 1 0
10 744 WT 1 2542
11 748 Mut 1 1
> plot(WS)
> WS <- dba.count(WS)
Computing summits...
Re-centering peaks...
> WS
11 Samples, 1188 sites in matrix:
ID Condition Replicate Reads
1 705 Mut 1 18209602
2 709 WT 1 17207846
3 719 Mut 1 14440635
4 720 Mut 1 19380340
5 721 WT 1 14392339
6 722 WT 1 16246924
7 728 WT 1 18220300
8 729 Mut 1 16485420
9 739 Mut 1 15516176
10 744 WT 1 16693032
11 748 Mut 1 15745849
dba.count
will filter peaks who's "cpm < 1" (dba.count(filter = 1)
) by default, this may decreasing peak numbers.