Hello
I am analyzing bulk ATAC-seq data using ENCODE atac-seq analysis pipeline, then DiffBind.
I have 5 treatments (will be stored in "Condition" column in DiffBind sample sheet) and 3 biological replicates ("Factor" column).
Q1) When I call dba.contrast(), am I correct in using the design as follows? From this previous post, it looks like the categories option should NOT be used together with the design parameter. Is this correct?
dba.contrast(dbObj, design="~Factor+Condition")
Q2) When I was using DESeq2 with bulk RNA-seq, I used similar design formula as above and extracted pairwise results of interest using the result() function. Is the process similar for DiffBind where I run dba.count(), dba.normalize(), dba.contrast(), dba.analyze(), and I can extract any pairwise comparisons between the 5 treatments later?
Q3) If anyone has experience feeding ENCODE atac-seq pipeline results into DiffBind, is this the correct narrowpeak file to use? My understanding is that I should use peak files that have NOT gone through IDR or inter-sample overlapping.
trim.merged.srt.nodup.no_chrM_MT.tn5.pval0.01.300K.bfilt.narrowPeak.gz
Thank you in advance for your inputs!
Thank you for taking the time to come back and post a solution to provide closure.