I have some ATAC-seq PE samples and Dnase-seq SE samples, I like to do peak calling for them using MACS2. I run MACS2 command with --nomodel -- shift -100 -- extsize 200 For DNase samples but for ATAc seq samples I am not sure which one of --nomodel -- shift -100 -- extsize 200 or only use -f BAMPE options of MACS2 is better?
Our previous recommendation was to run MACS2 with -f BAMPE, which is
similar to the default analysis mode of Genrich (inferring full
fragments, rather than cut site intervals). Others have attempted to
interpret cut site intervals with MACS2 by using the --shift and
--extsize arguments, but these arguments are ignored in BAMPE mode. They do work in the default (BAM) mode, but then, with paired-end
reads, most of the alignments are automatically discarded (half of the
properly paired alignments and all of the unpaired alignments;
secondary alignments are never considered). Is it worse to interpret
full fragments that may be less informative biologically, or to
disregard more than half of the sequence data? A complicated question.
The correct answer is: use Genrich.
The main issue with --nomodel -- shift -100 -- extsize 200 -f BAM is that you lose a lot of reads (approx 50%). As the MACS2 manual states specifically that for the -f BAM parameter:
If the BAM file is generated for paired-end data, MACS will only keep
the left mate(5' end) tag."
So, as I understood, If I use BAM file it is better to use -f BAMPE but If I change BAM file to BED file I can use both -f BEDPE and --nomodel -- shift -100 -- extsize 200 -f BED. Am I right?
I think so. When I use --nomodel -- shift -100 -- extsize 200 -f BED I have 1.5 times more peaks compared to --nomodel -- shift -100 -- extsize 200 -f BAM.
Hi- My 2p, I just started using Genrich on ATAC-seq with replicates and my first impression is actually very good when comparing it to MACS2 (I may post a better explanation at some point...).
Please try using the search function as the answers to your question have been presented in the past: ATAC-seq peak calling with MACS