Hello guys
I have an MNase-seq bam file (~400 million reads) that I'm trying to split into two different fragment sizes
1) 1-80 fragments (for TF-sized nucleosomes)
2) 135 -165 fragments (for mononucleosomes)
To do that I aligned the fastq files with Bowtie2 using the parameters "-I 1 -X 80" and "-I 135 -X 165" and ended up with a 368011030 mapped-read bam file and a 368011030 mapped-read bam file respectively (YES! The same amount of reads)
Later I used the bamCoverage
command (--MNase
) from Deeptools to normalise the 2 files and subsequently plot Coverage over TSS
The bam file containing the mononucleosomes was properly normalised and Coverage was plotted. On the other hand, the bam containing the TF-sized nucleosomes (~80 bp fragments) returned an error during normalisation
Error: The generated bedGraphFile was empty. Please adjust
your deepTools settings and check your input files.
Is the above approach I used for splitting files appropriate? The fact that I got the exact same amount of reads after alignment is bugging me
Any help would be much appreciated. This is the first time I'm analysing MNase-Seq data and I'm completely lost
Thanks
Just out of curiosity, what is a TF-sized nucleosome ? Do you mean MNase protection provided by transcription factors binding ?
Yeap that's it. Sorry, hope it didn't sound confusing the way I addressed it
No not at all, I'm just not familiar with the concept but it's interesting :)
Also just curious what you're basing the 80bp on, like any particular papers/studies? Just wanted to take a look, as Im trying to also track down good boundaries to set. Thanks!
This paper is a great example.