Hi all
What is the widely accepted method of combining biological replicate data from ATAC-seq (or other high-throughput sequencing) to create a single file per treatment/condition for signal track visualization?
The following are possible strategies I have seen:
- Concatenate alignment (bam) files for each replicate into a single bam file, then create bedgraph and bigwig files
- Create bedgraph/bigwig files separately for each replicate, then "combine" them into a single bigwig using wiggletools mean function
Because replicates will have different read depths, with the first method, replicates with more reads may get over-represented? My understanding is that when bedgraph/bigwig files are created, signals are normalized. Is calculating the mean of normalized signal (the second method) better?
Thanks for your inputs!
And the scaling factors are totalNumberOfReadsFile1 / totalNumberOfReadsFile2 ? We were considering adding a "scalingFactor" statement to bigWig display, so you wouldn't have to scale the bigWigs....