13.0 years ago
Paul
I was wondering whether there is software that generates histograms from a set of input bedGraph files, so that we can look at the output and compare the signals of the tracks. Our goal is to check whether these values are comparable, that is, whether the data sets are of similar quality.
Thank you. Your solution is technically correct, but we are now running into the problem that our data sets are extremely large. The signal file alone is 2.5 GB, which takes an unreasonable amount of RAM in R.
Ah, you know, I was wondering about that... You could sample the file somehow ahead of time if you're only interested in the general distribution of scores...
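One way to do that sampling without ever holding the whole file in memory is reservoir sampling. Here is a rough Python sketch, not a tested pipeline. It assumes a standard four-column bedGraph (chrom, start, end, score) and skips `track`, `browser` and `#` lines. The function name `sample_scores` and the sample size are made up for illustration.

```python
import random


def sample_scores(lines, k, seed=None):
    """Return a uniform random sample of up to k scores from bedGraph lines.

    Uses reservoir sampling (Algorithm R), so memory use is bounded by k
    no matter how many lines are read.
    """
    rng = random.Random(seed)
    reservoir = []
    seen = 0  # number of data lines read so far
    for line in lines:
        fields = line.split()
        # Skip blank lines, headers and comments (assumed bedGraph layout).
        if (len(fields) < 4
                or fields[0] in ("track", "browser")
                or fields[0].startswith("#")):
            continue
        score = float(fields[3])  # 4th column holds the signal value
        seen += 1
        if len(reservoir) < k:
            reservoir.append(score)
        else:
            # Keep the new score with probability k / seen.
            j = rng.randrange(seen)
            if j < k:
                reservoir[j] = score
    return reservoir


# Hypothetical usage (file name is a placeholder):
# with open("signal.bedgraph") as fh:
#     scores = sample_scores(fh, 100000, seed=42)
```

You could then write the sampled scores out one per line and plot them with `hist()` in R as before. Note that this samples one score per bedGraph line, so intervals of different lengths all count equally. If you need a per-base distribution you would have to weight by interval length, which this sketch does not do.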