Has anyone looked into measuring the similarity of two ChIP-seq-like samples using cross-correlation or autocorrelation? Basically, I want some sort of metric that describes how similar two wiggle files are on a genome-wide scale or, conversely, how much of the variance in one sample can be explained by the distribution pattern in the other. If you have, what software or library did you use? Alternatively, what different approach have you taken?
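The way I understand it, cross-correlation measures how similar two signals are as a function of the lag between them (so its peak gives you the lag), while autocorrelation does the same for a signal against itself and so reveals periodicity within a single signal. Do you mean plain ol' correlation, e.g. on a bp-by-bp or window-by-window basis?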
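To make the distinction concrete, here is a minimal Python/NumPy sketch, assuming each sample has already been loaded as a per-position coverage array for one chromosome. The function name `best_lag` and the `max_lag` parameter are made up for illustration; this is not taken from any ChIP-seq package. It computes the Pearson correlation at each shift and reports the shift where it peaks. The value at lag 0 is the plain correlation.

```python
import numpy as np

def best_lag(x, y, max_lag):
    """Return (lag, r): the shift of y relative to x with the highest Pearson r.

    A positive lag means y[i + lag] is compared with x[i].
    Hypothetical helper for illustration only.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = min(len(x), len(y))
    x, y = x[:n], y[:n]

    best = (0, -np.inf)
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            a, b = x[:n - lag], y[lag:]
        else:
            a, b = x[-lag:], y[:n + lag]
        # Pearson correlation of the overlapping parts at this shift
        r = np.corrcoef(a, b)[0, 1]
        if r > best[1]:
            best = (lag, float(r))
    return best
```

Note that no periodicity is needed for this to work: any non-constant signal will do. Looping over lags like this is slow for whole chromosomes, so an FFT-based approach would probably be needed at scale.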
Good points. I was hoping that cross-correlation would not require any periodicity and that the signal itself would be enough to measure similarity (I am still not clear on this). Regular correlation/regression might work, I guess, but I am concerned about the quantity of data involved. Perhaps binning each chromosome into 100 bp windows is the only way, while avoiding comparisons between two null or two background signals, as per Jeremy's point below.
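A rough sketch of what that windowed approach could look like in Python/NumPy, again assuming per-position coverage arrays for one chromosome. The name `windowed_correlation` and the `background` threshold are my own assumptions, and the threshold would need tuning for real data. Windows where both samples sit at or below background are dropped before correlating.

```python
import numpy as np

def windowed_correlation(a, b, window=100, background=0.0):
    """Pearson r between two coverage tracks after binning into fixed windows.

    Windows where BOTH tracks have mean signal <= background are skipped.
    Returns (r, number_of_windows_used). Hypothetical helper for illustration.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    # Trim to a whole number of windows shared by both tracks
    n = (min(len(a), len(b)) // window) * window
    wa = a[:n].reshape(-1, window).mean(axis=1)
    wb = b[:n].reshape(-1, window).mean(axis=1)

    # Keep a window if at least one sample shows signal above background
    keep = (wa > background) | (wb > background)
    used = int(keep.sum())
    if used < 2:
        return float("nan"), used

    r = np.corrcoef(wa[keep], wb[keep])[0, 1]
    return float(r), used
```

Squaring the returned `r` gives the fraction of variance in one sample explained by a linear fit to the other. Binning also cuts the data volume by a factor of `window`, which should help with the quantity-of-data concern.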