I intend to find common peaks ( in ChIP-Seq data, ) defined as peaks that exist in any pair of the 3 replicates I have. Of course there are 3 pairs, so peaks that overlap in 1 base in any pair of those would be considered valid.
If peak A and peak B overlap between two replicates, I will define the final "peak" as the union between the area that peak A spans and the area that peak B spans.
The issue is that I would like to have some way to rank the final peaks, at least to decide which peaks are "the best peaks".
Each final peak is a combination of at least two peaks; while the original peaks have meaningful p-values and q-values, I don't think their combination would be meaningful for the combined peak.
Can you suggest a meaningful way to rank the combined peaks?
I thought of something (perhaps it is very naive).
What I am interested is a crude measure of ranks, so that the (biologist) researcher can look at the "best" peaks, and see whether there are interesting genes among the top ranking peaks. Those best genes will be further validated (perhaps with ChIP-PCR on the original samples) to see whether the interesting results are true.
It seems to me that I do not need a well-defined statistical framework of high-ranking peaks. So, perhaps I will simply
I added a point 4) inspired by this.
Man, I've been facing this question so many times and I still don't have a satisfying answer.