Hi all,
I have a ChIPseq of a TF (narrow peaks), without replicates, which peaks were found with a good quality using an INPUT dna as control. QUESTION: Is there any statistical way to validate the final list of peaks in the downstream analysis, in order to filter those peaks which are called as statistically significant, but when looked by eye in a browser the enrichment they present seems to be like background?
Below I put an example of what I'm describing in a region of 76 Kb region:
In the first pair of tracks, one could appreciate the enrichment of the TF (red) versus the input (black) highlighted as blue boxes. But in the second pair of tracks, the peak caller mark just the small peak in the center of the blue box (marked also with another track in black as a little profile, just above the peak). The statistical parameters used to get the peaks are pvalue < 10e-5 and FDR < 1%.
I read in papers about validating peaks by PCR, or finding motifs in overall list of peaks but my concern is to find a method which would maybe re-filter the peaks so close to background and keep only visible and statistical significant peaks
I don't think computational validation can be done, ChIP-seq is a technique to assay genome wide protein occupancy and the only way to validate it is to use another tool to generate a dataset that also gives information on protein occupancy (such as ChIP-qPCR). IMO the question you are asking boils down to 'I ran several peak callers but some regions that it found are not believable by eye, is there any way to avoid them?' The typical answers would be to a) try another peak caller (if you want a very conservative one, you could try sole search) b) use different cutoffs and from your response you did something similar to b) where you made a special filter rule but the disadvantage of doing something like this is it might seem arbitrary and hard to justify. One thing you could try is overlap the regions found by the different peak callers and call that the 'concordant set' or something
Daniel, just a question - unfortunately not concerning the original question. How did you draw these kind of signal profiles? I think these are wig signal profiles but i dont know how to construct such figures without using e.g. UCSC browser export function. Best regards!