Hello everyone,
I would like to know if there are tools that feeded with ChIP-seq data for various histone marks, identify the enhancers by the "right" methylation/acetylation combinations. Until now, I found CSI-ANN, but I was wondering if there are other tools.
Thanks
I'll supplement Ryan's excellent answer with more information on the chromatin signature of Enhancers.
Generally as peeyushashu stated, Enhancers are denoted by a high amount of H3K4me1. This seems to be the single agreed on feature in published papers.
Outside of that you have varying 'classes' of enhancers. The three most common are those termed Active, Primed, and Latent Enhancers. Active enhancers are typically DNase hypersensitive sites, with H3K27Ac methylation, Primed enhancers are marked with H3K4me1, low levels of H3K27Ac and are DNase hypersensitive, while latent enhancers show low or absent H3K4me1, H3K27Ac and are often not DNase hypersensitive which means that chromatin at that region is closed.
I'd also like to emphasize that Enhancers do not have to be absent of H3K4me3. There's literature that has shown that Enhancers marked with H3K4me3 are considered to be active.
There are other enhancer classes that are denoted by H3K27me3, H2A.Z, and a few other markers I can't think off the top of my head, so make sure you read around.
Yes I have been reading aroung, actually I am looking for a tool because I know that there are multiple signatures that identify enhancers, so I am afraid of missing out some combination while overlapping the different chromatin marks myself. I thought a tool intended for this would check for all the combinations (given I provide the data for different histone marks of course). Thanks for the overview anyway, much appreciated.
Thanks so much Ryan, finding the search terms is always the most difficult to me. I will definitely try ChromHMM.
It's useful to keep in mind that ChromHMM does not annotate the different enhancer combinations for you. You'll have to do the annotation yourself.
Yep -- see specifically A: ChromHMM annotation new cell line
I read something about ChromHMM, but I have a doubt. Why would I use it and "learn" from my data, instead of just looking at the overlap of different histon marks? Especially in the case when I am not searching for something "new", I just want to identify the active enhancers in my specific cell line. Does it make sense to complicate things with machine learning in this case?
The problem is that there is not a good definition of "enhancer". Do you use "overlap of H3K4me1 and H3K4me3"? Or maybe "high ratio of H3K4me1 to H3K4me3". Maybe require DNase hypersensitivity. And maybe require some threshold of acetylation. Ideally you would also require evidence of long-range interaction with at least one promoter. There are many differing definitions in the literature along these lines. Another option is to use chromatin segmentation tools which find patterns in your data that you can then interpret as "enhancer-like". An advantage to this is that you can find some interesting results, like different classes of enhancers that you otherwise might not have found. And you can always use the ad hoc enhancer definitions to look for enrichment in chromatin segmentations.