I've been doing some work that involves characterizing potential chromatin states in a HCT116 cell model. I've successfully been able to run ChromHMM
to identify chromatin states using a variety of histone markers and then overlap them with other datasets in order to double-check their annotation.
However, I am having a problem understanding some of the output files that ChromHMM
automatically generates. Specifically _emissions.txt
and _*.bed
. I know there's a couple of people here that are really familiar with the software and could probably help me out.
I have already searched google, and read the ChromHMM manuscript, but neither provided answers.
Appreciate the information!
Do the emission values go directly on the png, or do they first have to be modified in some way? I have some values of 0.02 as an example, but a 6 in others. Would the 0.02 be treated as a 0?
Not sure if they're normalized in some way. To figure that out, you need to read the source code or try to reproduce the png given the txt file (and see what, if any, normalization needs to happen). Given the lack of a colormap though, my guess would be that each emissions.txt file is divided by the max of that file.
How to figure out the label of each state? I got the output of chromHMM,but can't find the annotation information of each state?Tanks
The label of each state is subjective. Coming up with good labels requires looking carefully at the enrichments (from running OverlapEnrichment) and emissions heatmaps to decide what you want to name them.