I am trying to analyze the reprocessed, consolidated Roadmap Epigenomics data, and I have downloaded the H3K4me1 data for male fetal brain from:
http://egg2.wustl.edu/roadmap/data/byFileType/alignments/consolidated/E081-H3K4me1.tagAlign.gz
Looking at the contents, though, I find it confusing how to interpret the data, because 1) the file has no header naming the columns, 2) the fourth column is always just 'N', 3) the fifth column is always '1000', and 4) the coordinates on different lines sometimes overlap.
Here are the first 5 lines:
chr1 10149 10185 N 1000 +
chr1 10153 10189 N 1000 +
chr1 10239 10275 N 1000 -
chr1 10314 10350 N 1000 -
chr1 13043 13079 N 1000 +
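
To check whether these observations hold beyond the first few lines, something like the following rough sketch could be used. It is only an illustration: the function name `summarize`, the generic labels `field4` to `field6`, and the local file name in the final comment are my own placeholders, not names taken from the Roadmap documentation. It also assumes the rows are sorted by chromosome and start position, and it only compares each row with the one immediately before it.

```python
import gzip
from collections import Counter

# The five lines shown above, embedded so the sketch runs without the download.
SAMPLE = """\
chr1 10149 10185 N 1000 +
chr1 10153 10189 N 1000 +
chr1 10239 10275 N 1000 -
chr1 10314 10350 N 1000 -
chr1 13043 13079 N 1000 +
"""


def summarize(lines):
    """Tally the values in fields 4-6, the interval lengths, and the number
    of rows that overlap the row immediately before them."""
    field4, field5, field6, lengths = Counter(), Counter(), Counter(), Counter()
    overlaps = 0
    prev_chrom, prev_end = None, None
    for line in lines:
        parts = line.split()
        if len(parts) < 6:
            continue  # skip blank or malformed lines
        chrom, start, end = parts[0], int(parts[1]), int(parts[2])
        field4[parts[3]] += 1
        field5[parts[4]] += 1
        field6[parts[5]] += 1
        lengths[end - start] += 1
        # Assumption: input is sorted, so only the previous row is checked.
        if chrom == prev_chrom and start < prev_end:
            overlaps += 1
        prev_chrom, prev_end = chrom, end
    return {
        "field4": dict(field4),
        "field5": dict(field5),
        "field6": dict(field6),
        "lengths": dict(lengths),
        "overlaps": overlaps,
    }


if __name__ == "__main__":
    print(summarize(SAMPLE.splitlines()))
    # For the real file (hypothetical local path), the same function could be
    # fed a text-mode gzip handle:
    # with gzip.open("E081-H3K4me1.tagAlign.gz", "rt") as handle:
    #     print(summarize(handle))
```

On the five lines above, this reports that every interval spans 36 bases and that only the second line overlaps the line before it.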
I have searched extensively to figure out what each column represents and how the data should be interpreted, but I could not find a good tutorial on it.
I would appreciate it if anybody could help with this.