As the UCSC website showed us, the ENCODE project used chip-seq to find regulatory elements of human genome. Different type of antibody can combine to special element on DNA chain. I downloaded TFBS file containing the narrowPeak file and got coordinate of some elements. However, I could not get some more important elements directly, such as promoter, insulator, enhancer, silencer and LCR. There are some wig format files on UCSC, but those files just give tracks of those elements like the list below: http://genome.ucsc.edu/ENCODE/downloads.html
Regulation Super-track
DNase Clusters Digital DNaseI Hypersensitivity Clusters from ENCODE
Layered H3K27Ac H3K27Ac Mark (Often Found Near Active Regulatory Elements) on 7 cell lines from NCODE
Layered H3K4Me1 H3K4Me1 Mark (Often Found Near Regulatory Elements) on 7 cell lines from ENCODE
Layered H3K4Me3 H3K4Me3 Mark (Often Found Near Promoters) on 7 cell lines from ENCODE
Transcription Transcription Levels Assayed by RNA-seq on 7 Cell Lines from ENCODE
Txn Factor ChIP Transcription Factor ChIP-seq from ENCODE
How can I get coordinate of above regulatory elements? Thanks for your response!
Hi Gjain Thanks for your response about this problem. I followed the first advice to get data. And after searching on the UCSC website, I find that there is a bigWigToBedGraph tool. Is it the one I can use to get the coordinates of bigwig files? But I did not know how to use the tool on my own pc. Wait for your response again. My email is lixianfengdyx@126.com.
Hi Xiaofeng, If you are using a windows computer then you should install cygwin (http://www.cygwin.com/) and then run the program bigWigToBedGraph using these instruction:
bedGraphToBigWig in.bedGraph chrom.sizes out.bw where in.bedGraph is a four column file in the format: [?] [?] [?] [?] and chrom.sizes is two column: [?] [?] and out.bw is the output indexed big wig file. The input bedGraph file must be sorted, use the unix sort command: sort -k1,1 -k2,2 unsorted.bedGraph > sorted.bedGraph options: -blockSize=N - Number of items to bundle in r-tree. Default 256 -itemsPerSlot=N - Number of data points bundled at lowest level. Default 1024 -unc - If set, do not use compression.
bedGraphToBigWig v 4 - Convert a bedGraph program to bigWig.
The bigwig file will give you the coordinates.
Let me know if you have any other question.
Hi Gjain, as you known, bigWigToBedGraph tool from the "utilities" section is a binary file. I don't know how to use that tool on Linux environment.
Hi Xizfeng,
Here is what you can do.
bigWigToBedGraph
and put it in the folder where your bigwig files are.$
typecd path_to_the_folder_containing_files_and_binary
./bigWigToBedGraph your_bigwig_file.bigWig output_bedGraph.bedgraph
This should convert it for you. Once you have the bedgraph file, you can the coordinates from that file.
Please let me know if you have any troubles.