Hi all!
Does anyone know how to download all chip-seq data(in BED format) of a certain cell type(let's say GM12878) in ENCODE.
I tried search via matrix but can't select specific cell type for download. It can only select a series of cell type(Like I can select all immortalized cell lines, which include GM12878). I only want to download the data related to GM12878.
Any hints?
Thanks a lot for your time and help! : )
I know it is possible that I download several of files using the way you mentioned, but it will be inconvenient if I want to download a number of data
1) Check the Table icon and download option from the search result where they have instructions for batch/bulk download.
https://www.encodeproject.org/search/?type=experiment&replicates.library.biosample.uuid=d8ca0867-13cd-40df-9de0-29f9da53d935&status!=deleted&status!=revoked&status!=replaced&limit=all
2) I hope the above option works. If not this will be your next option,
https://www.encodeproject.org/files/ENCFF002COO/@@download/ENCFF002COO.bed.gz
https://www.encodeproject.org/files/ENCFF002CPY/@@download/ENCFF002CPY.bed.gz
Since each bed file (for different antibody) for this cell type placed under different project id, it will be difficult to reterive even using globbing function with unix. The only option is you can parse the highlighted ids from array to wget using simple bash. Also for that you have to collect all 52 pairs (or less if you choose filter for only chipseq) of ids from the search result manually.
3) Check the ftp site and use globbing using wildcard with bash script if your mentioned cell type is available,
ftp://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/
Thanks a lot!!! That really help me out. : )
Please upvote if the solution worked.