Do specialized Hi-C databases exist?
5
5
Entering edit mode
9.3 years ago
svet.sidorov ▴ 110

Dear colleagues,

Does somebody know, if there are specialized databases for Hi-C raw data (paired-end reads)? Of course, there are many raw Hi-C datasets on GEO, but I failed to find any specialized Hi-C database on the Web.

database Hi-C • 6.6k views
ADD COMMENT
0
Entering edit mode

Are you only interested in Hi-C or also interested in other chromosomal looping datasets such as 4C/5C and ChIA-PET

ADD REPLY
0
Entering edit mode

I'm interested predominantly in Hi-C, but other 'C' datasets maybe also interesting.

ADD REPLY
3
Entering edit mode
9.3 years ago
Ryan Dale 5.0k

I wouldn't expect anyone to duplicate storage of raw reads beyond what SRA and GEO do. But as far as aggregating multiple experiments, the WashU Epigenome Browser has long-range chromatin interaction experiment tracks available. Currently there are 73 tracks available for human, 27 each for mouse and fly.

ADD COMMENT
0
Entering edit mode

Ryan, thank you for the information!

ADD REPLY
6
Entering edit mode
7.9 years ago
Fidel ★ 2.0k

We have been working on an initiative to host and visualize Hi-C data. We download the data from SRA and processes it with our pipeline to generate downloadable matrix files (stored on hdf5 files). So far we have some data for drosophila, mouse and human available.

The website is:

http://chorogenome.ie-freiburg.mpg.de

Feedback and support is welcome.

A screenshot for human GM12878 cell lines looks like this:

enter image description here

ADD COMMENT
1
Entering edit mode

Fidel, this is amazing. A much-needed and much-appreciated initiative.

ADD REPLY
3
Entering edit mode
8.3 years ago

There is a LOGIQA (Long-range genome interactions quality assessment) database. However it hosts only quality scores (assessed over Hi-C, 4C, ChIA-PET, Capture-C, etc.). Nevertheless it is very convenient for searching data you look for.

ADD COMMENT
0
Entering edit mode

The above mentioned link is not working.

http://ngs-qc.org/logiqa/index.php

ADD REPLY
0
Entering edit mode
9.1 years ago
Bryan Lajoie ▴ 20

I don't believe there are any specialized Hi-C databases on the web...

(perhaps that will change with ENCODE3 + 4D Nucleome efforts)

Though Ryan is correct - I doubt anyone would duplicate what is already stored on GEO/SRA. Most paper submissions (GEO) include not only the fastq files but also some form of the interaction data. The data will either be in a tsv (3 column) format, tsv (matrix) format, or hdf5.

I think GEO will be your best bet for the time being.

ADD COMMENT
0
Entering edit mode
7.6 years ago
encoder • 0

This website might be helpful to search Hi-C datasets available: http://promoter.bx.psu.edu/hi-c/view.php

ADD COMMENT

Login before adding your answer.

Traffic: 1210 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6