Forum:Most important International Big Data Projects: Genetics and Epigenetics
2
6
Entering edit mode
6.3 years ago
Shicheng Guo ★ 9.5k

Here, I summarized the most important international bigdata project which need to be followed. All of them have large number of genetics and epigeneitcs data to be downloaded and to be used for next model building and knowledge achievement.

International Human Epigenome Consortium (IHEC)

NIH Roadmap Epigenomics Mapping consortium

The Canadian Epigenetics, Environment and Health research Consortium (CEEHRC)

CREST / IHEC - Team Japan

The ENCODE project

The German Epigenome Program - DEEP

International Cancer Genome Consortium

EpiGeneSys - Network of Excellence

Blueprint Epigenome Project

International cancer Genome Consortium Project

Pan Cancer Analysis of Whole Genomes (PCAWG) Project

Human Epigenome Atlas

Recount2: analysis-ready RNA-seq gene and exon counts datasets

big-data • 2.3k views
ADD COMMENT
4
Entering edit mode

What about TCGA?

P.S. A one line description of what type of data one can find in each of the project would be helpful.

ADD REPLY
1
Entering edit mode

I will follow All of US project. Just want to know when they will publish some preliminary data.

ADD REPLY
2
Entering edit mode

It may not be for some time. That project just got off to a start a couple of months ago.

ADD REPLY
1
Entering edit mode

As your estimation, how long time they need to publish the first paper? I guess Nature again, right? 3 years or 5 years? Which group will be charge of 'Data Analysis' section?

ADD REPLY
2
Entering edit mode

Your guess is as good as mine. It will all depend on enrollment and when they can actually start doing the analysis. Vanderbilt University Med Center, Verily (Google's life science arm) and Broad Institute have been named data centers but there may be additional ones as project gets underway.

ADD REPLY
1
Entering edit mode

Miss the big train again. Big project has been totally monopolized by these big organization.

Department of Biostatistics at Vanderbilt University

ADD REPLY
0
Entering edit mode

GTEX, transcriptomics though.

ADD REPLY
3
Entering edit mode
6.3 years ago
GenoMax 147k

Would be good to indicate public data availability. Probably others that are larger but not public.

Genomics England (10000 genomes from UK)
All of US from NIH - Just starting, million participants eventually.
Million veterans project in US.

Model organisms:
1001 Arabidopsis genomes
Collaborative cross Mice genomes.
E. coli evolution (50000 generations).

ADD COMMENT
1
Entering edit mode

I would add UK Biobank to that list since it's being Exome sequenced and Whole Genome sequenced. I think Tempus RNA stabilisation tubes were collected from participants in addition to samples for DNA sequencing (for phase 2 at least).

ADD REPLY
0
Entering edit mode

Hi genomax, do you know how to access individual-level Million veterans project data?

ADD REPLY

Login before adding your answer.

Traffic: 2288 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6