Is anyone aware of a publication or pre-publication data set that includes the following for a single human tumor/normal sample pair:
- Illumina whole genome sequence (WGS) to at least 30-50X depth for both tumor and normal (e.g. blood). Preferably this would be HiSeq paired end data generated with v3 chemistry
- Illumina exome sequence data for the same tumor/normal pair but to considerably higher depth (say at least 150-200X)
- Illumina RNA-seq data generated for the same tumor sample. Bonus points if there is also RNA-seq for a matched normal sample (same tissue as the tumor as opposed to blood DNA that would typically be used for a normal comparison at the DNA level).
Since this data would primarily be used for methods development, the tumor/normal pair could be from a tumor cell line and matched lymphoblastoid 'normal' cell line derived from the same individual. Some data along these lines is being made available here:
TCGA Mutation/Variation Calling Benchmark 4 at CGHub
https://cghub.ucsc.edu/benchmark_download.html
However, this is whole genome data only. No exome or RNA-seq data yet. Plenty of RNA-seq data can be found in GEO and elsewhere but I'm not aware of any projects where there is corresponding WGS or Exome data. Plenty of exome data and WGS data are being generated for TCGA but again I'm not aware of any publications describing combinations of all three types.
Large scale cancer sequencing projects that might have performed such a comparison:
The Cancer Genome Atlas (TCGA)
Cancer Genome Project (CGP)
International Cancer Genome Consortium (ICGC)
Didn't explore it in detail, but I think this is what ICGC http://icgc.org/ is trying to do - put together all data in one place for the same set of patients. Dataset summary http://dcc.icgc.org/pages/summary/
Hi Malachi,
Did you find some rnaseq data from normal/tumor pair on any cancers, which are processed.
But can I download matched normal/tumor paired exome data from icgc ? I want to work with bam files, is it possible to download them from icgc? I tried from TCGA but for WES the samples are not open. I want to get access to normal/tumor serous ovarian cancer exome data. I want the aligned files. TCGA does not have it open but does any other portal have them open?