In "Integrative eQTL-Based Analyses Reveal the Biology of Breast Cancer Risk Loci", the author did this analysis. Anyone can tell me how to do this? Thanks,
In "Integrative eQTL-Based Analyses Reveal the Biology of Breast Cancer Risk Loci", the author did this analysis. Anyone can tell me how to do this? Thanks,
The Cancer Genome Atlas (TCGA) "Level 3" CNV data is highly processed and provides mean copy number estimate of segments covering the whole genome. Are you having trouble locating the URL to download the data, or are you having trouble opening/interpreting the files provided in your download of Level 3 patient data? Here is a link to the TCGA "Data Matrix" which allows you to query the database and download requested samples.
I can't find a mention of a log2 ratio in TCGA documentation. Level 3 data is typically processed and normalized, but I believe it would be very clear in the documentation or in the file if a transformation was performed or a ratio was taken. Here's a description of the levels of data for different data types: https://tcga-data.nci.nih.gov/tcga/tcgaDataType.jsp . If you're still unsure, why not just start from the Level 1 data? It looks like this Bioconductor package/tutorial allows you to get segment means from raw/Level 1 TCGA data: http://www.bioconductor.org/packages/release/bioc/vignettes/cghMCR/inst/doc/findMCR.pdf .
If you want the latest segmentation files, you can download them from the Broad's Firehose archives (under the stddata folder):
http://gdac.broadinstitute.org/runs/
For example, for GBM:
http://gdac.broadinstitute.org/runs/stddata__2013_11_14/data/GBM/20131114/
and more specifically:
Alternatively you can use http://www.broadinstitute.org/tcga and use their own ready-to-go IGV webstart options.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
You'll get a better response if you provide some indication of having tried to research the question for yourself: e.g. linking to the article, have you read it, have you explored TCGA, if so what problems did you have.
Thanks. I have read the paper, and I am wondering "how to retrieve the segmented copy-number scores of the tumor samples and the paired-normal control from the level 3 data?". To my knowledge, the level-3 data gives the log2 ratio (segment-mean), am I right? Then how can the author get the copy-number scores for both tumor and control samples? Maybe they only mean to get the copy-number fold-change?