GDC_Gene expression quantification values for matched tumor normal
1
0
Entering edit mode
8.2 years ago

Hi All,

i want to download Gene expression quantification data for tumor and matched normal pairs from GDC portal. Can someone suggest how can I do d same.

Thank you in advance

RNA-Seq • 2.0k views
ADD COMMENT
0
Entering edit mode
8.1 years ago
thomaskuilman ▴ 850

In R you can use the fantastic TCGAbiolinks package. You can find a thorough explanation here at the Bioconductor webpage. I have used the following script to do just what you want to do:

library("TCGAbiolinks")
library("SummarizedExperiment")

DIRPREFIX <- "/PATH/"

## Download raw counts for indicated sets
## Example of TCGA.data.sets: c("TCGA-LUAD", "TCGA-SKCM")
for (id in TCGA.data.sets){
    query <- GDCquery(project = id,
        ## get data.category using for instance TCGAbiolinks:::getProjectSummary("TCGA-LUAD")
        data.category = "Transcriptome Profiling",
        ## see link above for data.types and workflow.types
        data.type="Gene Expression Quantification",
        workflow.type = "HTSeq - Counts")
    GDCdownload(query, method = "client", directory = "~/Downloads/")
    data <- GDCprepare(query, directory = "~/Downloads/")
    save(data, file = file.path(DIRPREFIX, paste0(id, ".Rdata")), compress = "xz")
}
ADD COMMENT

Login before adding your answer.

Traffic: 2597 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6