Using TCGABioLinks to query multiple cancer studies
1
0
Entering edit mode
3.8 years ago
ivykosater • 0

I have a list of cancer samples that I wish to download transcriptome data for using TCGABiolinks. I use the GDCquery() function for downloading data, however my list spans multiple TCGA projects. Here is my code

query <- GDCquery(project = "", data.category = "Transcriptome Profiling",
                  data.type = "Gene Expression Quantification", 
                  workflow.type = "HTSeq - Counts",
                  barcode = acc_numbers)

I am unsure what to put in the project argument. I do not think there is an argument for the TCGA pan cancer atlas, so does anyone know what I should do here?

RNA-Seq R TCGA • 974 views
ADD COMMENT
0
Entering edit mode
4 weeks ago

https://rdrr.io/bioc/TCGAbiolinks/man/GDCquery.html

See example from the manual to perform query on more than one study:

query.met <- GDCquery(project = c("TCGA-GBM","TCGA-LGG"),
                      legacy = TRUE,
                      data.category = "DNA methylation",
                      platform = "Illumina Human Methylation 450")

If you want all to perform across all projects then you can replace list of projects with TCGAbiolinks:::getGDCprojects()$project_idor filter it as needed.

ADD COMMENT

Login before adding your answer.

Traffic: 1928 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6