help with GDCprepare
0
0
Entering edit mode
6.7 years ago

Hello Sir/madam,

I'm using GDC data and analyzing the cancer data using TCGAbiolinks package of R. initially I'm following the TCGAbiolinks tutorials (https://bioconductor.org/packages/release/bioc/vignettes/TCGAbiolinks/inst/doc/analysis.html) to get familiar with the workflow. my R version is 3.4.3 and i have updated the Bioconductor too.

But whenever I run the GDCprepare command to prepare the data for further analysis after data downloading it throws following error,

# Query platform Illumina HiSeq with a list of barcode 

query <- GDCquery(project = "TCGA-BRCA",
    data.category = "Gene expression",
                  data.type = "Gene expression quantification",
                  experimental.strategy = "RNA-Seq",
                  platform = "Illumina HiSeq",
                  file.type = "results",
                  barcode = listSamples, 
                  legacy = TRUE)

# Download a list of barcodes with platform IlluminaHiSeq_RNASeqV2

GDCdownload(query)

# Prepare expression matrix with geneID in the rows and samples (barcode) in the columns

# rsem.genes.results as values

BRCARnaseqSE <- GDCprepare(query, save = TRUE, 
                           save.filename = "TCGA-BRCA_gene_exp.rda",
                           summarizedExperiment = TRUE)

 |=========================================================================| 100%Request to BioMart web service failed.
The BioMart web service you're accessing may be down.
Check the following URL and see if this website is available:
http://feb2014.archive.ensembl.org:80/biomart/martservice?type=registry&requestid=biomaRt
Error in fix.by(by.y, y) : 'by' must specify a uniquely valid column

and when i checked the given website, it showed something like this,

<MartRegistry>
  <MartURLLocation database="ensembl_mart_75" default="1" displayName="Ensembl Genes 75" host="feb2014.archive.ensembl.org" includeDatasets="" martUser="" name="ENSEMBL_MART_ENSEMBL" path="/biomart/martservice" port="80" serverVirtualSchema="default" visible="1" />
  <MartURLLocation database="sequence_mart_75" default="" displayName="Sequence" host="feb2014.archive.ensembl.org" includeDatasets="" martUser="" name="ENSEMBL_MART_SEQUENCE" path="/biomart/martservice" port="80" serverVirtualSchema="default" visible="" />
  <MartURLLocation database="ontology_mart_75" default="" displayName="Ontology" host="feb2014.archive.ensembl.org" includeDatasets="" martUser="" name="ENSEMBL_MART_ONTOLOGY" path="/biomart/martservice" port="80" serverVirtualSchema="default" visible="" />
  <MartURLLocation database="genomic_features_mart_75" default="" displayName="Genomic features 75" host="feb2014.archive.ensembl.org" includeDatasets="" martUser="" name="ENSEMBL_MART_GENOMIC" path="/biomart/martservice" port="80" serverVirtualSchema="default" visible="" />
  <MartURLLocation database="snp_mart_75" default="" displayName="Ensembl Variation 75" host="feb2014.archive.ensembl.org" includeDatasets="" martUser="" name="ENSEMBL_MART_SNP" path="/biomart/martservice" port="80" serverVirtualSchema="default" visible="1" />
  <MartURLLocation database="functional_genomics_mart_75" default="" displayName="Ensembl Regulation 75" host="feb2014.archive.ensembl.org" includeDatasets="" martUser="" name="ENSEMBL_MART_FUNCGEN" path="/biomart/martservice" port="80" serverVirtualSchema="default" visible="1" />
  <MartURLLocation database="vega_mart_75" default="" displayName="Vega 55" host="feb2014.archive.ensembl.org" includeDatasets="" martUser="" name="ENSEMBL_MART_VEGA" path="/biomart/martservice" port="80" serverVirtualSchema="default" visible="1" />
  <MartURLLocation database="pride_mart_1" default="1" displayName="PRIDE (EBI UK)" host="www.ebi.ac.uk" includeDatasets="" martUser="" name="pride" path="/pride/biomart/martservice" port="80" redirect="1" serverVirtualSchema="default" visible="1" />
</MartRegistry>

I have searched through the similar errors which were regrading the useMart, biomaRt and none were same as this error I'm not able to understand this problem. Please help me with this issue. any suggestions would be very helpful,

Thank you so much.

R GDC TCGAbiolinks GDCprepare • 3.3k views
ADD COMMENT
0
Entering edit mode

I was able to download and prepare it. Could be the biomart servers were down, so try again?. And if it still isn't working, you could consider downloading the developer version of tcgabiolinks if you haven't done that. Alternatively, it could be due to the fact that you're asking legacy data. Do you really need hg19 data? Otherwise, try leaving legacy=TRUE out?

ADD REPLY

Login before adding your answer.

Traffic: 1999 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6