From a GEO GSM ID, how to obtain the corresponding raw file(s) hosted on SRA?
14.6 years ago
Nico ▴ 190

We often refer to sequencing libraries by their GSM number (from NCBI GEO). I'd love to find a way to obtain the corresponding information on GEO (such as the GSE number, or annotations / description / metadata) and on SRA, where the raw files are hosted (most of the time, that is...). From SRA, I'd like all the accession numbers (SRR, SRP, SRX and whatnot), but most importantly an automated way of downloading the files, usually .fastq.

I believe I can use the NCBI e-Utils (http://www.ncbi.nlm.nih.gov/geo/info/geo_paccess.html) for GEO, but I haven't found a way to link to SRA.

As I'd like to do this for >100 libraries, the more automated (or programmable) the solution, the better!

Any pointers?

Thanks,

fastq r bioconductor sra geo • 11k views
14.6 years ago

Check out these R packages:

They are pretty nice for doing this type of thing. From what you describe, SRAdb is probably where you would want to start.

Sean

Fair advertising: I am one of the authors of the packages.
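
For the e-Utils route mentioned in the question, here is a minimal Python sketch, not a tested pipeline. It assumes that searching the `sra` database for the GSM accession returns the linked SRA records (this usually holds for GEO-submitted libraries, but not always), and that the `runinfo` CSV exposes the `Run`, `Experiment`, `Sample` and `SRAStudy` columns. The function names (`esearch_url`, `runinfo_url`, `parse_runinfo`, `gsm_to_runs`) are made up for illustration.

```python
import csv
import io
import json
import sys
import urllib.parse
import urllib.request

EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def esearch_url(gsm):
    """Build an ESearch URL that looks up a GSM accession in the SRA database."""
    query = urllib.parse.urlencode({"db": "sra", "term": gsm, "retmode": "json"})
    return f"{EUTILS}/esearch.fcgi?{query}"


def runinfo_url(uids):
    """Build an EFetch URL that returns the run table (CSV) for SRA UIDs."""
    query = urllib.parse.urlencode(
        {"db": "sra", "id": ",".join(uids), "rettype": "runinfo", "retmode": "text"}
    )
    return f"{EUTILS}/efetch.fcgi?{query}"


def parse_runinfo(text):
    """Keep only the accession columns from a runinfo CSV.

    Assumption: the CSV has Run, Experiment, Sample and SRAStudy columns.
    Blank lines and repeated header lines are skipped.
    """
    wanted = ("Run", "Experiment", "Sample", "SRAStudy")
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        if row.get("Run") and row["Run"] != "Run":
            rows.append({key: row.get(key, "") for key in wanted})
    return rows


def gsm_to_runs(gsm):
    """Query NCBI for one GSM and return its SRR/SRX/SRS/SRP accessions."""
    with urllib.request.urlopen(esearch_url(gsm)) as resp:
        uids = json.load(resp)["esearchresult"]["idlist"]
    if not uids:
        return []  # nothing in SRA matched this GSM
    with urllib.request.urlopen(runinfo_url(uids)) as resp:
        return parse_runinfo(resp.read().decode())


if __name__ == "__main__":
    # Usage: python gsm2sra.py GSM... GSM...
    for gsm in sys.argv[1:]:
        for rec in gsm_to_runs(gsm):
            print(gsm, rec["Run"], rec["Experiment"], rec["Sample"], rec["SRAStudy"], sep="\t")
```

Once you have the SRR accessions, the files themselves can be fetched and converted to .fastq with the SRA Toolkit (for example `fasterq-dump SRR...`). For >100 libraries you will probably want to pause between requests to stay within NCBI's rate limits.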
