Dear all,
I have a list of NCBI GEO Series Accession numbers and Platform IDs, and want to download the raw data in batch. A previous post on Biostars presents a good example of batch download (How to download raw sequence data from GEO/SRA ), but that solution is based on project ID rather than GEO Series Accession number. Does anyone know how to work out this task? Thank you very much!
Can you post an example of the Accession number you are interested in? @Istvan's solution with eUtils should be able to accommodate your needs.
Thank you for your comment, genomax! The GEO Series Accession Number is something like GSE65022, and the Platform ID is like GPL19657. I want to get the SRA number something like SRR4024915.
This may be helpul batchentrez
Hi, Buffo, thanks for your comment! However, after uploading a list of Platform ID (eg, GPL19657), I could not get the SAR run number, which is something like SRR4024915.
Hi, Just in case you are only interested in SRR ids, SRA run selector is a very good option. You can either enter GSE65022 in the run selector and it should pull all the relevant metadata for you. For example is this url https://www.ncbi.nlm.nih.gov/Traces/study/?acc=GSE65022&go=go