download all samples from a study in BioProject
1
2
Entering edit mode
9.1 years ago

How to download all samples (fastq files) in a particular study (e.g., PRJNA223640) from bioproject.

sequencing sra • 11k views
ADD COMMENT
0
Entering edit mode

i did manage to downlaod th run data one after each other.

ADD REPLY
7
Entering edit mode
9.1 years ago
piet ★ 1.9k

If you want to download from NCBI, then the key is to get the so-called 'runinfo' file. The 'runinfo' file is a simple CSV table which lists sequencing runs in the 'sra' database and their download links. After you have obtained the 'runinfo' file, you can write a short script to download the run data one after each other.

You can download the 'runinfo' file with wget:

wget 'http://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?save=efetch&rettype=runinfo&db=sra&term=PRJNA223640' -O - | tee SraRunInfo.csv
ADD COMMENT
0
Entering edit mode

great. could you show a sample script to download the run data?

ADD REPLY
0
Entering edit mode

This project comprises about 150 sequencing runs. Each run has a size of more than 1 Gb. You should have a good concept of how to organize and further process the data before actually starting the download.

ADD REPLY
0
Entering edit mode

yes. Is there any way to process this data.

ADD REPLY
0
Entering edit mode

Echoing ifudontmind_plzz would it be possible to get an example of the code to use the csv file for the download?

ADD REPLY
0
Entering edit mode

Is this command provided by piet still supposed to work? It returned an empty file for me.

[UPDATE] I realised my bioproject does not have SRA files, but "biosamples" - that's why it's not working..

ADD REPLY

Login before adding your answer.

Traffic: 2098 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6