Download NCBI data using sratoolkit in anaconda
13 months ago
sooni ▴ 20

Hello

I would like to download the data from BioProject PRJNA834801 for analysis practice. I have installed sratoolkit in anaconda. How do I download all of the project's data (725 data sets in the BioProject) at once?

Please help me.

Thank you. :)

NCBI sratoolkit anaconda • 834 views

How do I download all the data (725 data sets in the BioProject) of the project I want to download at once?

This is a very basic task. What did you try after installing sratoolkit?


You can also use the program mentioned here: Access to fastq files on SRA Run browser

Note: This is close to 2TB of data so make sure you have enough bandwidth and storage available.
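
Before starting a download of this size, it may help to check free space on the target disk. The following is only a minimal sketch: the function name has_space is hypothetical (not part of sratoolkit), and the 2000 GB figure simply restates the rough 2 TB estimate above.

```shell
#!/usr/bin/env bash
# Sketch: check that a directory's filesystem has at least N GB available.
# has_space is a hypothetical helper name, not an sratoolkit command.
has_space() {
    local dir=$1 need_gb=$2
    local avail_kb
    # df -Pk prints POSIX-format output in 1024-byte blocks;
    # the 4th column of the second line is the available space.
    avail_kb=$(df -Pk "$dir" | awk 'NR==2 {print $4}')
    [ "$avail_kb" -ge $((need_gb * 1024 * 1024)) ]
}

# Usage (assumed threshold of roughly 2 TB):
#   has_space . 2000 && echo "enough space" || echo "not enough space"
```

The check only covers storage; it does not account for temporary files that the download tools may write elsewhere.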

13 months ago

You could use my bio package to get all the SRR numbers:

pip install bio

Then:

bio search PRJNA834801 --csv > runinfo.csv

And then loop over all the SRR numbers to fetch the data. Note that -X 1000 limits each run to its first 1000 spots; drop it to download complete runs:

cat runinfo.csv | csvcut -c 1 | parallel fastq-dump -X 1000 --split-files -O reads {}
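
If csvcut is not installed, the same loop can be sketched with standard tools. This is only an illustration under some assumptions: the run accession is taken to be the first column of runinfo.csv, the function names list_runs and print_commands are hypothetical, and the commands are printed for inspection rather than executed. The output directory uses fastq-dump's -O (--outdir) option.

```shell
#!/usr/bin/env bash
# Sketch: turn runinfo.csv into a list of run accessions, then print
# (not run) one download command per accession.

# Print run accessions from the first CSV column. The header line and
# anything else that does not look like an SRR/ERR/DRR accession is dropped.
list_runs() {
    cut -d, -f1 "$1" | tr -d '"\r' | grep -E '^[SED]RR[0-9]+$'
}

# Print the command that would be run for each accession (dry run).
# Assumption: an output directory named "reads".
print_commands() {
    list_runs "$1" | while read -r run; do
        echo "fastq-dump --split-files -O reads $run"
    done
}

# Usage:
#   print_commands runinfo.csv                 # inspect the commands
#   print_commands runinfo.csv | parallel -j 4 # run them with GNU parallel
```

Printing the commands first makes it easy to spot a stray header or malformed accession before committing to a multi-terabyte download.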