I want to download all the SRA files in an SRA study (usually named as SRPXXXXX). One way to do this is to download using a "For loop", but that is slow.
Does someone know a better, and faster way to download all the SRA files in an SRA study?
I want to download all the SRA files in an SRA study (usually named as SRPXXXXX). One way to do this is to download using a "For loop", but that is slow.
Does someone know a better, and faster way to download all the SRA files in an SRA study?
Download them from the European Nucleotide Archive using Aspera and
GNU parallel
, see Fast download of FASTQ files and metadata from the European Nucleotide Archive (ENA).
Alternatively, if you want to stick with NCBI, make a list with accessions, line:
SRRXXXXX1
SRRXXXXX2
SRRXXXXXn
install GNU parallel and then use cat accessions.txt | parallel -j $JOBS "prefetch {}"
. In any case, install Aspera for faster download. See the NCBI documentation on it.
where $JOBS
is the number of parallel downloads, depending on your I/O capacity. This can then be converted to fastq wth fastq-dump
or (my recommendation) parallel-fastq-dump.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Fast download of FASTQ files and metadata from the European Nucleotide Archive (ENA)