Here is my script:
project='PRJNA1100523'
esearch -db sra -query $project | efetch -format runinfo > runinfo.csv
cat runinfo.csv | cut -d "," -f 1 > SRR.numbers
cat SRR.numbers | parallel fastq-dump --split-files --origfmt --gzip -X 1000 {} # test run: first 1000 spots only
cat SRR.numbers | parallel fastq-dump --split-files --origfmt --gzip {} # full download
I just want to download the SRR runs for this project.
PRJNA1100523 contains only 8 SRR runs, and their IDs are saved in SRR.numbers.
But when I run cat SRR.numbers | parallel fastq-dump --split-files --origfmt --gzip {}, FASTQ files for extra, unexpected SRR runs are also downloaded into the folder.
The 8 IDs in SRR.numbers are:
SRR28698742
SRR28698743
SRR28698744
SRR28698745
SRR28698738
SRR28698739
SRR28698740
SRR28698741
The unexpected files include SRR29377445, SRR29377574, and SRR29413198; I do not know where they come from.
I am sure these strange SRR IDs are not in SRR.numbers.
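For reference, a minimal sketch of how the accession list could be checked before downloading. It assumes the run accession is the first column of the runinfo CSV, and it drops the "Run" header row and any blank lines, which a plain cut would otherwise pass on to fastq-dump. The function name extract_runs and the demo file are hypothetical stand-ins, not part of my script; the "NA" values are placeholders.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print only well-formed run accessions (SRR/ERR/DRR)
# from the first column of a runinfo CSV, one per line, without duplicates.
extract_runs() {
    cut -d ',' -f 1 "$1" | grep -E '^[SED]RR[0-9]+$' | sort -u
}

# Stand-in for runinfo.csv so the sketch runs without network access.
# It mimics a header row, a blank line, and two of the expected runs.
demo_csv=$(mktemp)
printf 'Run,ReleaseDate\nSRR28698742,NA\n\nSRR28698743,NA\n' > "$demo_csv"

# Show what would be handed to fastq-dump.
extract_runs "$demo_csv"
```

With the real file, extract_runs runinfo.csv > SRR.numbers would replace the cut step, and grep -c -x -F SRR29377445 SRR.numbers would count exact matches for one of the unexpected IDs.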
I do not know what is wrong with my script, so I hope someone can offer some advice or a solution.
Thanks in advance.
Thanks, I will have a try.