Downloading individual reads from SRA
2
0
Entering edit mode
5.9 years ago
mnmalash • 0

How can I download selected individual reads from SRA without downloading the whole run file using command line?

sequence sequencing next-gen gene • 2.4k views
ADD COMMENT
3
Entering edit mode
5.9 years ago
vkkodali_ncbi ★ 3.8k

You can probably use fastq-dump to download SRR1803613.479767.1 as follows but I haven't tried paired reads, etc with this.

fastq-dump -A SRR1803613 -N 479767 -X 479767 --fasta
ADD COMMENT
0
Entering edit mode

perfect. This is what I want. I will go for this and see what I will get and will try it for paired end reads and tell you. I will work on both.

ADD REPLY
0
Entering edit mode

It worked well with single ends and paired ends. However, in paired ends, it downloads both not either of them alone. '-N' and '-X' options do not accept other than numerals. They do not accept the dot (.1 or .2) do you have a way to download either of them when wanted?

ADD REPLY
0
Entering edit mode

I got an error message due to the missing parameter for the fasta option. I give a corrected example here: C: How to retrieve an individual read with fastq-dump

ADD REPLY
0
Entering edit mode
5.9 years ago
h.mon 35k

What do you mean by "download selected individual reads"? How do you know which reads you want to download?

fastq-dump allows streaming the output, so you can combine it with head to download the first whatever number of reads you want. However, there is random access to these files, so you can't access "individual" reads, and there is no way of skipping download a certain number of reads - you can combine head and tail, for example, but you still have to download the reads before discarding them.

ADD COMMENT
0
Entering edit mode

I mean downloading individual reads I know their IDs just like this for example SRR123456.4564 and don't want to download the whole SRR123456 run file which may be large and will take time. Provided I want a list of reads from many SRA runs.

ADD REPLY
1
Entering edit mode

Just out of curiosity, how / why do you know which reads do you want?

ADD REPLY
0
Entering edit mode

I have already searched the whole sequences for HMMs of interest and those are the positive reads and their IDs are in the report. I want them separately.

ADD REPLY

Login before adding your answer.

Traffic: 1966 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6