Hello,
I have downloaded a file from the SRA repository. Can I extract the fastq files from it? and how?
Hello,
I have downloaded a file from the SRA repository. Can I extract the fastq files from it? and how?
Use fastq-dump
-
Usage Examples(stolen from link):
fastq-dump -X 5 -Z SRR390728
Prints the first five spots (-X 5) to standard out (-Z). This is a useful starting point for verifying other formatting options before dumping a whole file.
fastq-dump -I --split-files SRR390728
Produces two fastq files(--split-files) containing ".1" and ".2" read suffices (-I) for paired-end data.
Also if you want to download the fastqs directly - next time try the following: Fast download of FASTQ files from the European Nucleotide Archive (ENA)
You might want to check fasterq-dump
, which is advertised as "a faster fastq-dump".
Personally, I don't think the disk space you save by using the SRA
instead of FASTQ
is worth the trouble of going through the painful prefetch
, fastq-dump
, I always check ENA first for the fastq files.
fasterq-dump
is based on my limited testing slower than parallel-fastq-dump
given the same number of threads but an improvement towards fastq-dump. It does not offer gzip-compression, so it is nonsense for large datasets such as WGS imho. I would always start from sra though since all *_dump tools are too unstable to directly load fastq from NCBI. Still, pulling from ENA via Aspera is the fastest. One should be aware of the alternatives though since ENA will never mirrow restricted data such as dbGaP datasets.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
sra-explorer : find SRA and FastQ download URLs in a couple of clicks
SRA from NCBI?
Downloading paired end fastq from SRA
How to convert to .SRA files to .FQ (FASTQ)
Downloading SRA data using the SRA Toolkit
how to get SRA data
How to split paired end SRA file into 2 correct fastq files
sra to fastq
How to convert .sra to .fa instead of .fastq?
Download fastq files from SRA database using biosample identifier
Fast download of FASTQ files from the European Nucleotide Archive (ENA)