Hi,
I am trying to download direct RNA seq data produced by oxford nanopore sequencing from SRA (SRP174366). I know how to use sratoolkit prefetch or fastq-dump to download fastq files, but I do not know how to download nanopore fast5 files from SRA?
Any help would be much appreciated,
Thanks, Vahid.
Is there a way to do this via command line (if not using AWS)? I used SRA Toolkit to grab fastq files for Illumina sequences, but I need a few Nanopore sequences as well now. I'd like to keep doing everything from command line, but it seems I can't do it with SRA Toolkit. Wondering if there is another way... (EDIT: Never mind. I found out from a colleague that NCBI normally prefers fastq instead of fast5 because of the size and format of fast5s. So the fast5 files are unavailable and I am just assuming that the only step that has been applied is conversion from fast5 to fastq, and no cleaning or trimming done).
How did you end up getting the Nanopore sequencing data off of SRA in fastq format? fastq-dump from SRA toolkit is giving me a segmentation fault error
Which accession number are you looking at? If you look at the
Data access
tab of that accession you may be able to download the fastq files directly.I am looking at SRR2037194 and SRR1980727. The data access tab as far as I can see does not give access to fastq files (I got a binary file when I downloaded via data access, which I am assuming is in SRA format). I was able to use prefetch to get the SRA file as well but when I use fastq-dump, it still gives me a segmentation error.
fastq-dump -X 25 -F SRR2037194
worked for me. If you can't make it work usingsratoolkit
then you could get fastq files from ENA/EBI. Links below.SRR2037194 (LINK)
SRR1980727 (LINK)
The fastq-dump command still doesn't work for me. But the ENA/EBI links will work great, thank you so much for your help!