Where can I get RNAseq raw data (paired end fastq format) for expression analysis?
Where can I get RNAseq raw data (paired end fastq format) for expression analysis?
Dominating resources are Sequence Read Archive (SRA) database and European Nucleotide Archive (ENA).
From ENA you can download fastq files directly, from SRA you have to download .sra format and then convert it to fastq using fastq-dump
for example. I personally prefer SRA because I like its search and query building engine.
I am trying to download the data from the following link: https://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?exp=SRX317694&cmd=search&m=downloads&s=seq sra_data.fastq.gz >3.0GB as it is downloading since last 4hrs. I suppose I am downloading the correct file.
I downloaded the fastq.gz from https://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?exp=SRX317694&cmd=search&m=downloads&s=seq. But its usless, its 6GB zipped file..after extraction it is 17GB fastq file. Now I am splitting into paired ends using: fastq-dump --split-3 SRR925687. It is taking very long time. But I understood, should have directly used fastq-dump --split-3 SRR925687 command instead of downloading from the web.
The speed of downloading obviously depends on internet connection. But to speed up fetching you can use aspera https://www.ncbi.nlm.nih.gov/books/NBK242625/
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
You will get something relevant here I guess : Where Can I Find Fastq Data (Ngs Raw Data) And Published Results?