Hello everyone,
I am trying to get the fastq files for the dataset with GEO number GSE122960. Unfortunately, fftp links for such files are not available (I could only find links to an SRA formatted fastq which I cannot use). So I decided to use fastq-dump...
I did the following:
prefetch SRR8085151
And then:
fastq-dump --split-files --outdir fastq --gzip --skip-technical --readids --read-filter pass --dumpbase --clip --defline-seq '@$ac.$si.$sg/$ri' --defline-qual '+' /content/SRR8085151 -I
However, fastq-dump returns a single fastq, instead of two fastqs as specified by --split-files. What am I doing wrong? The resulting file name is SRR8085151_pass_1.fastq.gz
Thanks so much!!!
It seems that the file was originally uploaded as BAM. Possibly the reads have incompatible name https://www.ncbi.nlm.nih.gov/sra/docs/submitformats/#bam-files . Please check original BAM from https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR8085151 -> Data Access https://sra-pub-src-1.s3.amazonaws.com/SRR8085151/D246ali2_possorted_genome.bam.1 (Sorry I didn't checked by myself because I only have so narrow internet access now.) (Update) oops sorry genomax has posted the nearly the same thing while I was checking the link data.
Hello, thank you so so much!! Here is the funny part though... I had tried that, but it turns out that that sample doesn't belong to GSE122960, but to GSE121600... I have been trying to understand what is going on here but I am unable... Specifically, the bam file you have linked corresponds to GSM3439913, which just isn't one of the sample of the GSE! Am I going crazy??
Where did you get SRR8085151?
In the page of GSE122960 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE122960 , There are comments "Raw data not provided for this record Processed data provided as supplementary file".
The comments in the same section of https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE121600 are "Raw data are available in SRA Processed data provided as supplementary file"
I assume GSE122960 does not have SRA entries.