Entering edit mode
5.2 years ago
bsmith030465
▴
240
Hi,
I was trying to get started with scRNA seq analysis. I downloaded a test dataset from bioproject (NCBI). However, each sample is in three fastq files (e.g. SRR123_1.fastq.gz, SRR123_2.fastq.gz, SRR123_3.fastq.gz).
How do I process these in Kallisto? Do I need to combine all of these files (how?), or split each file into forward and reverse reads before combining?
Else, what fastq-dump command do I need to issue to download the forward and reverse reads as separate files? My current command is:
fastq-dump -I --split-files SRR123
thanks!
could you post the command of :
ok so I guess that the SRR123_3.fastq.gz is the sequencing barcode ( to multiplex multiple samples). Could you maybe post the link to the bioproject please ?
I got the fastq files by executing: fastq-dump --split-files --gzip SRR8611970
Looking at SRA webpage : https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR8611970 it seems that _1 = R1 ; _2 = R2 and _3 = sample index.
check here for more details : https://bioinformatics.stackexchange.com/questions/5178/what-is-the-index-fastq-file-sample-i-fastq-gz-generated-when-demultiplexing