Entering edit mode
10 months ago
charms.charms
•
0
Hi I'm trying to download data using the SRA toolkit:
prefetch SRR24516563
and then I used:
fastq-dump SRR24516563 -split-files
The output are three different files SRR24516563_1.fastq, SRR24516563_2.fastq and SRR24516563_3.fastq , this is from a paired end sequencing experiment (single-cell RNA seq), I'm planning to use Cell Ranger to align the reads, my question is: usually the output are 3 files, how do I knew which files should I use ? Should I use files _1 and _2 ?
thanks
Note that you have to rename the files in order for cellranger to understand what is what.
https://www.10xgenomics.com/support/software/cell-ranger/latest/analysis/inputs/cr-specifying-fastqs
And yes, you can simply delete the very smallest file, it's just the sample index, and cellranger doesn't actually need it to be there.
This is a very naive question, since I'm new with this type of data, but how are able to tell the reads number, I tried looking at SRA webpage, but wasn't able to get further details.
thanks again
What do you mean by "reads number"? ATPoint is using their knowledge of single cell RNA-seq and read lengths to deduce which files you need to use.
If you look here you can see that there are three reads involved: https://trace.ncbi.nlm.nih.gov/Traces/?view=run_browser&acc=SRR24516563&display=metadata
Thank you for your reply !
A small educational note: if an answer was helpful, you should upvote it; if the answer resolved your question, you should mark it as accepted. You can accept more than one answer if they work. This will help future users that might find this post find the right answer.