I have downloaded public single-cell RNA sequencing data with the following GSM IDs: GSM5718030, GSM5718026, GSM5718021, GSM5718019, and GSM5718013. The data was generated using the Illumina HiSeq 2500 platform, and the sequencing was implemented through the 10x Genomics Chromium platform according to the recommended procedures.
I am attempting to process the data using CellRanger, which should be compatible since the data was processed with the Chromium platform. Here is the code snippet I'm using for mapping:
# run cellranger count (maximum CPUs 8; maximum RAM 24GB)
cellranger count \
--id=${srr_id} \
--transcriptome=$ref \
--fastqs=$fastq \
--sample=${srr_id} \
--create-bam=true \
--chemistry=auto \
--localcores=8 \
--output-dir=${srr_id}_results
However, I encountered the following error during chemistry detection:
[error] Pipestance failed. Error log at: SRR17134429_results/SC_RNA_COUNTER_CS/SC_MULTI_CORE/MULTI_CHEMISTRY_DETECTOR/DETECT_COUNT_CHEMISTRY/fork0/chnk0-u43bcd1dc45/_errors
Log message: An extremely low rate of correct barcodes was observed for all the candidate chemistry choices for the input: Sample SRR17134429 in "/data/project/bio/shivashankar/sunho/old_data/scPBMC_breast/FASTQ_files2". Please check your input data.
- 0.1% for chemistry SC3Pv4
- 0.1% for chemistry SC3Pv3
- 0.1% for chemistry SC3Pv3HT
- 0.0% for chemistry SC5P-PE-v3
- 0.0% for chemistry SC5P-PE
- 0.0% for chemistry SC3Pv2
- 0.0% for chemistry ARC-v1
- 0.0% for chemistry SC3Pv3LT
It seems that the CellRanger is failing to detect the correct chemistry for the input data, resulting in a very low rate of correct barcodes for all candidate chemistry options.
Does anyone know why this might be happening? Is there something specific about the data or the code that I might need to adjust to resolve this issue?