Question

Error in cellranger count

0

Entering edit mode

17 months ago

tien ▴ 40

Hello, I am trying to use cellranger for aligning data. I have two different datasets. I could successfully run cellranger count on the first one. However, the second dataset prompted error as following

[stdout]
chunk_reads vVERSION
chunks: [("R1", "/test/Batch1_S1_L006_R1_001.fastq.gz", false), ("R2", "/test/Batch1_S1_L006_R2_001.fastq.gz", false)]
opening: "/test/Batch1_S1_L006_R1_001.fastq.gz"
opening: "/test/Batch1_S1_L006_R2_001.fastq.gz"
got recs: 28810048
got recs: 48304343
error: corrupt gzip stream does not have a matching checksum
caused by: corrupt gzip stream does not have a matching checksum
------------
If you believe this is a bug in chunk_reads, please report a bug to support@10xgenomics.com.

running chunk reads: [['chunk_reads', '--reads-per-fastq', '5000000', '/test/SC_RNA_COUNTER_CS/SC_RNA_COUNTER/_BASIC_SC_RNA_COUNTER/CHUNK_READS/fork0/chnk0-u90ff8a153e/files/', 'fastq_chunk', '--martian-args', 'chunk_args.json', '--compress', 'lz4']]

I tried to apply cellranger on only these two files and the same error was returned. According to 10x , I ran md5sum -c on these two files, it returned OK for both. So I assumed that they were complete by using file checksums.

Is there any other reason or check that I should do? Or If anyone know how can I solve this problem?

Thanks in advance.

cellranger • 860 views

ADD COMMENT • link 17 months ago by tien ▴ 40

0

Entering edit mode

It may be best to validate if the files are not corrupt directly.

You can use a fastq validation program (https://github.com/nunofonseca/fastq_utils use fastq_info) to check.