Phred qual error after fastq trimming
1
0
Entering edit mode
4.3 years ago

Hi everyone, I would like to trim the beginning of all the reads in fastq file by a given length, before mapping to the genome with bowtie2. I have used Cutadapt:

cutadapt -u 48 -o output.fastq.gz input.fastq.gz

my fastq files after trimming looks like this:

gunzip -c output.fastq.gz | head

@NB502143:99:HFF7TAFX2:1:11101:4133:1019 1:N:0:ATCACG
CATGAAAAAGAGCTCATTTTCAGATGCAGGAATTCCTATCCG
+
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
@NB502143:99:HFF7TAFX2:1:11101:19790:1020 1:N:0:ATCACG
CATGATCCACTTTTCCACGCGCTTTGACGACCATTTTATAA
+
EEEEE<EEEEEEEEEEEEEEEEE<EE/EEAEEEEEEEEEEE
@NB502143:99:HFF7TAFX2:1:11101:6327:1020 1:N:0:ATCACG
CATGATCTCAGTAAAGGCATTTGTGGTTGTTAAGTAGCCATT

When I try to map it with bowtie, I get the following error message:

Saw ASCII character 10 but expected 33-based Phred qual.

I don't get this error if I map input.fastq.gz, so I suspect something wrong is happening during the trimming but I can't figure out what! Thanks for your help.

next-gen mapping bowtie2 cutadapt • 1.6k views
ADD COMMENT
0
Entering edit mode

I check with Fastqc, both files are Sanger / Illumina 1.9 encoded.

Tried to run:

cutadapt -u 48 --quality-base=33 -o output.fastq.gz input.fastq.gz

it didn't change anything.

However, Fastqc shows low read quality towards the end of the reads after trimming, which is not the case before trimming. So it's clear that the quality scores are been messed up, but I don't get how!

ADD REPLY
0
Entering edit mode
4.3 years ago
Shalu Jhanwar ▴ 540

I'd suggest first check quality encoding as suggested in this post. Then instruct Cutadapt to use correct quality scores.

ADD COMMENT
0
Entering edit mode

I check with Fastqc, both files are Sanger / Illumina 1.9 encoded.

Tried to run:

cutadapt -u 48 --quality-base=33 -o output.fastq.gz input.fastq.gz

it didn't change anything.

However, Fastqc shows low read quality towards the end of the reads after trimming, which is not the case before trimming. So it's clear that the quality scores are been messed up, but I don't get how!

ADD REPLY
0
Entering edit mode

To be more precise, it's the "per tile" quality wich changes, not the "per base"

ADD REPLY
0
Entering edit mode

Post before and after FastQC plots (How to add images to a Biostars post ). It is possible that your fastq files itself my be corrupted somewhere to generate that error.

ADD REPLY

Login before adding your answer.

Traffic: 2045 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6