Filter reads by calculating the distribution of phred score in each individual read
1
0
Entering edit mode
6.2 years ago
Nagesh ▴ 10

Dear All, I would like to know if there is any tool or script to calculate the distribution of PHRED score for each individual read and filter, if it has the high distribution low quality bases by providing cut-off value. Thanks in advance.

next-gen RNA-Seq sequencing • 2.4k views
ADD COMMENT
1
Entering edit mode

You mean a preprocessing tool as FastQC or fastp ?

ADD REPLY
0
Entering edit mode
ADD REPLY
0
Entering edit mode

Is bbduk what you are looking for?

bbduk.sh in=reads.fq out=clean.fq maq=10

This will discard reads with average quality below 10. If quality-trimming is enabled, the average quality will be calculated on the trimmed read.
ADD REPLY
0
Entering edit mode

By calculating the average quality, we may miss reads which are having moderate distribution of low quality bases. For example I want to retain reads which are having less than 2% bases with 20 PHRED score.

ADD REPLY
0
Entering edit mode

Fastp has an option for that. BTW, in practice base qualities tend to be fairly bimodal.

ADD REPLY
0
Entering edit mode
6.2 years ago

As mentioned in the comments, bbduk and fastp can both do this.

ADD COMMENT

Login before adding your answer.

Traffic: 1553 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6