Entering edit mode
6.2 years ago
Nagesh
▴
10
Dear All, I would like to know if there is any tool or script to calculate the distribution of PHRED score for each individual read and filter, if it has the high distribution low quality bases by providing cut-off value. Thanks in advance.
You mean a preprocessing tool as FastQC or fastp ?
Try FastQC - https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Is bbduk what you are looking for?
By calculating the average quality, we may miss reads which are having moderate distribution of low quality bases. For example I want to retain reads which are having less than 2% bases with 20 PHRED score.
Fastp has an option for that. BTW, in practice base qualities tend to be fairly bimodal.