How to determine the encoding method of quality score in a fastaq file?
1
0
Entering edit mode
23 months ago
octpus616 ▴ 120

As you know, there are two general standards for the sequencing quality of fastq files: Phred+33 and Phred+64.

I have a very old fastq file that was generated on Genome Analyzer IIx platform (FC-104-50xx). I am not sure about the score of this fastaq has it gone through some conversions, is there some simple way to infer from the fastq total whether his sequencing quality score is Phred+33, Solexa+64 or Phred+64

fastq NGS samtools bwa • 2.1k views
ADD COMMENT
1
Entering edit mode

Try seqkit convert which converts FASTQ quality encoding between Sanger, Solexa and Illumina.

ADD REPLY
0
Entering edit mode

Since this file has been generated for a long time, I'm not quite sure if it has undergone some kind of conversion, is it possible to test the encoding way?

ADD REPLY
0
Entering edit mode

yeah, its works, thanks

ADD REPLY
2
Entering edit mode
23 months ago
Juke34 8.9k

You can use gaas_fastq_guessMyFormat.pl from GAAS

e.g. with conda

conda create -n gaas_env gaas
conda activate gaas
gaas_fastq_guessMyFormat.pl -i myfile.fastq.gz
ADD COMMENT
0
Entering edit mode

its also works, thanks

ADD REPLY

Login before adding your answer.

Traffic: 1909 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6