read quality of sequence by fastqc
1
0
Entering edit mode
8.3 years ago
nora ▴ 40

hello, please help me when i tried to read quality of sequence by fastqc in galaxy interface i received a message that says: No known encodings with chars < 33 ( the fasta file was downlaoded from genbank and uniprot) thank you in advance

software error • 2.6k views
ADD COMMENT
0
Entering edit mode

fasta files don't have quality lines. Can you paste an example of the input file?

ADD REPLY
0
Entering edit mode
ADD REPLY
0
Entering edit mode

On this page there are no fastq format files (hence you can't use FastQC, as indicated by @Asaf). If you are actually using a fastq formatted file then provide a direct link for it.

ADD REPLY
0
Entering edit mode

i converted the fasta file to fastq (tabular lines were written as FASTQ reads)

ADD REPLY
0
Entering edit mode

See convert FASTA into FASTQ using linux for information about why converting FASTA to FASTQ will not give meaningful quality scores (which is most of what FastQC operates on). Additionally, most of the FastQC metrics are only useful if you have more than one read, or sequence, in your file. It sounds like you have one FASTA file and want some information about its nucleotide content maybe?

ADD REPLY
0
Entering edit mode

I wanted to know what the meaning of this message No known encodings with chars < 33 because when I used the full file I have not got a result but when I used just a part of the sequence the fastqc tool works

ADD REPLY
0
Entering edit mode

Did you see the link to Wikipedia article on FastQ format that @Ido provided in his answer below?

BTW: How did you convert the fasta to fastq (did you use your own code)? Tools I know of (reformat.sh from BBMap) generally set all Q-scores to a fixed fake value for all bases.

ADD REPLY
0
Entering edit mode

i converted fasta file to tabular to fastq in galaxy interface

ADD REPLY
1
Entering edit mode
8.3 years ago
Ido Tamir 5.2k
  • The galaxy Q&A is https://biostar.usegalaxy.org/
  • check if its really a fastq file by looking at it and compare it to https://en.wikipedia.org/wiki/FASTQ_format (maybe its fasta?)
  • Unfortunately for beginners you might have to set the correct fastq format for some tools. I think its enough to get to "edit attributes" (oencil symbol) "datatype" and then I think its fastqsanger. The is also the fastq groomer tool but I think its not necessary. And fastqc normally copes with all format, maybe its really not fastq
ADD COMMENT

Login before adding your answer.

Traffic: 2570 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6