How to fix Prokka`s annotation software error?
1
0
Entering edit mode
7.1 years ago

How to fix Prokka`s annotation software error:

./prokka.sh <file_name.fasta>

it returns:

[15:27:44] Using genetic code table 11. [15:27:44] Loading and checking input file: FastaSequences.fasta [15:27:44] Uh oh! Sequence file 'FastaSequences.fasta' contains duplicate sequence ID: SRR1181196.1

I have tried BBMap`s dedupe.sh in=FastaSequences.fasta am=f ac=f fo c pc rnc=f mcs=4 mo=100 s=1 pto cc qin=33 csf=stats.txt pattern=cluster_%.fasta dot=graph.dot out=result.fa

and it finds no duplicates

What to do to get an annotation files from Prokka?

prokka annotation fasta software error • 2.1k views
ADD COMMENT
0
Entering edit mode

Maybe post a bit of your file or the output from grep -A1 SRR1181196.1 FastaSequences.fasta

ADD REPLY
0
Entering edit mode

Maybe post a bit of your file or the output from grep -A1 SRR1181196.1 FastaSequences.fasta

ADD REPLY
0
Entering edit mode

Did you get this data from SRA? Appears that you must not have used the option to retrieve the original Illumina format fastq read headers. You could get the fastq files directly from EBI-ENA and use those with prokka.

ADD REPLY
0
Entering edit mode
7.1 years ago

Maybe post a bit of your file or the output from grep -A1 SRR1181196.1 FastaSequences.fasta

ADD COMMENT
0
Entering edit mode

just testing comments on this thread according to issue:

https://github.com/ialbert/biostar-central/issues/444

ADD REPLY
0
Entering edit mode
ADD REPLY

Login before adding your answer.

Traffic: 1191 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6