Question

Prokka Python Script for Gene Annotation

0

Entering edit mode

5.0 years ago

Fid_o ▴ 40

I have over 900 assembled genomes which I want to annotate using prokka. Previously I have annotated few sequences by running each one by one, but with 900 I need a python script to automate the process.

Can anyone share a script that can be adopted to do my annotation, please.

Regards.

Python Prokka Script Annotation Linux • 2.1k views

ADD COMMENT • link updated 5.0 years ago by Mensur Dlakic ★ 28k • written 5.0 years ago by Fid_o ▴ 40

0

Entering edit mode

Why do you think you need python? Since this is a straight command line you should be able to do this via a shell script.

ADD REPLY • link 5.0 years ago by GenoMax 147k

score 1 · Answer 1 · 2019-12-12

I second genomax suggestion to do this in shell scripts. Assuming your files have .fna extension and that you have bash shell:

for i in *.fna
do
prokka --outdir ${i%.fna} --force --prefix ${i%.fna} --locustag ${i%.fna} --rfam --cpus # $i
done

If you have C-shell:

foreach i ( *.fna )
prokka --outdir $i:r --force --prefix $i:r --locustag $i:r --rfam --cpus # $i
end

If all of them are the same genus and/or kingdom, you could add --genus and --kingdom switches as well. You may want to remove --rfam if not interested in non-coding RNAs because that part of the search is slowest. Also, specify a number instead of # after the --cpus switch.