Prokka Python Script for Gene Annotation
1
0
Entering edit mode
5.0 years ago
Fid_o ▴ 40

I have over 900 assembled genomes which I want to annotate using prokka. Previously I have annotated few sequences by running each one by one, but with 900 I need a python script to automate the process.

Can anyone share a script that can be adopted to do my annotation, please.

Regards.

Python Prokka Script Annotation Linux • 2.1k views
ADD COMMENT
0
Entering edit mode

Why do you think you need python? Since this is a straight command line you should be able to do this via a shell script.

ADD REPLY
1
Entering edit mode
5.0 years ago
Mensur Dlakic ★ 28k

I second genomax suggestion to do this in shell scripts. Assuming your files have .fna extension and that you have bash shell:

for i in *.fna
do
prokka --outdir ${i%.fna} --force --prefix ${i%.fna} --locustag ${i%.fna} --rfam --cpus # $i
done

If you have C-shell:

foreach i ( *.fna )
prokka --outdir $i:r --force --prefix $i:r --locustag $i:r --rfam --cpus # $i
end

If all of them are the same genus and/or kingdom, you could add --genus and --kingdom switches as well. You may want to remove --rfam if not interested in non-coding RNAs because that part of the search is slowest. Also, specify a number instead of # after the --cpus switch.

ADD COMMENT

Login before adding your answer.

Traffic: 2214 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6