Genome Annotation Output Files Naming
1
0
Entering edit mode
4.7 years ago
Fid_o ▴ 40

I ran this annotation command: for k in *.fasta; do prokka $k --outdir "$k".prokka.output; echo $k; done on hundreds of assembled sequences. Each sequence annotation output produced a folder named "sequence_name.prokka.output" but the actual files in the folders were named according to the date e.g. "PROKKA_03062020.gff" for all the folders.

Now I want the actual output be named with the actual FASTA file ID. Example: sequence ID of D02502 should be like this: "PROKKA_D02502.gff"

What do I add (or remove) from the code above?

Annotation genome • 2.4k views
ADD COMMENT
3
Entering edit mode
4.7 years ago
Mensur Dlakic ★ 28k
for k in *.fasta; do prokka $k --outdir "$k".prokka.output --prefix PROKKA_$k; echo $k; done
ADD COMMENT
0
Entering edit mode

Thank you sooooo much Mensur Dlakic.

I am learning Bioinformatics and your help is great.

ADD REPLY
0
Entering edit mode

Is there a way to put all prokka output from all fasta files in one folder rather than the traditional output (one folder for the output of each fasta)?

ADD REPLY

Login before adding your answer.

Traffic: 2729 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6