Entering edit mode
9.2 years ago
dago
★
2.8k
I am trying to submit bacterial genomes to NCBI.
I use 'tbl2asn' to create .sqn file using the .fsa and the .tbl files obtained from the annotation I did using PROKKA.
However, I obtained many errors concerning the locus-tag.
Here some errors reported in the discrep file as example:
FATAL: DiscRep_ALL:INCONSISTENT_LOCUS_TAG_PREFIX::4718 features have locus tag prefix AD2.
FATAL: DiscRep_ALL:DISC_SOURCE_QUALS_ASNDISC::strain (all present, some duplicate)
DiscRep_SUB:DISC_SOURCE_QUALS_ASNDISC::169 sources have 'AD2' for strain
I looked into the help NCBI provide but I cannot really understand where is the problem and how to fix it.
Any suggestion?
It would be probably easier in that case to convert the gff output file into EMBL flat file with gff3toembl or EMBLmyGFF3 and submit it to the european database ENA, which is part to the INSDC.