Entering edit mode
3.6 years ago
robert.murphy
▴
90
I have generated some EMBL files from gff3 format and am ending up with multiple CDS features at the same locus tag. From the ones I have checked, it is 2 CDS features per locus tag. Is this because there is one CDS on the forward strand and one on the reserves strand?
The tool I am wanting to use does not allow for multiple CDS on the same locus tag so how can I avoid this happening or a simple way to change it automatically?
Example EMBL: https://pastebin.com/LkzfB0nF
Is there a way I can avoid this? Or at least give different locus tags. I want to use antiSMASH and it does not allow multiple CDS at one locus tag
Prior conversion you can use AGAT to keep only longest isoforms.
This have solved the issue, thank you!
You can accept the parent answer to provide closure to this thread.