Hi all,
I've annotated a eukaryotic genome (11,600 genes) using GenSAS, and after creating a merged GFF3 file of all features, I have noticed that the gene accession number of the best matching blastp search result has been put in 'notes=' (underlined in attached image). Could anyone help me by telling me tool that could automatically replace the NCBI accession for the protein with the name, and also, how then to make an annotated whole genome file ready for submission to NCBI. I tried using ANNIE, but this only seems to work for Swissprot accession numbers. I am aware of GAG, but I am stuck trying to produce a an annotation file with the names of the proteins instead of accession numbers to feed into GAG. Any ideas?
Many thanks in advance