Hello all,
I have a .csv file containing both gene IDs and the corresponding RefSeq ID (in the XP_ format) that I got from a genome annotation pipeline that I used to annotate a fungal genome that we sequenced. I would like to be able to find and add functional annotation descriptions (e.g. "isocitrate/isopropylmalate dehydrogenase") to each gene. My table is in the following format:
LS400g001490.m01 XP_007679962.1 LS400g001500.m01 XP_033673675.1 LS400g001500.m02 XP_007678865.1 LS400g001510.m01 XP_024777452.1 LS400g001510.m02 XP_031885916.1 LS400g001510.m03 XP_031885916.1 LS400g001520.m01 XP_033388258.1 LS400g001530.m01 XP_035366954.1 LS400g001540.m01 XP_001806663.1 LS400g001550.m01 XP_007699016.1
I can't really use a species-specific database of the sort that you might find in biomaRt since these accessions are from NCBIs entire fungal database. Is there a file or a tool that I can use to accomplish this that anyone is aware of?
Thanks very much!
Great, thank you so much!!!!