Hi all,
I have a list of bacterial RefSeq IDs corresponding to protein sequences (e.g., WP_007430823.1, WP_019686959.1, etc.). I need to retrieve the corresponding EntreZ IDs for these RefSeq IDs, in order to cotinue the RNA-seq downstream analysis (GO enrichment analysis ).
Here's part of these IDs: WP_007430823.1 WP_019686959.1 WP_016819208.1 WP_017428688.1 WP_017425888.1 WP_017426648.1 WP_013371668.1 WP_014280599.1 WP_010349214.1 WP_015737018.1 WP_016819923.1 WP_010347247.1 WP_017427551.1 WP_013373716.1 WP_019687916.1 WP_016822762.1 WP_013373542.1 WP_014278585.1 WP_010347544.1 WP_014278774.1 WP_016822628.1 WP_019638366.1
I followed the post : [Convert RefseqID to EntrezID], and Couldnot find any match using all the following databas-tools;
NCBI Batch Entrez tool
https://david.ncifcrf.gov/conversion.jsp
Uniprot (https://www.uniprot.org/id-mapping/)
https://useast.ensembl.org/Homo_sapiens/Tools/IDMapper
https://biit.cs.ut.ee/gprofiler/gost
Could someone guide me on the most effective way to obtain the EntreZ IDs for these RefSeq IDs, considering the issues I'm facing with the NCBI Batch Entrez tool? Are there alternative methods or tools that might be more suitable for this task?
Any help or suggestions would be greatly appreciated. Thank you!
I think you can try using R. https://cran.r-project.org/web/packages/rentrez/vignettes/rentrez_tutorial.html -Check out this tutorial. Hope it helps.