Hi everyone,
I am looking for a database that maps NCBI protein IDs to Gene IDs, as well as a gene sequence database. I have a comprehensive list of protein IDs and need to convert them to Gene IDs, and then get the corresponding sequences.
I should note that I tried an online conversion tool, but it ran too slowly because the list is huge.
I logged on to the NCBI Gene FTP site and found a couple of databases available for download. Is this the correct place to download from? I cannot find FASTA data there.
THANKS A LOT in advance for your advice and help!
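For a list this large, a local lookup against a downloaded mapping table is usually much faster than a web tool. Below is a minimal sketch, assuming a tab-delimited file with a header row that contains `GeneID` and `protein_accession.version` columns (the layout of `gene2refseq` on the NCBI Gene FTP site, as far as I know; please check the header of the file you actually download). The function names are made up for illustration.

```python
import gzip  # only needed for the usage shown in the comment at the bottom


def strip_version(accession):
    """Drop the version suffix: 'NP_000005.3' -> 'NP_000005'."""
    return accession.split(".", 1)[0]


def map_proteins_to_genes(rows, wanted):
    """Return {protein accession (no version): GeneID} for the wanted proteins.

    rows   : iterable of tab-delimited lines, first line is the header
    wanted : iterable of protein accessions, with or without version
    """
    wanted_base = {strip_version(acc) for acc in wanted}
    mapping = {}
    gene_col = prot_col = None
    for line in rows:
        fields = line.rstrip("\n").split("\t")
        if gene_col is None:
            # Header line; the first column name may start with '#'.
            header = [name.lstrip("#") for name in fields]
            gene_col = header.index("GeneID")
            prot_col = header.index("protein_accession.version")
            continue
        protein = strip_version(fields[prot_col])
        if protein in wanted_base:
            mapping[protein] = fields[gene_col]
    return mapping


# Assumed usage with a downloaded file (file name is an assumption):
# with gzip.open("gene2refseq.gz", "rt") as handle:
#     mapping = map_proteins_to_genes(handle, my_protein_ids)
```

The file is streamed line by line and only the wanted accessions are kept, so memory use stays small even though the full table is large.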
This needs clarification please. Proteins and genes are different concepts. Also there are many protein ID types. What exactly do you want to map to what?
Hi cdsouthan,
Thanks a lot for your reply. I finally sorted out the problem: I found the databases I needed on the NCBI FTP site. Actually, my protein list only includes NP_..., XP_... and YP_... accessions. Could you and others briefly explain the abbreviations (NP_, XP_, YP_, XM_) to me? THANKS!!
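For what it is worth, these are RefSeq accession prefixes. A small sketch of how a list could be sorted by prefix, with the meanings as I understand them from the RefSeq documentation (the dictionary and function names are just illustrative):

```python
# RefSeq accession prefixes asked about above, plus NM_ for context.
REFSEQ_PREFIXES = {
    "NM_": "mRNA record",
    "NP_": "protein record, associated with an NM_ or NC_ accession",
    "XM_": "predicted (model) mRNA from automated genome annotation",
    "XP_": "predicted (model) protein, associated with an XM_ accession",
    "YP_": "protein annotated on a genomic record without a separate transcript record",
}


def describe_accession(accession):
    """Return a short description based on the first three characters."""
    return REFSEQ_PREFIXES.get(accession[:3], "unknown prefix")


def group_by_prefix(accessions):
    """Group accessions by prefix, e.g. to handle NP_/XP_/YP_ separately."""
    groups = {}
    for acc in accessions:
        groups.setdefault(acc[:3], []).append(acc)
    return groups
```

In short, N* records are known or curated, X* records are computational predictions, and the second letter tells you whether it is an mRNA (M) or a protein (P).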