Extracting annotation information from EntrezID
1
0
Entering edit mode
5.4 years ago
tomoya • 0

Now I have several EntrezIDs list (ex. 101171905, 101164258, 101173884, ...). By using this list, I'd like to add several information (gene name, gene symbol and description) to these lists. I know one of the way to extract these information is Biomart, but I would like to use NCBI database because EntrezID is NCBI ID. Can I use reutils or rentrez? I would be grateful if you could help me.

R gene • 1.1k views
ADD COMMENT
0
Entering edit mode

what about esummary -db gene -id YOUR_ID ? (The esummary is also commandline tool)

ADD REPLY
1
Entering edit mode

There is a typo in this comment. It should be -db gene.

A more complete solution would be something like this using Entrezdirect:

$ efetch -db gene -id "101173884" -format docsum | xtract -pattern DocumentSummary -element Name,Description
LOC101173884    dual specificity protein phosphatase 13
$ efetch -db gene -id "101171905" -format docsum | xtract -pattern DocumentSummary -element Name,Description
slc16a1 solute carrier family 16 member 1
ADD REPLY
0
Entering edit mode

Thank you for several suggestions and I successfully earn the annotation information by using this command.

ADD REPLY
0
Entering edit mode
5.4 years ago
BBioinf • 0

If you can handle large tables, then these information are available on the NCBI ftp site. (Based on your example you are interested in fish genes)
ftp://ftp.ncbi.nih.gov/gene/DATA/GENE_INFO/Non-mammalian_vertebrates/All_Non-mammalian_vertebrates.gene_info.gz

Also the NCBI information and the ENSEMBL information are the same in most of the cases, so sometimes it is easier to use ENSEMBL due to the simpler tools.

ADD COMMENT

Login before adding your answer.

Traffic: 1632 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6