Entering edit mode
6.8 years ago
Makplus T
▴
100
Hello, as a newbie, I do get some troubles in converting ensembl ID to gene SYMBOL.
For example, when I convert the "ENSG00000158164" to SYMBOL in the R library "org.Hs.eg.db", it has 2 symbol names. This is my code and output:
> select(org.Hs.eg.db, keys="ENSG00000158164",columns=c("CHR","SYMBOL","GENENAME"), keytype="ENSEMBL")
'select()' returned 1:many mapping between keys and columns
ENSEMBL CHR SYMBOL GENENAME
1 ENSG00000158164 X TMSB15A thymosin beta 15a
2 ENSG00000158164 X TMSB15B thymosin beta 15B
But actually the ensembl ID of TMSB15B is ENSG00000158427;
> select(org.Hs.eg.db, keys="ENSG00000158427",columns=c("CHR","SYMBOL","GENENAME"), keytype="ENSEMBL")
'select()' returned 1:1 mapping between keys and columns
ENSEMBL. CHR SYMBOL GENENAME
1 ENSG00000158427 X TMSB15B thymosin beta 15B
Could someone suggest me how to deal with it?
This looks like the results of a bug in the script that was used to make
org.Hs.eg.db
, can you report it to the package maintainer?I will update this post as soon as there is new progress.
Any update on this bug?
Hello These are two separate genes in Ensembl version 99: TMSB15A: https://www.ensembl.org/Multi/Search/Results?q=ENSG00000158164 TMSB15B: https://www.ensembl.org/Homo_sapiens/Gene/Summary?db=core;g=ENSG00000158427 I suggest to check which version this R library uses. Astrid Ensembl helpdesk