ensembl transcript id to gene symbol using biomart
1
0
Entering edit mode
7.3 years ago
hxlei613 ▴ 100

Hi. I am using biomart to convert ensembl transcript id to gene symbol. However there seems to be something wrong. Because in the output there is an ensembl transcript id which has two symbol. Here are my script.

> ensembl <- useMart(host="dec2013.archive.ensembl.org", biomart = "ENSEMBL_MART_ENSEMBL",dataset = "hsapiens_gene_ensembl")
> result <- getBM(attributes=c("ensembl_transcript_id", "hgnc_symbol"), filters = "ensembl_transcript_id", values = values, mart= ensembl)

The output is :

ensembl_transcript_id   hgnc_symbol

 ENST00000226218         SEBOX

ENST00000226218           VTN

In the script I use Ensembl 74: Dec 2013. Then I try Ensembl 75.The output is the same for Ensembl 75. However when I use the archive Ensembl 74 http://dec2013.archive.ensembl.org/index.html to search this transcript id myself, it only shows symbol VTN.

Thanks for any comments.

R biomart EnsEMBL • 11k views
ADD COMMENT
0
Entering edit mode

take a look at the id history on the website

ADD REPLY
1
Entering edit mode
7.3 years ago
Emily 24k

Both have been associated as external references in 74: http://dec2013.archive.ensembl.org/Homo_sapiens/Gene/Matches?db=core;g=ENSG00000109072;r=17:26691290-26700110

Pick the gene name rather than the HGNC symbol as attributes to get the gene name that was actually selected for the gene, rather than all that match.

Better yet, don't use four year old databases, use the up-to-date one which only has one HGNC external reference: http://www.ensembl.org/Homo_sapiens/Gene/Matches?db=core;g=ENSG00000109072;r=17:26691290-26700110

ADD COMMENT

Login before adding your answer.

Traffic: 1959 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6