Empty gene names from Ensembl query using biomaRt
10.0 years ago
kindlychung ▴ 60

Here is the code:

require(biomaRt)
ensembl = useMart("ensembl", dataset="hsapiens_gene_ensembl")
# filters = listFilters(ensembl)
# filters[1:5, ]
# attributes = listAttributes(ensembl)
# ensembl_attr = cbind(attributes$name, attributes$description)
# head(ensembl_attr, 15)
getBM(c("hgnc_symbol", "start_position", "end_position"), filters=c("chromosome_name", "start", "end"), values=list(16, 1100000, 1250000), mart=ensembl)

Result:

   hgnc_symbol start_position end_position
1                     1103280      1105461
2                     1111627      1113399
3        TPSB2        1227272      1230184
4                     1206560      1207124
5                     1223639      1224143
6                     1156976      1157974
7                     1159548      1160176
8      CACNA1H        1153241      1221771
9        TPSG1        1221651      1225257
10      TPSAB1        1240696      1242554
11                    1148224      1148754

As you can see, some gene names are missing. Why is that?

10.0 years ago
Denise CS ★ 5.2k

It's because those genes do not have HGNC symbols; they have clone-based names instead. You may want to include "Clone based VEGA gene name" and/or "Clone based VEGA transcript name" in your attributes.
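
As a rough sketch of that suggestion (not tested against any particular Ensembl release): attribute names may differ between releases, so the code below searches the output of listAttributes() for clone-based names instead of hard-coding them. The extra attributes ensembl_gene_id and external_gene_name are my own additions, not part of the answer above, and query_region() and fill_missing_symbols() are hypothetical helper names. Running query_region() assumes biomaRt is installed and the Ensembl server is reachable.

```r
# Hypothetical helper: label each gene with its HGNC symbol where one exists,
# otherwise fall back to another identifier column (assumed to be present).
fill_missing_symbols <- function(genes, fallback = "ensembl_gene_id") {
  missing <- is.na(genes$hgnc_symbol) | genes$hgnc_symbol == ""
  genes$label <- ifelse(missing, genes[[fallback]], genes$hgnc_symbol)
  genes
}

# Hypothetical wrapper around the query from the question.
query_region <- function() {
  ensembl <- biomaRt::useMart("ensembl", dataset = "hsapiens_gene_ensembl")

  # Show any attributes whose description mentions "clone", so the exact
  # names can be checked before adding them to the query.
  attrs <- biomaRt::listAttributes(ensembl)
  print(attrs[grepl("clone", attrs$description, ignore.case = TRUE), ])

  genes <- biomaRt::getBM(
    attributes = c("ensembl_gene_id", "external_gene_name", "hgnc_symbol",
                   "start_position", "end_position"),
    filters = c("chromosome_name", "start", "end"),
    values = list(16, 1100000, 1250000),
    mart = ensembl
  )
  fill_missing_symbols(genes)
}
```

Calling query_region() should return the same rows as before plus a label column that is filled for every row, which makes it easier to see which entries simply lack an HGNC symbol.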
