Entering edit mode
11.4 years ago
lillo.sim
▴
50
Hi,
I was wondering if there was an easy was to annotate many ENSEMBL GENE IDs to find the function of each gene and involvement with disease if it has been found (from GWAS or lab studies or observation studies etc..). The problem is that I have many gene ids so I can't look up each one of them individually. Does anybody know if there is a package in R or a software that allows batch query for this information?
thanks
You might look into the NCBI2R package in CRAN. You should be able to use it to get GO terms, GWAS associations, and OMIM information. I should note that you'll need Entrez IDs rather than Ensembl IDs, but that's a trivial conversion.
This seems like a nice package, I'll try it out now! thank you
You could use the R interface to biomart to do Ensembl lookups, GO terms, etc. (see How to get Ensembl ID (gene, transcript, protein) mapping information? for links)
Thank you Alex, but biomaRt doesn't seem to contain enough clinical information, unless I am using it incorrectly?
and it is empty…
And I have tried any of the following attributes, but it is not really what I am looking for:
It tells me that it is "nuclear paraspeckle assembly transcript 1 (non-protein coding)", and the pathology is empty. I am looking for information of any clinical associations, for example from NHGRI GWAS catalogue or from laboratory findings… Is there a field like "pathology" maybe that is a bit more populated more? Maybe using GO, but how to get the description for these instead of the "go_id" only? Thank you
You have a typo in that first example (the Gene ID) perhaps that's why you don't get results?