In R, how to get the gene symbols of a GPL (like GPL570)? There is no gene symbols in the annotation for some GPL from GEO. I have lots of GPL from multi platforms. for these with no gene symbol, they have GB_ACC
, so more specifically, how to map GB_ACC
to gene symbol in R?
Tools like ailun can do this online or WSDL or Perl. But not in R or other language I master. Are there any other tools?
Please add some more details and context to your question.
Thank you. I've added some details.
Thank you. so the point is how do the authors of GEOmetadb (I noticed you are one of) map the GPL to
bioc_package
? When developers make the annotationbioc_package
, I believe they know the GPL.On the other hand, Maybe there are a limited bioc_package for the GPLs.
(sorry, I click the wrong button. I just want to reply @Sean)
We map them "by hand" by looking at the Bioc packages available and mapping those to GPLs. This is a curated list (and we are happy to get help), so it is not guaranteed to be complete.
In the CellMix source code, there is a data dir there is a file,
GPL2bioc.rda
. there are 55. Here it's wired. Why the authors of these annotation package (Bioconductor Package Maintainer) not use GPL ID or insert the variable into the annotation package?