Hello! I have a list of about 6200 genes in symbol nomenclature (e.g: TP53) which results from having done a DE analysis on the LIHC-TCGA data.
I needed to convert it to Entrez in order to continue my workflow so I used the org.hs.eg.db
package, but there were about 800 genes that couldn't be converted. When I took a closer glance and googled some of these genes, I saw that the reason was that they were annotated with an old name (e.g: MCUB was annotated as CCDC109B and LAMTOR1 as C11orf59). This is fairly easy to find because the NCBI gives the official symbol and then below an "Also known as" title with other non-official and/or former names.
Now, I would like to convert these genes to the official name and ultimately to Entrez since they represent about 13% of my DE genes and I think it's a shame to just ignore them, but obviously doing it manually would take me forever.
Is there any tool or resource which recognizes these unofficial former names and can convert them to the official symbol or to Entrez or to any other official nomenclature?
You could try HGNC's Multi-Symbol checker