Convert Ensembl Gene IDs to Gene Symbols in R
I tried to convert Ensembl gene IDs to gene symbols using biomaRt and the org.Hs.eg.db annotation package in R, and also the biotools website, but some genes were not converted to symbols.
Why were some genes not converted, and is there a better solution?
Thanks
If you simply google some of them, you can see that these Ensembl IDs refer to transcripts from non-coding regions or from regions that are not annotated.
So, they have no symbols, right?
So naturally, you are not going to see them associated with any "Gene Symbol" when doing the conversion from "Ensembl ID".
Novel genes used to be assigned temporary cryptic placeholder symbols like AC010680.1 or LINC02050 or C1orf43. They recently stopped doing that in favor of just using Ensembl IDs, since those symbols were not particularly helpful. There was a blog post somewhere about this, but I can't find it.
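If you just need a readable label for every gene, one option is to keep the Ensembl ID wherever no symbol comes back. Below is a minimal sketch of that idea, not a definitive recipe. The helpers strip_version() and label_genes() are hypothetical names made up for this example, the second ID is a made-up placeholder standing in for a gene without a symbol, and the version suffixes are illustrative. It assumes your IDs may carry a version suffix such as ".17", which org.Hs.eg.db does not use in its ENSEMBL keys.

```r
# Drop a trailing version suffix (e.g. ".17") if present;
# org.Hs.eg.db is keyed on unversioned Ensembl gene IDs.
strip_version <- function(ids) sub("\\.[0-9]+$", "", ids)

# Hypothetical helper: use the symbol where one exists,
# otherwise fall back to the Ensembl ID itself.
label_genes <- function(ids, symbols) {
  stopifnot(length(ids) == length(symbols))
  no_symbol <- is.na(symbols) | symbols == ""
  ifelse(no_symbol, ids, symbols)
}

# ENSG00000141510 is TP53. The second ID is a made-up placeholder
# for a gene that has no symbol; both version suffixes are illustrative.
ids <- strip_version(c("ENSG00000141510.17", "ENSG99999999999.1"))

if (requireNamespace("org.Hs.eg.db", quietly = TRUE) &&
    requireNamespace("AnnotationDbi", quietly = TRUE)) {
  # mapIds() returns NA for keys it cannot map to a symbol.
  symbols <- unname(AnnotationDbi::mapIds(
    org.Hs.eg.db::org.Hs.eg.db,
    keys = ids, column = "SYMBOL", keytype = "ENSEMBL",
    multiVals = "first"
  ))
} else {
  # Packages not installed: hand-written stand-in for the lookup result.
  symbols <- c("TP53", NA)
}

labels <- label_genes(ids, symbols)
print(data.frame(ensembl_id = ids, label = labels))
```

Genes without a symbol then stay identifiable by their Ensembl ID instead of turning into NA or being dropped from the table.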
But how can I see their symbols in the unnormalized data? How were those symbols obtained?
I do not know why you are mentioning normalization now, since it has nothing to do with the ID of a transcript in your dataset. However, simple intuition suggests that if a transcript comes from a coding region, you would expect it to also have a "Gene Symbol". If the transcript comes from a non-coding region, you would not expect it to map to any "Gene Symbol". Transcripts from non-coding regions still have an "Ensembl ID" (for example, see here how this is possible: https://www.ensembl.org/info/genome/genebuild/ncrna.html).
I tried my best to explain it in the simplest way, so I hope it is clear. But I would recommend that you do some more reading, because these are pretty basic and straightforward concepts.