gene IDs not recognized by DAVID
1
0
Entering edit mode
10.0 years ago
catherine ▴ 250

I have a gene list generated by RefSeq data downloaded from UCSC genome browser, and they have IDs, beginning with NM or NP. (for example, NM_001032214) They are transcript_IDs.

And I'm gonna run GO term analysis by DAVID, by selecting identifier as RefSeq-mRNA, only 560/980 of my gene IDs are recognized. I don't understand why is this happened? What should I do to include all of my genes?

Thank you

gene GO-term • 4.7k views
ADD COMMENT
0
Entering edit mode

In addition to Denises answer, a lot of the data might be outdated. You probably have modern gene names, while Davids are 5 years old. Hence, no mapping exists.

ADD REPLY
0
Entering edit mode
10.0 years ago
Denise CS ★ 5.2k

There might not be anything wrong going here. Without seeing some of your examples, I suspect that many of your NMs (or NPs) actually correspond to the same gene entity. The DMD gene in human for example has 30 spliced isoforms, and lots of NMs and NPs cross referenced to at least 8 of those isoforms. See the DMD example in Ensembl. You may also want to confirm in BioMart what DAVID is telling you. BioMart allows you to convert the NMs and NPs IDs into gene IDs (Ensembl IDs, HGNC, Entrez Gene IDs, etc) and get the GO terms for each of them.

ADD COMMENT

Login before adding your answer.

Traffic: 2512 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6