Entering edit mode
5.2 years ago
Vladislav
▴
20
Hi, Biostars community.
At first, sorry for my English.
I have some set of mRNA transcripts ids e.g.: 'NM_007300.4', 'NM_007297.4', 'NM_007294.3' ... Well, I can to find which of them is canonical by using knownCanonical.txt and kgXref.txt from ucsc. 'NM_007300' in this case.
But I also have a set of their proteins ids, e.g.: 'NP_009231.2', 'NP_009228.2', 'NP_009225.1' ...
So, can you, please, tell me, how to find which of them depends to canonical mRNA transcript?
Thanks.
ID's you have above are basically cross-references to each other. Using EntrezDirect you can verify that:
If you want to convert Ensembl identifiers from
knownCanonical.txt
, you could use their REST API (random example from Canonical file) or BioMart.