Entering edit mode
16 months ago
curious
▴
820
I have:
"TranscriptRefSeq" = NM_000110
"DNAChange" = c.1432G>T
iiuc the HGVS is the union of these:
NM_000110:c.1432G>T
I want to find the the VCF description of this variant (eg GRCh38:1:97549652:C:A).
I plug NM_000110:c.1432G>T
into variant validator and get:
"RefSeq variant accession numbers MUST include a version number"
If I add a version number it works:
NM_000110.4:c.1432G>T
My question is, is it not even possible to map HGVS without the version number with variant validator or otherwise? My data does not provide this version number :/
You can lookup the version history of that accession here: https://www.ncbi.nlm.nih.gov/nuccore/NM_000110?report=girevhist
Since the changes in v.3 and v.4 are substantial it may be tricky to interpret the changes without version numbers.
If you simply search with NM_000110 you will always get the newest revision.
Thanks, it goes beyond that as I have ~700 of these transcript accessions, none of which have versions. I would guess in the general case you need the version, please let me know if you think otherwise
If you need to lookup the version numbers of accessions, you could do it using EntrezDirect:
If you have 700 then provide a couple more examples and I can show you how to look those up using
epost
method.Looks like you need version numbers for the Variant validator site.
Below is a sample of what I have, I am a bit confused though you mentioned that the changes for
NM_000110.4
andNM_000110.3
are substantial, how do I know then for example which version the author of the below list was referring to when they wroteNM_000110
(or are you suggesting I just guess it is the most recent?).I am concerned that I will run into cases where where different cdna changes correspond to different VCF description changes (
eg GRCh38:1:97549652:C:A
) depending on which transcript accession version they are associated with. Or is this not an issue?If this analysis was done in the last year or so, you can (at some risk) assume that the latest version of the accession is being referenced (especially if you have no way to ask whoever generated this data). We can't really tell you one way or the other. If this is related to patient care you will want to triple check everything.
Here is how you can get the latest version numbers for accessions.
You need to ask whoever give your the data which gene model version they are using. If that's not available, the best you can do is to assume all 700 are from the same version of gene model, and then look up for all recent gene model versions to see which one has all 700 in it. If you find multiple gene model versions, you are out of luck.