I am having trouble resolving genomic positions from COSMIC, as some mutations simply do not seem to match the reference nucleotides. For example, I am looking at this mutation:
http://cancer.sanger.ac.uk/cosmic/mutation/overview?id=1272062
It says that G mutated to A, but on that position in hg38 the base is A. Reverse complementing doesn't solve the problem (I know that COSMIC has strand specific entries).
Also, looking at the COSMIC browser representation, we see that the mutation starts one position earlier:
All in all, I am thoroughly confused. The mutation is discovered in previous reference builds, so it persists, and there seems to be more of those. Anyone have an idea what is going on?
Thanks in advance!
It seems you discovered a bug :)
I checked on hg19 and it's looks like a G on this position but your gene doesn't exist on hg19 ... what a mess ..
I have a similar issue. There are many COMIC mutations in my data which cannot be mapped to the reference. One example as below
COSM1000001 . It says G>A mutation at GRCh38, 19:51417106..51417106 position. But on this position the actual base is C. Please check this. On the same position, a dbsnp mutation occurs rs747638044 with C/A/T. The ensembl database also shows C as the ancestral base on this position. link.
Am I missing something here? I need to analyse the wild type and the mutated flanking sequences of these mutations. Please let me know how can I go forward in such cases.
Apparently, this particular mutation is on the negative strand. That solves the problem.