I have downloaded some SNP data sets published in 2012 (http://www.biologiaevolutiva.org/dcomas/north-african-affy-6-0-data-henn-et-al-submitted/, http://mega.bioanth.cam.ac.uk/data/Ethiopia/).
I am trying to merge the data with recent data, but it seems to me that the data is using an older Hg build (I am assuming Hg18) and the rs numbers don't match my relatively new, existing data as well as I would expect them.
For example, here is a site that corresponds in those two 2012 map files
1 rs7519837 1.06103 1500664
1 rs7519837 0 1500664
but when you search for it in dbSNP, the coordinates are different.
I was wondering, how can I individually update the coordinates and rsIDs of these map files? Thanks!
I have a question related to this, so I thought I might just stay on the same thread:
I have a VCF file of variants (what else?) from hg18-aligned sequences. I need to convert these variants to hg19.
My question is: do I need to be concerned about the difference in sequence between the two genome versions?
For example:
One of my variants is at chr10 8365, and the hg18 reference sequence is a T, but our bamfile found a C at that spot.
In this case, the hg19 also has a T at that spot, but could it possibly be different? And if so, are there any tools to account for sequence position differences?
Thanks in advance!
Wyatt
Detailed post on tools for converting coordinates between genome builds: Converting Genome Coordinates From One Genome Version To Another (Ucsc Liftover, Ncbi Remap, Ensembl Api)