Hi All,
I am working on public genotype data in opensnp. I found that different platform (e.g., 23andme, decodeme etc) uses different genome versions (e.g., GRCh37 or GRCh36) and therefore, I have found different chromosomal position for the same reference SNP (rs#) id. For few SNP ids, I checked in dbSNP that the genotype remains the same even though the genomic position is shifted from GRCh37 to GRCh38.
My question is that if I want to infer biology (SNP association to a phynotype) from those SNPs obtained form different platforms, do I need to convert genomic position from different genome versions to one version? Or, can I use rs# to call a particular genotype independent of genome version?
Thanks in advance!
Thanks @Brain for the comment. I also found relevant information in NCBI documentation here that "Depending on the organism, the net result of the alterations in assemblies and annotations mentioned above is that a SNP may be in an exon (or more generally, located in a gene) in one build, but then locate to an intron or UTR (or intergenic DNA) in the next build if the exon (gene) is removed between build releases. Once an organismÕs genome assembly comes closer to its finishing point, like that of the human genome, its annotation (including its SNP functional classification) will become more stable."
You may want to move this to an answer (from a comment) and then accept (green check mark) to provide closure to this question.