How does assigning RSID (dbsnp IDs) work when there are multiple variants at the same line? For example, if the genotype is 1/2, and of of these variants have a distinct RSID, would they both be assigned, separated by comma? And what about multi-sample VCFs, is the situation the same (e.g. 0/1 and 0/2)? Are there any limitations or recommendations on this?
Thanks for your reply, I didn't expect that but I see examples of it when I look through dbSNP. Does that mean a variant in dbSNP can be uniquely identified by {chr:pos_ref}? What about indels? Are they all bundled together as long as the position is the same or is this more complex?
Hello again,
please use the
ADD REPLY
button below a post you like to comment to. I moved your post to a comment now, but as you see this is not perfect.Unfortunately rsIDs are not that unique as one would think. What you can take for sure is that an ID points to
CHROM
andPOS
. But you don't know what changes are behind such a number. AFAIK an rsID is always of the same type (SNV or INDEL). In case of INDEL I guess their can be of different length and still have the same rsID. In case of SNV the ALT can also have multiple values and the rsID is still the same.Especially for INDELs it is possible that you have multiple IDs for the same variant, because this kind of variant can be described often in many ways.
fin swimmer