Say, allele A and B define haplotype 1. allele A, B and C define haplotype 2.
If we identify A, B and C at the same time, two situation may occur: 1) they all could exist in the same chromosome. 2) A and B in one chromosome and C in another chromosome.
This opens ambiguity between genotype 1* and 2. We can't just assume A, B and C are most likely in the same chromosome (statistically linked) because that way genotype 1 won't exist, right?
Any comments are appreciated!
I think that the OP is referring to 'phasing'.
OP: you meant that they cannot exist on the same maternal or paternal chromosome right?
If you identify A, B, and C using standard NGS technology, you will find it very very difficult to determine whether they have occurred on the same maternal or paternal chromosome. Phasing information will not exist in the VCF.
You could yes determine phasing information via the latest 10x technology by using special adaptors that allow you to determine phasing (I recently did a whole genome experiment this way). Some genotype arrays may also allow it.
OP: you are correct in your assumption, though, in that we cannot know whether they have occurred on the same maternal or paternal chromosome, UNLESS they happen to all be homozygotic calls.
Now I may understand what the post means...
Do you mean "allele A and B define haplotype 1. allele A, B and C define haplotype 2"?
Yes, sorry for the confusion
See also Do multiple SNPs exist in same chromosome to be called allele?