I'm a little confused on the format of phased variants in a VCF file. I was under the assumption that A|B
is equal to Paternal Haplotype | Maternal Haplotype
but I believe it's Haplotype Block A | Haplotype Block B
.
So in the case of
KID DAD MOM
0|1 0|1 0|1
How does one determine the parent of origin for the derived allele?
Lame. I was afraid this was the case. The phasing leveraged nanopore long reads, so the blocks are really long. You would think that phasing software that uses inheritance information would encode the parent of origin. Thanks for the insight. To add, if anyone knows of a software that can assign parent of origin to ambiguous cases like this, please comment.
That's new information, you didn't specify this before.
It should be feasible to identify the parent for each block if those are sufficiently big. You just have to find variant(s) specific to one parent to assign the entire block.