The number of variations in the pan-genome is reduced compared to the variations in the input VCF file
1
0
Entering edit mode
13 months ago
Wenke • 0

Do vg filter out some variants during the construction of the pan-genome, and if so, what are the criteria for filtering? The number of variations in the pan-genome is reduced compared to the variations in the input VCF file. The command I used is "vg autoindex --workflow giraffe -v sv.vcf -r ref.genome -p sv -t 64" The number of variants decreased by almost half after undergoing vg deconstruct compared to the number of variants in the VCF used to construct the pan-genome graph. What could be the reason for this?

vg • 714 views
ADD COMMENT
0
Entering edit mode
13 months ago

If variants overlap in the genome, vg deconstruct will combine them into one locus with multiple alleles. If your input VCF has a lot of deletions, that's probably what the cause is.

ADD COMMENT

Login before adding your answer.

Traffic: 2049 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6