Entering edit mode
2.3 years ago
Timotheus
▴
40
Hello,
I called variants using bcftools to obtain a two-sample vcf from which I excluded everything except biallelic SNPs using bcftools view -m2 -M2 -v snps
. Then, using gatk VariantFiltration
and SelectVariants
I obtained subset VCFs with different SNP classes. Surprisingly, the number of SNPs in different classes does not add up to the number of SNPs in my master biallelic VCF, and is lower by a few dozens. Assuming I considered all the possible SNP classes (like het in both samples, homo ref in one het variant in the other etc.), why could those numbers differ?
extract the the SNP from file 1:
extract the the SNP from the other files:
show us the difference(s):
Thanks! Hmm this did not work for me for some reason, will try to invesigate why