I have a VCF where I'm applying filters. I want to filter by minDP and maxDP.
vcftools --vcf output.vcf --minDP 4 --maxDP 100 --recode --out output.filtered4
I've tried with version v0.1.13 and v0.1.15.
Parameters as interpreted:
--vcf output.vcf
--maxDP 100
--minDP 4
--out output.filtered4
--recode
After filtering, kept 56 out of 56 Individuals
Outputting VCF file...
After filtering, kept 626116 out of a possible 626116 Sites
Ok, maybe I have nothing to filter in that range. I tried also this:
vcftools --vcf output.vcf --minDP 10000 --maxDP 10000000 --recode --out output.filtered4
Parameters as interpreted:
--vcf output.vcf
--maxDP 1e+07
--minDP 1e+04
--out output.filtered4
--recode
After filtering, kept 56 out of 56 Individuals
Outputting VCF file...
After filtering, kept 626116 out of a possible 626116 Sites
But it's telling me that it kept everything! I don't understand.
How to filter with minDP and maxDP?
On their website, they are saying that all sites should have the "DP" FORMAT tag. I checked with this:
grep GT:PL:DP:SP:GQ output.vcf | wc -l
626116
This is the same number of sites that I have in my filtering. So I have everything, but it's not working. Am I the only one with this problem?
what's the diff ?
I don't really see where is the difference. The data is starting at line 84680. With diff, it's telling me that the files are different, but when I look into the files, they are not really different (with the flag -y), I can compare in two different columns)
The odd thing is that when I use vcftools with min-meanDP and max-meanDP it's working:
Basically, there is no difference... why it's not working in that case?