Entering edit mode
5.6 years ago
waqaskhokhar999
▴
160
I have > 1000 samples and I want to filter out variants based on minor allele frequency, My input dataset is a vcf file in this format:
CHROM POS ID REF ALT QUAL FILTER INFO FORMAT 88 108 139 159 265 350
1 55 . C T 40 PASS DP=6720;EFF=intergenic_region(MODIFIER||||||||||1) GT:GQ:DP ./.:.:. 0|0:36:4 0|0:32:9 0|0:30:4 ./.:.:. ./.:.:.
1 56 . T A 40 PASS DP=6785;EFF=intergenic_region(MODIFIER||||||||||1) GT:GQ:DP ./.:.:. ./.:.:. 0|0:32:9 0|0:30:4 ./.:.:. ./.:.:.
1 63 . T C 40 PASS DP=7053;EFF=intergenic_region(MODIFIER||||||||||1) GT:GQ:DP ./.:.:. 0|0:40:5 0|0:32:9 0|0:38:5 ./.:.:. ./.:.:.
1 73 . C A 40 PASS DP=8169;EFF=intergenic_region(MODIFIER||||||||||1) GT:GQ:DP ./.:.:. 0|0:40:5 0|0:40:9 0|0:38:6 ./.:.:. ./.:.:.
How can I keep snps with minor allele frequecny >= 0.05
see Updating allele frequency (AF) and minor allele frequency (MAF) INFO fields in .vcf
I am trying to compile vcffilterjdk but getting this error:
FAILURE: Build failed with an exception.
What went wrong: Execution failed for vcffilterjdk com.github.lindenb.jvarkit.tools.vcffilterjs.VcfFilterJdk.
Try: Run with --stacktrace option to get the stack trace. Run with --info or --debug option to get more log output. Run with --scan to get full insights.
Get more help at https://help.gradle.org
BUILD FAILED in 0s 1 actionable task: 1 executed
as you can see "http://central.maven.org/maven2/com/github/samtools/htsjdk/2.19.0/htsjdk-2.19.0.jar" exists.
are you running behind a proxy ? try to fix it: https://stackoverflow.com/questions/5991194
I have fixed the proxy settings and VcfFilterJdk took around 3 hours to complete the process, but it just updated the header, it didn't updated the INFO field, do I need to update the AN/AC fields first and then have to apply this script, as you can see from INFO (DP=6720;EFF=intergenic_region(MODIFIER||||||||||1)) filed that i don't have AN/AC tags in it? The currnet output of VcfFilterJdk is: