gatk SelectVariants odd behavior
1
0
Entering edit mode
5.6 years ago
jon.klonowski ▴ 210

gatk SelectVariants is not working properly for me...

I have a file of SNP ids (i.e chr1:861219:G:C) delimited with /n and a vcf where the snp ID is also formatted chr:pos:ref:alt (chr1:861219:G:C).

The list file of SNPs were generated by subsetting a portion of the vcf SNPs (based on 1000G frequency) when I try:

gatk SelectVariants --variant File_name.vcf.gz -O Output_file_name.vcf.gz --keep-ids SNPS_of_interest.txt

I get a file of just the headers.

and then when I try:

 gatk SelectVariants --variant File_name.vcf.gz -O Output_file_name.vcf.gz --exclude-ids SNPS_of_interest.txt

The output is the same as the input excluding a zless | wc -l of ~10

gatk selectvariants vcf filtering SNP • 2.6k views
ADD COMMENT
0
Entering edit mode

You may want to post on the GATK Support Forum.

ADD REPLY
0
Entering edit mode
5.0 years ago
cricgpu • 0

Although not explicitly stated in the gatk SelectVariants docs, make sure your filtering files have the suffix .list. In you example change SNPS_of_interest.txt to SNPS_of_interest.list. This also hold true for other filters eg. --keepIDs.

Hope this helps!

ADD COMMENT

Login before adding your answer.

Traffic: 2033 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6