I presume most simple way is to write a perl script but thought I would check not missing a filter option rather than go down this route and future use. I've got GATK SNP and INDEL output. Filtered Indels to another vcf but want to filter only Indel insertions of 20 or more to another vcf. Is there an option for this in vcftools or other GATK post filtering software i.e. snpsift? I can't seem to see it.
getIndelLengths().0
returns the first element of the list isn't it ? what happens if there is more than one ALT allele ?I would be curious about this too if someone has the answer
It works, thanks but I want a min length of 20 not max (could use --maxIndelSize for this too). I'll look at the doc and post what I use for 20 and above indels filtering. changing the "<" around doesn't produce any results even when I go down to 5 even though I know there are indels of 5+ and above.
Got it for insertions only that are larger than 19 and less than 1000 as this: -select 'vc.getIndelLengths().0 > 19 && vc.getIndelLengths().0 < 1000'