I understand that GATK's IndelRealigner tool helps in calling the right SNPs. But does one need to use it when calling only indels?
GATK's HaplotypeCaller can detect both SNVs and indels. It calls variants by local de novo assembly of the reads in each active region (kind of a local realignment), although it does not write a realigned BAM by default. So, in summary, there's no need to run IndelRealigner if you are going to call variants with HaplotypeCaller.
That being said, note that GATK4 has removed IndelRealigner altogether, as it is no longer needed if you are going to use GATK's own pipeline. As Devon says, using IndelRealigner does still make sense if you want to use any other variant caller (including GATK's UnifiedGenotyper) for whatever reason (GATK4's HaplotypeCaller definitely produces higher confidence calls than samtools+bcftools).
You don't even need to use it when finding SNPs, so there's no reason to use it for finding indels. It's mostly still around for those using UnifiedGenotyper rather than HaplotypeCaller.
Hello,
I often read people saying that HaplotypeCaller does local realignment. But that's not accurate: it does local de novo assembly.
From the manual:
fin swimmer
I question whether it's really de novo. I presume they're putting the reference sequence into their de Bruijn graph too (at least that's what I've done when implementing this sort of thing).
It's not 'true' de novo assembly, as the reads being assembled have already been mapped to a particular region of the genome. I've done a similar thing for SV calling, and I did it without incorporating the reference sequence into the de Bruijn graph. Whether you do or not, your assembly is already biased towards the reference allele due to the mapping step.
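To make the point about the reference and the de Bruijn graph concrete, here is a minimal toy sketch. It is not GATK's implementation; the sequences, the k-mer size, and the helper names `build_graph` and `enumerate_paths` are all made up for illustration. It builds the graph from two reads carrying a 2 bp deletion, once without and once with the reference sequence, and lists the haplotypes each graph spells out.

```python
from collections import defaultdict


def build_graph(seqs, k):
    """Build a de Bruijn graph: each (k-1)-mer maps to its successor (k-1)-mers."""
    graph = defaultdict(set)
    for seq in seqs:
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def enumerate_paths(graph, source, sink, max_len):
    """Spell out every source-to-sink path (DFS), giving up on paths past max_len bases."""
    paths = []
    stack = [(source, source)]
    while stack:
        node, spelled = stack.pop()
        if node == sink:
            paths.append(spelled)
            continue
        if len(spelled) >= max_len:
            continue  # guards against cycles in repetitive sequence
        for nxt in sorted(graph.get(node, ())):
            stack.append((nxt, spelled + nxt[-1]))
    return sorted(paths)


# Hypothetical toy data: the reads lack the "TA" present in the reference.
reference = "ATGGCGTACCTGAATC"
reads = ["ATGGCGCCTG", "GCGCCTGAATC"]
k = 4

reads_only = build_graph(reads, k)
with_reference = build_graph(reads + [reference], k)

# Reads only: a single haplotype (the deletion allele) is assembled.
print(enumerate_paths(reads_only, "ATG", "ATC", 30))
# Reads plus reference: the reference haplotype appears as a second path (a bubble).
print(enumerate_paths(with_reference, "ATG", "ATC", 30))
```

Either way, the reads that enter the graph were selected by the mapper, so the candidate haplotypes are limited to what mapped to that region in the first place.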
I must agree with you both. I've updated my answer to be more precise about what GATK states and how I personally have always thought of it. Thank you for the clarification.