Hello All,
I have a variant that is called by both VarScan and GATK, but when I look for it in IGV I cannot find it.
Example: I have a variant at chr1:20006 T>G in a gene, supported at a frequency of 10% by both GATK and VarScan, but when I view the same sample in IGV I cannot find the reads supporting the variant allele G. It would be great if someone could share their thoughts on this, or suggest an alternative way to look into it. This happened with a few genes.
In both IGV and the BAM file I can see the reads of interest; it is only the variant that I cannot see in IGV.
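One way to check the supporting reads independently of IGV is to count the base calls at the position directly, for example from the read-bases column (column 5) of `samtools mpileup -r chr1:20006-20006 -f ref.fa sample.bam`. The following is only a minimal sketch: the file names in that command are placeholders, the function names are hypothetical, and the pileup string below is made up for illustration rather than taken from the sample in question.

```python
import re
from collections import Counter


def count_pileup_bases(read_bases, ref_base):
    """Count base calls in one samtools mpileup read-bases column."""
    counts = Counter()
    ref_base = ref_base.upper()
    i = 0
    while i < len(read_bases):
        ch = read_bases[i]
        if ch == "^":
            # Read-start marker; the next character encodes mapping quality.
            i += 2
            continue
        if ch in "+-":
            # Indel annotation: +<len><seq> or -<len><seq>; skip it entirely.
            m = re.match(r"\d+", read_bases[i + 1:])
            if m is None:
                i += 1
                continue
            i += 1 + len(m.group()) + int(m.group())
            continue
        if ch in ".,":
            # '.' and ',' mean a match to the reference (forward / reverse strand).
            counts[ref_base] += 1
        elif ch.upper() in "ACGTN":
            # A letter means a mismatch; upper = forward, lower = reverse strand.
            counts[ch.upper()] += 1
        # '$' (read end), '*' (deleted base) and '<' / '>' (reference skip)
        # are not counted as base calls in this sketch.
        i += 1
    return counts


def allele_fraction(counts, allele):
    """Fraction of counted base calls that carry the given allele."""
    depth = sum(counts.values())
    return counts[allele] / depth if depth else 0.0


# Made-up pileup column for a T>G site (reference base T).
example = "..,,..,,G^].,$.,+2AT..,,g.,"
counts = count_pileup_bases(example, "T")
print(dict(counts), allele_fraction(counts, "G"))
```

If the G count from the BAM is close to what the callers report, the reads are there and the difference is more likely in how they are displayed or filtered; if it is zero, the callers are probably working from a different alignment of the reads than the one in the BAM.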
Can you rephrase the sentence? Do you mean that you can see the sample's reads in IGV, but the variants reported by the tools are missing at that location? Is that it?
If that is the case, then yes: GATK performs realignment, so you should look at the realigned reads.
http://gatkforums.broadinstitute.org/discussion/1235/i-expect-to-see-a-variant-at-a-specific-site-but-its-not-getting-called
https://www.broadinstitute.org/gatk/guide/article?id=5484
I had the same problem a long time ago, but I did not want the realignment and only needed a simple haplotype caller to capture indels. Since I did not have time to look into the GATK solutions at that point, I used a simple haplotype-based caller called FreeBayes (not an alignment-based variant caller). It worked perfectly well for my PARCLIP (kind of) data and captured T -> C transitions very well (Figure 1C in the article below). I have also tested this tool on different datasets (not yet published), and it works well for SNPs.
Just for your reference:
This pipeline was used in our recent article in Nature Communications:
Since you have already tried two different tools, you could try this one as well; it is simple to use. https://github.com/ekg/freebayes
Thank you, and yes, I can use it. My only requirement is to get variants from targeted regions of the genome, so I ran two tools in order to get better sensitivity.
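Combining two callers for sensitivity comes down to comparing their call sets: the union gives the most sensitive list, and the intersection gives the calls both tools agree on. A minimal sketch, assuming both tools wrote standard tab-separated VCF; the function name is hypothetical and the records below are made up for illustration:

```python
def variant_keys(vcf_lines):
    """Return a set of (chrom, pos, ref, alt) keys from VCF text lines."""
    keys = set()
    for line in vcf_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and header lines
        chrom, pos, _id, ref, alt = line.split("\t")[:5]
        for allele in alt.split(","):
            # Multi-allelic records are split into one key per ALT allele.
            keys.add((chrom, int(pos), ref, allele))
    return keys


# Made-up example records, not real calls.
gatk_lines = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "chr1\t20006\t.\tT\tG\t50\tPASS\t.",
    "chr1\t30500\t.\tA\tC\t60\tPASS\t.",
]
varscan_lines = [
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "chr1\t20006\t.\tT\tG\t.\tPASS\t.",
    "chr2\t1500\t.\tC\tT,A\t.\tPASS\t.",
]

gatk_calls = variant_keys(gatk_lines)
varscan_calls = variant_keys(varscan_lines)

called_by_both = gatk_calls & varscan_calls    # higher-confidence calls
called_by_either = gatk_calls | varscan_calls  # most sensitive list

print(sorted(called_by_both))
print(len(called_by_either))
```

This compares exact (chrom, pos, ref, alt) matches only, so indels that the two tools represent differently would need normalization before they count as the same call.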
Can you rephrase the sentence? Do you mean that you can see the sample's reads in IGV, but the variants reported by the tools are missing at that location? Is that it?
-- It's yes and no. I got a variant at a specific position from both variant callers (GATK and VarScan), and I just wanted to view it in IGV for the same sample at the same locus. IGV shows reads covering that position, but not the variant reported by the tools. As you mentioned, base recalibration is a step I also followed from the GATK pipeline, but recalibration does not apply to VarScan.
OK, then try this tool and let us know if you still cannot solve it.