Question

Base Quality Score Recalibration with different reference genome

0

Entering edit mode

7 months ago

dtnondorf • 0

I am working with nonmodel systems, so I have followed GATK's workflow for using high confidence SNPs discovered in the first round of SNP calling as the SNP panel to aid base quality score recalibration. This is followed by a second round of SNP calling using the recalibrated bam files. However, I am now analyzing SNPs in a sister species to the focal species that has the annotated reference genome. I know that BQSR looks for mismatches with the reference and adjusts quality scores while accounting for the SNP panel I provide. My main questions are the following:

What kind of biases could be introduced during BQSR when the reference genome is from a sister taxa to the samples?

and

Should I be more lenient when building my high confidence SNP set since I should expect more mismatches between the samples and the reference genome during BQSR?

Let me know what you think! Happy to elaborate if I've left important information out. Thanks for the help!

SNPs GATK BQSR RNA-seq • 446 views

ADD COMMENT • link 7 months ago by dtnondorf • 0

0

Entering edit mode

See answer here: Base recalibration in normal vs. tumor somatic variant calling in WXS data?

Heng Li said:

IMO, if your data is not more than 6-8 years old, BQSR is not necessary most of time. Some combination of callers and BQSR may even lead to unexplained results.
— Heng Li (@lh3lh3) June 30, 2021