Question

BQSR on one sample vs. all samples in a run

0

7.6 years ago

ari.nazarian ▴ 10

I'm trying to run BQSR on a sample from a MiSeq run, but I received an error saying that I need to add read groups. Since this sample was one of 40-something samples multiplexed in a single run, should I merge all the samples into one BAM before running BQSR, so that BQSR is more accurate/effective? I read the following in the GATK article on read groups, which prompted my question:

Use for BQSR: ID is the lowest denominator that differentiates factors contributing to technical batch effects: therefore, a read group is effectively treated as a separate run of the instrument in data processing steps such as base quality score recalibration, since they are assumed to share the same error model.

BSQR miseq • 2.1k views

updated 7.6 years ago by cpad0112 21k • written 7.6 years ago by ari.nazarian ▴ 10

score 1 · Answer 1 · 2017-04-14

7.6 years ago

cpad0112 21k

As I understand it, the error only means that your BAM needs read group (RG) information added to it. But you are really asking a different question: does BQSR behave differently with a single sample in one RG versus multiple samples in one RG?
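
For the first part (adding the RG field), here is a minimal sketch of how one might assemble a Picard AddOrReplaceReadGroups command for one sample of the run. The function name, file names, sample and run labels, and the `run.sample` naming scheme for RGID and RGPU are all hypothetical and only illustrate the idea; the Picard arguments themselves (I, O, RGID, RGSM, RGLB, RGPL, RGPU) are the tool's standard ones. The sketch only builds the command and does not run it.

```python
def build_add_rg_command(in_bam, out_bam, sample, run_id,
                         library=None, picard_jar="picard.jar"):
    """Return an argument list for Picard AddOrReplaceReadGroups.

    All values passed in are placeholders chosen by the caller; the
    RGID/RGPU naming scheme below is an assumption, not a requirement.
    """
    # Assumption: fall back to a per-sample library name if none is given.
    library = library or f"{sample}_lib"
    return [
        "java", "-jar", picard_jar, "AddOrReplaceReadGroups",
        f"I={in_bam}",                # input BAM lacking read groups
        f"O={out_bam}",               # output BAM with the RG added
        f"RGID={run_id}.{sample}",    # read group ID (hypothetical scheme)
        f"RGSM={sample}",             # sample name
        f"RGLB={library}",            # library name
        "RGPL=ILLUMINA",              # platform (MiSeq is an Illumina instrument)
        f"RGPU={run_id}.{sample}",    # platform unit (hypothetical scheme)
    ]


if __name__ == "__main__":
    cmd = build_add_rg_command(
        "sample01.bam", "sample01.rg.bam", "sample01", "miseq_run1"
    )
    # Print the command for inspection instead of executing it.
    print(" ".join(cmd))
```

Printing the command rather than running it makes it easy to check the RG values before touching any data. Whether each multiplexed sample should get its own RG ID is exactly the open question above, so treat the ID scheme here as a placeholder.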
