GATK recalibration, duplicates & realignment
1
0
Entering edit mode
9.5 years ago
alons ▴ 270

Hi all, I'm working on a variant calling pipeline based on the following link:

http://www.htslib.org/workflow/

Now, in the "Improvement" section, which mainly uses GATK, it says that I should realign the bam file and then recalibrate and mark the duplicates. What is unclear to me is which bam file is used as input for each step. For example, is the realigned bam file the input of the recalibration step?
I should note that I'm using a single bam file, 1 library.

Thank you, Alon

alignment gatk realignment bam ngs • 3.8k views
ADD COMMENT
2
Entering edit mode
9.5 years ago

You are correct in your reading. The steps described are run serially with the output BAM being the input BAM to the next step.

ADD COMMENT
0
Entering edit mode

Thank you!

A follow up question, though: in the same section they recommend another realignment, after the improvement steps (initial realignment, recalibration and marking of duplicates).

Is it really necessary if I have only one bam (no merging of several bam files)?

ADD REPLY

Login before adding your answer.

Traffic: 2595 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6