Odd alignment question/finding

0

Entering edit mode

11 months ago

poordumbsillyidiot • 0

Hi, I am working this pipeline for grad school, and I noticed a thing I can't really explain.

I simulated two fastq samples, ran them through an alignment pipeline (fastp to bwa mem), and got alignment stats from qualimap (pictures below) Then, I combined the two samples into one sample using cat, and ran it through the same pipeline. When I did that, none of the alignment stats added up. For example, there were less unmapped reads when combining the two samples than when you ran the two samples on their own. i have no idea why.

Here are the qualimap outputs for the three runs. I can give any other details if needed. Thanks!

qualimap of first sample ran on its own

qualimap of second sample ran on its own

qualimap of both samples catted together

Alignment • 671 views

ADD COMMENT • link updated 11 months ago by barslmn ★ 2.4k • written 11 months ago by poordumbsillyidiot • 0

1

Entering edit mode

bwa mem is not fully deterministic, see from the developer: BWA mem output inconsistent on same but re-ordered FASTQ input

ADD REPLY • link 11 months ago by ATpoint 87k

0

Entering edit mode

I can't come up with an explanation, but two things you can try for debugging are: 1) look at the fastp stats - do they add up and make sense? 2) scan the bam files for reads that changed their status between runs. Also, please include the fastp and bwa.commands you used.