Explanation of dimension differences between bam files (BWA aln and BWA mem)
0
0
Entering edit mode
2.8 years ago
qstefano ▴ 20

Hello everyone,

I want to compare the output .bam files generated by two different algorithms: BWA aln and mem, in order to replace the old code (aln) with the new one (mem)

For aln alignment I ran the following command:

bwa aln -o 1 -n 0.01 -l 200 -e 12 -d 12 -t 30 'ref.fa' 'pair_1.fastq' 'pair_2.fastq.gz' | samtools sort -@30 -o out.aln.bam 

For mem:

bwa mem -t 30 -M 'ref.fa' 'pair_1.fastq'  'pair_2.fastq' | samtools sort -@30 -o out.mem.bam -

The dimension of out.aln.bam resulted in 24689 MB. The dimension of out.mem.bam resulted in 31069 MB.

Considering that the input fastq files are the same, how can I explain this huge difference in dimensions? Is it due to options in aln?

Thanks in advice

R RNA-Seq bwa gene alignment • 743 views
ADD COMMENT
0
Entering edit mode

Rather than comparing the file sizes, you should compare the alignment statistics.

ADD REPLY

Login before adding your answer.

Traffic: 2334 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6