Minimap2 giving 2 TB sam file as an output
0
0
Entering edit mode
5.0 years ago

Hello,

I have been trying to run Minimap2 with fasta files of ~200Mb. I have 3 such files which I am trying to map onto a reference genome. The output sam file of first 2 files are about 500MB. But the third file gives a 2TB sam file and it has not completed running. Is there any reason of why this could be happening ? Any help would be appreciated.

The command I used is as follows:

minimap2 -a reference.mmi query.fasta -o output.sam -t 20
sequence minimap2 alignment • 5.1k views
ADD COMMENT
1
Entering edit mode

You could consider avoiding creating a sam intermediate:

minimap2 -a -t 20 reference.mmi query.fasta | samtools sort -o output.bam
ADD REPLY
0
Entering edit mode

Thanks for your answer. But I am still confused as the other 2 files of the same size gave the output of around 500Mb and this gives 2TB.

ADD REPLY
1
Entering edit mode

Perhaps you have a lot of multi-mapping or secondary alignments?

ADD REPLY
0
Entering edit mode

Thanks for your reply. Is there anyway with which I can restrict the mapping to only the top hit or something like that ? Are there any parameters in Minimap to do so ?

ADD REPLY
1
Entering edit mode

You can set -N 0 to not retain secondary alignments.

ADD REPLY
0
Entering edit mode

Why do you tag this as "assembly" while you are aligning? Why do you think this is a "software error"? Wouldn't "minimap2" be the best tag here?

ADD REPLY
0
Entering edit mode

Noted. Thank you for mentioning this

ADD REPLY

Login before adding your answer.

Traffic: 1908 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6