Entering edit mode
3.7 years ago
alicecol
▴
20
Hi, I am getting a segmentation error while using muscle to align a fasta file containing 427 sequences. I have a another fasta that is also 427 sequences for a different gene that muscle works on without problem although the average sequence length in that file is a bit shorter.
Here is the typical input I use:
muscle -in bt_18s.fasta -out bt_18s.fasta.aln
And here's the output I get from running that line of code:
MUSCLE v3.8.1551 by Robert C. Edgar
http://www.drive5.com/muscle
This software is donated to the public domain.
Please cite: Edgar, R.C. Nucleic Acids Res 32(5), 1792-97.
bt_18s 427 seqs, lengths min 100, max 10877, avg 1744
00:00:02 28 MB(15%) Iter 1 100.00% K-mer dist pass 1
00:00:02 28 MB(15%) Iter 1 100.00% K-mer dist pass 2
00:01:42 1502 MB(100%) Iter 1 100.00% Align node
00:01:42 1502 MB(100%) Iter 1 100.00% Root alignment
00:03:27 1502 MB(100%) Iter 2 100.00% Refine tree
00:03:27 1502 MB(100%) Iter 2 100.00% Root alignment
00:03:27 1502 MB(100%) Iter 2 100.00% Root alignment
Segmentation fault100%) Iter 3 0.59% Refine biparts
Can anyone point in me in the right direction as to how to address this issue?
How much memory do you have? What is the size of the input dataset? Seg fault likely points to running out of memory.
The file is 761 kb and I've been running it interactively with 32GB of allocated memory. I suspect you're right so I'm going to try it again with 96GB.