What's the preferred method for running multiple sequence alignment on such a large amount of protein sequences? I'm trying something fairly experimental and running MSA would be really helpful in the approach.
I usually use muscle and noticed there is a super5 module that helps with this: https://drive5.com/muscle5/manual/cmd_super5.html
How can I adjust the parameters to help out with running out of memory? Alternatively, is there another tool that's better suited for this? Basically, I want a fasta MSA for the output.
Hi, take a look here : MAFFT
I should have mentioned that some of the sequences are long. There are a few that are ~70k. I've trimmed them out and it's working now but I'll keep MAFFT in the back of my in case this fails.