Entering edit mode
5.6 years ago
fhsantanna
▴
620
I have 11,000 protein alignments and I have to blast each one to 20,000 bacterial genomes using psiblast. How could I speed up this process? Is it possible using more than one computer for it using gnu parallel? Thanks.
https://www.gnu.org/software/parallel/parallel_tutorial.html#Remote-execution
For this purpose, how do I share the database? Does the host do it automatically?
I have no idea, and I've never done anything like this.
check the --transfer option in the link above.
What kind of compute resources do you have access to?
At the time of posting the question, I had three Desktop computers having 20 threads in total (4, 8, and 8 threads). Fortunately, after that, we bought another pc with SSD HD and 16 threads, and I managed to do all blasts using parallel in one week, using only this new computer.