Dear Community Members,
I am currently running AlphaFold jobs on a high-performance computing (HPC) cluster to predict hypothetical protein structures. For sequences ranging from 100 to 500 residues, I have successfully obtained five unrelaxed models per protein within a 24-hour job using 4 GPUs, 16 CPUs per task, and 200 GB of memory.
However, I am encountering issues with longer sequences, specifically those between 1000 to 2000 residues. Despite increasing the memory allocation to 500 GB, I have not been able to produce any models for these longer sequences.
Could anyone provide insights or suggestions on how to successfully run AlphaFold for proteins with longer sequences? Are there additional parameters or resources that I should consider adjusting?
Thank you for your assistance.
Hey thanks for responding. Could you take a look and tell what is wrong in the error file:
There is no error in that file. Closer to the top, which you are not showing, it will tell you which device is used for folding.