I am performing a 3-class Ab Initio reconstruction on ~200k particles with the “HR-HAIR” method proposed by @olibclarke (https://www.biorxiv.org/content/10.1101/2025.09.08.674935v1). It’s giving great results. (Shoutout to Clarke Lab
) I’m having some compute issues however. The expected time to complete the job (~16k iterations) is about 9 days, which seems off.
When I run nvidia-smi command within the GPU that is running my Ab Initio job, I noticed that the GPU does not seem to be utilized. I noticed that other people have noticed this strange GPU allocation behavior (1, 2). I’ve increased the number of CPUs and RAM requested, and the job runs fine. However, it’s taking too long to run to complete. (Maximum request time for a GPU is only 7 days.) Is there a fix to this? Thank you all for the help.
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.163.01 Driver Version: 550.163.01 CUDA Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA A40 On | 00000000:46:00.0 Off | 0 |
| 0% 40C P0 71W / 300W | 275MiB / 46068MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 52805 C python 266MiB |
+-----------------------------------------------------------------------------------------+