Kill signal sent by CryoSPARC (ID: <Heartbeat Monitor>)

Dear wtempel,

This is the output of nvidia-smi on our computer cryo:

(base) santiago@cryo:~$ nvidia-smi
Tue Oct 22 21:28:04 2024
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.183.01             Driver Version: 535.183.01   CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  NVIDIA GeForce RTX 4090        Off | 00000000:41:00.0  On |                  Off |
|  0%   27C    P8              16W / 450W |    288MiB / 24564MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+
|   1  NVIDIA GeForce RTX 4090        Off | 00000000:42:00.0 Off |                  Off |
|  0%   29C    P8              10W / 450W |     21MiB / 24564MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+

+---------------------------------------------------------------------------------------+
| Processes:                                                                            |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|    0   N/A  N/A      2505      G   /usr/lib/xorg/Xorg                           41MiB |
|    0   N/A  N/A      5652      G   /usr/lib/xorg/Xorg                           88MiB |
|    0   N/A  N/A      5946      G   /usr/bin/gnome-shell                         36MiB |
|    1   N/A  N/A      2505      G   /usr/lib/xorg/Xorg                            4MiB |
|    1   N/A  N/A      5652      G   /usr/lib/xorg/Xorg                            4MiB |
+---------------------------------------------------------------------------------------+

Thanks for posting the nvidia-smi output @sanjusare. I am still not sure what the problem might be. The output of the following commands, run on cryo as cryosparc_user, may (or, unfortunately, may not) provide more insight:

/home/cryosparc_user/cryosparc/cryosparc_worker/bin/cryosparcw call env | grep PATH
/home/cryosparc_user/cryosparc/cryosparc_worker/bin/cryosparcw call python -c "import torch; print(f'Torch version {torch.__version__}\nTorch CUDA available? {torch.cuda.is_available()}')"
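
If the quoting of the one-liner above gets mangled by the shell, the same checks could be written as a small script. This is only a rough sketch, not part of CryoSPARC: the function names path_entries and torch_report are made up for illustration, and the extra fields it prints (the CUDA version torch was built against, the device names) go slightly beyond what the two commands above report. It assumes the script is run with the worker's Python so that it sees the same environment.

```python
import importlib
import os


def path_entries(var="PATH", env=None):
    """Return the non-empty entries of a PATH-like environment variable."""
    env = os.environ if env is None else env
    return [p for p in env.get(var, "").split(os.pathsep) if p]


def torch_report():
    """Collect torch/CUDA facts in a dict; does not raise if torch fails to import."""
    report = {"torch_importable": False}
    try:
        torch = importlib.import_module("torch")
    except Exception as exc:  # a broken install may raise more than ImportError
        report["error"] = f"{type(exc).__name__}: {exc}"
        return report
    report["torch_importable"] = True
    report["torch_version"] = torch.__version__
    report["torch_cuda_build"] = torch.version.cuda  # None for CPU-only builds
    report["cuda_available"] = torch.cuda.is_available()
    if report["cuda_available"]:
        count = torch.cuda.device_count()
        report["devices"] = [torch.cuda.get_device_name(i) for i in range(count)]
    return report


if __name__ == "__main__":
    # Same information as `env | grep PATH`, one entry per list element.
    for var in ("PATH", "LD_LIBRARY_PATH"):
        print(var, path_entries(var))
    for key, value in torch_report().items():
        print(f"{key}: {value}")
```

Saved under a name of your choosing (say check_torch.py, a hypothetical filename), it should be runnable through the same wrapper that runs the one-liner above, i.e. cryosparcw call python followed by the path to the script. If torch cannot be imported at all, the error line in the output would be the interesting part to post here.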