Ab-initio stop without error message

vitorserrao · November 4, 2021, 3:24am

Hi there,

I had some issues during the Ab-initio reconstruction that suddenly failed but I have no error message. I was digging into the discussion, and I found some other users that have faced a similar issue using RTX3090. Any idea what it could be and how to fix it?

Thanks,

cryosparcm joblog P5 J20
========= sending heartbeat
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
========= sending heartbeat
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty
**** handle exception rc
set status to failed
========= main process now complete.
========= monitor process now complete.
Waiting for data… (interrupt to abort)

stephan · November 5, 2021, 4:21am

Hi @vitorserrao,

What version of CUDA did you install cryoSPARC with? Can you also paste the output of the file cryosparc_worker/config.sh and the command nvidia-smi?

vitorserrao · November 5, 2021, 4:43am

Hi @stephan,

I used cuda-11.1 and cryosparc v3.2.0. Below are the outputs:

config.sh:

export CRYOSPARC_LICENSE_ID=“OUR_CORRECT_LICENSE_NUMBER”
export CRYOSPARC_USE_GPU=true
export CRYOSPARC_CUDA_PATH="/usr/local/cuda"
export CRYOSPARC_DEVELOP=false

stephan · November 5, 2021, 2:40pm

Thanks for your response. CUDA looks good.
Can you try this job on a different GPU or different dataset? Does the ab-initio job die every time?

vitorserrao · November 5, 2021, 6:29pm

It worked well on the T20 tutorial that I used to check the installation. Either cryosparc or cryosparc-live worked nicely and I haven’t had any issues. It’s only dying with this particular dataset (until now). I also tried different initial models (1; 2; 4 and 6) and it’s always dying without an error message.

stephan · November 5, 2021, 6:32pm

Hi @vitorserrao,

Thanks for your response. Can you provide screenshots or copy+paste the outputs of the “Overview” tab of the dead Ab-Initio job, that includes where it dies?

vitorserrao · November 5, 2021, 6:51pm

Hi @stephan,

Sure, this is the final part of it. Looking at cryosparcm joblog there was no apparent error message, and I just found it after a very long search on the overview (I should have found this earlier, my apologies).
It did run for a while, and then I get this message, however, there is still free space on the cards and disk.

stephan · January 5, 2022, 7:59pm

Hi @vitorserrao,

Can you add export CRYOSPARC_NO_PAGELOCK=true to cryosparc_worker/config.sh, then re-run the job?

vitorserrao · January 5, 2022, 8:13pm

Hi @stephan,

I already figured it out. We had some memory allocation problems that were already solved.
Thanks anyway!

Vitor

nikydna · August 11, 2023, 7:33pm

Hi @vitorserrao

How did you solve memory allocation problems??

vitorserrao · August 11, 2023, 11:27pm

Hi @nikydna,

It has been awhile, but I think I downsampled the particles and restarted the job.