Versions running:
cryosparc: 3.3.1, using python 3.7
topaz: variable (I’ve tried 0.2.3, 0.2.4, and 0.2.5)
CUDA: 10.2
I am trying to run a Topaz Train job on cryosparc and the job is failing at the stage of checking the version of topaz. The error message reads as follows:
[CPU: 227.4 MB] Traceback (most recent call last):
File "cryosparc_worker/cryosparc_compute/run.py", line 85, in cryosparc_compute.run.main
File "/home/xxxxxlab/cryosparc/cryosparc_worker/cryosparc_compute/jobs/topaz/run_topaz.py", line 115, in run_topaz_wrapper_train
topaz_version = utils.get_topaz_version(topaz_exec_path)
File "/home/xxxxxlab/cryosparc/cryosparc_worker/cryosparc_compute/jobs/topaz/topaz_utils.py", line 126, in get_topaz_version
f'Cannot determine topaz version, command "{topaz_exec_path} --version" did not produce valid output: "{topaz_version}"'
AssertionError: Cannot determine topaz version, command "/home/xxxxxlab/anaconda3/envs/topaz7/bin/topaz --version" did not produce valid output: "ImportError: /home/xxxxxlab/anaconda3/envs/topaz7/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so: undefined symbol: iJIT_NotifyEvent"
Running the command "/home/xxxxxlab/anaconda3/envs/topaz7/bin/topaz --version" directly on the command line returns the following:
File "/home/xxxxxlab/anaconda3/envs/topaz7/bin/topaz", line 33, in <module>
sys.exit(load_entry_point('topaz-em==0.2.5', 'console_scripts', 'topaz')())
File "/home/xxxxxlab/anaconda3/envs/topaz7/lib/python3.7/site-packages/topaz/main.py", line 60, in main
import topaz.commands.train
File "/home/xxxxxlab/anaconda3/envs/topaz7/lib/python3.7/site-packages/topaz/commands/train.py", line 12, in <module>
import torch
File "/home/xxxxxlab/anaconda3/envs/topaz7/lib/python3.7/site-packages/torch/__init__.py", line 202, in <module>
from torch._C import *  # noqa: F403
ImportError: /home/xxxxxlab/anaconda3/envs/topaz7/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so: undefined symbol: iJIT_NotifyEvent
Previously I had a different issue, with topaz 0.2.5 train jobs failing at a later step (unable to find the image_list_train.txt file), so I’ve tried installing older versions of topaz to fix that. Topaz version 0.2.3 has worked for us before, I believe with this same version of cryosparc. I’m not sure what is different about how I’ve installed it this time, but everything I’ve tried has resulted in the above error.
The path to the topaz executable is /home/xxxxxlab/anaconda3/envs/topaz7/bin/topaz
“topaz7” is the name of the environment I created; there have been several attempts using different versions of topaz and python.
When I make the environment, I have tried specifying python versions 2.7, 3.6, and 3.7. Python 2.7 did not work at all, but the other two at least allowed the install to complete.
When I install topaz, I specify either version 0.2.3 or 0.2.5 (0.2.3 has worked historically and 0.2.5 has worked for us outside of cryosparc), plus CUDA toolkit for version 10.2. See example below:
conda create -n topazX python=3.7
conda activate topazX
conda install topaz=0.2.3 cudatoolkit=10.2 -c tbepler -c pytorch
I’ve also tried adding -c nvidia to the end of the install command, but got the same result.
All combinations of topaz/python I’ve tried have resulted in the same error message, so I think something else must be wrong.
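One way to narrow down what differs between a working environment and a failing one would be to compare their package lists. Below is a rough sketch, assuming each list was saved with "conda list -n ENVNAME --export", which prints one name=version=build line per package. The function names are my own and purely illustrative.

```python
def parse_conda_export(text):
    """Map package name -> version from `conda list --export` text."""
    packages = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and the comment header
        parts = line.split("=")
        if len(parts) >= 2:
            packages[parts[0]] = parts[1]
    return packages


def diff_envs(working_text, failing_text):
    """Return {name: (working_version, failing_version)} for packages that differ.

    A version of None means the package is absent from that environment.
    """
    working = parse_conda_export(working_text)
    failing = parse_conda_export(failing_text)
    diffs = {}
    for name in sorted(set(working) | set(failing)):
        if working.get(name) != failing.get(name):
            diffs[name] = (working.get(name), failing.get(name))
    return diffs
```

Any package that shows up in the result with two different versions, particularly one that pytorch links against, would be a candidate for pinning to the version from the working environment.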