3D Flex Train error

Hi,

I ran into the error below in a 3D Flex Training job.
ImportError: /scem/work/niu/cryosparc/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/torch/lib/libtorch_cuda.so: undefined symbol: ncclRedOpDestroy

Does anyone have a solution? Thanks.
My version is v4.1.1+230110, and no errors were reported when installing the 3D Flex dependencies.

niu

Traceback (most recent call last):
File "cryosparc_master/cryosparc_compute/run.py", line 80, in cryosparc_compute.run.main
File "/scem/work/niuff/cryosparc/cryosparc_worker/cryosparc_compute/jobs/jobregister.py", line 443, in get_run_function
runmod = importlib.import_module("…"+modname, __name__)
File "/scem/work/niuff/cryosparc/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/importlib/__init__.py", line 127, in import_module
return _bootstrap._gcd_import(name[level:], package, level)
File "<frozen importlib._bootstrap>", line 1014, in _gcd_import
File "<frozen importlib._bootstrap>", line 991, in _find_and_load
File "<frozen importlib._bootstrap>", line 975, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 671, in _load_unlocked
File "<frozen importlib._bootstrap_external>", line 1174, in exec_module
File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
File "cryosparc_master/cryosparc_compute/jobs/flex_refine/run_train.py", line 12, in init cryosparc_compute.jobs.flex_refine.run_train
File "cryosparc_master/cryosparc_compute/jobs/flex_refine/flexmod.py", line 19, in init cryosparc_compute.jobs.flex_refine.flexmod
File "/scem/work/niuff/cryosparc/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/torch/__init__.py", line 229, in <module>
from torch._C import * # noqa: F403
ImportError: /scem/work/niu/cryosparc/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/torch/lib/libtorch_cuda.so: undefined symbol: ncclRedOpDestroy

Please can you run the commands in 3D Flex Training error (library not found) - #4 by wtempel and post their outputs.
You will have to adjust the initial variable definition:

csw=/scem/work/niuff/cryosparc/cryosparc_worker/bin/cryosparcw
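
In the meantime, here is a rough sketch (not the commands from the linked post) of how one might check whether a given NCCL library exports the symbol that libtorch_cuda.so cannot find. An "undefined symbol" error of this kind often, though not always, means that an older libnccl is being picked up at load time; as far as I know, ncclRedOpDestroy is only present in fairly recent NCCL releases. The helper name exports_symbol is made up for illustration, and the use of find_library to locate libnccl is an assumption about your system: it may return nothing even when a copy exists somewhere on your library path.

```python
import ctypes
import ctypes.util


def exports_symbol(library, symbol):
    """Report whether the shared library `library` exports `symbol`.

    Returns True or False, or None if the library itself cannot be loaded.
    (Hypothetical helper for illustration; not part of CryoSPARC or PyTorch.)
    """
    try:
        lib = ctypes.CDLL(library)
    except OSError:
        return None
    # Attribute access on a CDLL looks the symbol up in the loaded library
    # and raises AttributeError when it is missing; hasattr() maps that to False.
    return hasattr(lib, symbol)


if __name__ == "__main__":
    # Assumption: the NCCL copy of interest is discoverable under the name "nccl".
    # Replace `name` with an explicit path to test one specific libnccl file.
    name = ctypes.util.find_library("nccl")
    if name is None:
        print("No libnccl found through the default search")
    else:
        print(name, "exports ncclRedOpDestroy:",
              exports_symbol(name, "ncclRedOpDestroy"))
```

The result depends on the environment the script runs in, so it is most informative when run with the same Python and library path that the worker uses. A False result for a particular libnccl would suggest, but not prove, that this copy is too old for the installed torch build.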