We have updated to v2.11, finally gotten the pycuda driver to build, but we are unable to run jobs.
Job output shows the following.
Traceback (most recent call last):
File "cryosparc2_worker/cryosparc2_compute/run.py", line 69, in cryosparc2_compute.run.main
File "cryosparc2_compute/jobs/jobregister.py", line 308, in get_run_function
runmod = importlib.import_module(".."+modname, __name__)
File "/local-home/repository/cryosparcv2/cryosparc2_worker/deps/anaconda/lib/python2.7/importlib/__init__.py", line 37, in import_module
__import__(name)
File "cryosparc2_worker/cryosparc2_compute/jobs/abinit/run.py", line 15, in init cryosparc2_compute.jobs.abinit.run
File "cryosparc2_compute/engine/__init__.py", line 8, in <module>
from engine import *
File "cryosparc2_worker/cryosparc2_compute/engine/engine.py", line 4, in init cryosparc2_compute.engine.engine
ImportError: No module named pycuda.driver
Which tracks with our attempting to run cryosparcv2_worker/bin/connect command and seeing the “no module named pycuda.driver” error.
cryosparcm status shows command_core, command_proxy, command_vis and webapp all running, with the following as stopped - app app_dev, command_rtp, watchdog_dev and webapp_dev.
I don’t know if any of the stopped jobs are needed.
I believe the issue is loading pycuda, and when I change the PYTHONPATH variable in the bin/cryosparcw script it seems to load. But other modules then fail, or they fail sooner and we don’t get there but I did get the PATH corrected and we are now finding nvcc.
trying the /bin/cryosparcw newcuda command I’ve prevented loading of the other modules and cause nvcc to not be found.
I don’t know enough about PYTHON, or cryosparc (I’m OS support, not a member of the department and not a crypsparc user nor prior installer) so I do need some guidance.
thanks in advance,
Brian