Thanks, but the installation is definitely not okay. The validation tests fail:
> cryosparcm test workers P1 --test gpu --test-pytorch
Using project P1
Specifying gpu test
Enabling PyTorch test
Running worker tests...
Traceback (most recent call last):
File "/home/cryosparc_user/V3.X/cryosparc_master/deps/anaconda/envs/cryosparc_master_env/lib/python3.8/runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/home/cryosparc_user/V3.X/cryosparc_master/deps/anaconda/envs/cryosparc_master_env/lib/python3.8/runpy.py", line 87, in _run_code
exec(code, run_globals)
File "/home/cryosparc_user/V3.X/cryosparc_master/cryosparc_command/instance_tests/worker_test.py", line 309, in <module>
execute_tests(args.project, test_type, args.targets, log_level, args.test_tensorflow, args.test_pytorch)
File "/home/cryosparc_user/V3.X/cryosparc_master/cryosparc_command/instance_tests/worker_test.py", line 176, in execute_tests
workspace_uid = get_testing_workspace(project_uid, cli)
File "/home/cryosparc_user/V3.X/cryosparc_master/cryosparc_command/instance_tests/worker_test.py", line 60, in get_testing_workspace
workspace_uid = cli.create_empty_workspace(
File "/home/cryosparc_user/V3.X/cryosparc_master/cryosparc_tools/cryosparc/command.py", line 112, in func
assert "error" not in res, f'Error for "{key}" with params {params}:\n' + format_server_error(res["error"])
AssertionError: Error for "create_empty_workspace" with params {'project_uid': 'P1', 'created_by_user_id': 'instance_tester', 'title': 'Instance Testing on 2023-03-02 00:50:35.098922'}:
ServerError: validation error: lock file for P1 not found at /home/cryosparc_user/P1/cs.lock
Traceback (most recent call last):
File "/home/cryosparc_user/V3.X/cryosparc_master/cryosparc_command/commandcommon.py", line 200, in wrapper
res = func(*args, **kwargs)
File "/home/cryosparc_user/V3.X/cryosparc_master/cryosparc_command/command_core/__init__.py", line 4432, in create_empty_workspace
assert check_project_exists(project_uid), f"Project {project_uid} does not exist."
File "/home/cryosparc_user/V3.X/cryosparc_master/cryosparc_command/commandcommon.py", line 191, in wrapper
return func(*args, **kwargs)
File "/home/cryosparc_user/V3.X/cryosparc_master/cryosparc_command/commandcommon.py", line 251, in wrapper
assert os.path.isfile(
AssertionError: validation error: lock file for P1 not found at /home/cryosparc_user/P1/cs.lock
A simple 2D classification job, which ran fine previously, also fails:
[CPU: 2.04 GB Avail: 121.09 GB]
Traceback (most recent call last):
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/tools.py", line 429, in context_dependent_memoize
return ctx_dict[cur_ctx][args]
KeyError: <pycuda._driver.Context object at 0x7ff8882f14a0>
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "cryosparc_master/cryosparc_compute/run.py", line 96, in cryosparc_compute.run.main
File "cryosparc_master/cryosparc_compute/jobs/class2D/run.py", line 336, in cryosparc_compute.jobs.class2D.run.run_class_2D
File "cryosparc_master/cryosparc_compute/engine/engine.py", line 964, in cryosparc_compute.engine.engine.process
File "cryosparc_master/cryosparc_compute/engine/engine.py", line 974, in cryosparc_compute.engine.engine.process
File "cryosparc_master/cryosparc_compute/engine/cuda_core.py", line 156, in cryosparc_compute.engine.cuda_core.allocate_gpu
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/gpuarray.py", line 549, in fill
func = elementwise.get_fill_kernel(self.dtype)
File "<decorator-gen-13>", line 2, in get_fill_kernel
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/tools.py", line 433, in context_dependent_memoize
result = func(*args)
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/elementwise.py", line 493, in get_fill_kernel
return get_elwise_kernel(
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/elementwise.py", line 162, in get_elwise_kernel
mod, func, arguments = get_elwise_kernel_and_types(
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/elementwise.py", line 148, in get_elwise_kernel_and_types
mod = module_builder(arguments, operation, name,
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/elementwise.py", line 45, in get_elwise_module
return SourceModule("""
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/compiler.py", line 290, in __init__
cubin = compile(source, nvcc, options, keep, no_extern_c,
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/compiler.py", line 254, in compile
return compile_plain(source, options, keep, nvcc, cache_dir, target)
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/compiler.py", line 78, in compile_plain
checksum.update(preprocess_source(source, options, nvcc).encode("utf-8"))
File "/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/compiler.py", line 54, in preprocess_source
raise CompileError("nvcc preprocessing of %s failed" % source_path,
pycuda.driver.CompileError: nvcc preprocessing of /tmp/tmp0zt44t4o.cu failed
[command: nvcc --preprocess -arch sm_61 -I/home/cryosparc_user/V3.X/cryosparc_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.8/site-packages/pycuda/cuda /tmp/tmp0zt44t4o.cu --compiler-options -P]
[stderr:
b'In file included from <command-line>:0:0:\n/usr/include/stdc-predef.h:40:1: fatal error: cuda_runtime.h: No such file or directory\n #endif\n ^\ncompilation terminated.\n']
Removing the Flex dependencies and reinstalling the 4.1.2 worker cures the problem, and we can access the GPUs again.
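For anyone hitting the same two errors, here is a minimal sketch of a pre-flight check for the two files the tracebacks complain about: `cs.lock` in the project directory and `cuda_runtime.h` in the CUDA toolkit. The function names are my own and not part of CryoSPARC. The toolkit lookup assumes the usual layout with `nvcc` in `<toolkit>/bin` and headers in `<toolkit>/include`, and reading `CUDA_HOME` is just a common convention, so adjust for your install.

```python
import os
import shutil
from pathlib import Path
from typing import Optional


def find_cuda_runtime_header(cuda_home: Optional[str] = None) -> Optional[Path]:
    """Return the path to cuda_runtime.h, or None if it cannot be located.

    Hypothetical helper: looks under cuda_home first (if given), then under
    the toolkit root inferred from whichever nvcc is on PATH.
    """
    candidates = []
    if cuda_home:
        candidates.append(Path(cuda_home))
    nvcc = shutil.which("nvcc")
    if nvcc:
        # Assumes nvcc sits in <toolkit>/bin, so the root is two levels up.
        candidates.append(Path(nvcc).resolve().parent.parent)
    for root in candidates:
        header = root / "include" / "cuda_runtime.h"
        if header.is_file():
            return header
    return None


def project_lock_exists(project_dir: str) -> bool:
    """Return True if <project_dir>/cs.lock exists as a regular file."""
    return (Path(project_dir) / "cs.lock").is_file()


if __name__ == "__main__":
    # Project path taken from the error message above; change as needed.
    project_dir = "/home/cryosparc_user/P1"
    print("cs.lock present:", project_lock_exists(project_dir))

    # CUDA_HOME is an assumption; pass your toolkit path explicitly if unset.
    header = find_cuda_runtime_header(os.environ.get("CUDA_HOME"))
    print("cuda_runtime.h:", header if header else "not found")
```

Run it as the same user and in the same environment that launches the worker, since `nvcc` on `PATH` can differ between shells.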
Thanks,
-jh-