Hi all,
After upgrading to CryoSPARC v4.6.0, we ran the validation tests as usual (launch, SSD, GPU). All test jobs completed successfully. These tests are a useful way to quickly validate instance functionality after changes and updates.
However, the GPU test failed to detect an issue with the numba library in the worker environment: it was not linked against the correct/latest CUDA version (it was running against a version older than 11.6 but should have been running with CUDA 12.x).
For this reason, I would like to propose modifying the GPU validation test (cryosparcm test workers) so that it runs a minimal compute example to check essential libraries. In our specific case the test passed, but jobs run afterwards hit a CUDA initialization error when loading the Python numba library. The error was:
raise CudaSupportError(f"Error at driver init: {description}")
numba.cuda.cudadrv.error.CudaSupportError: Error at driver init: Call to cuInit results in CUDA_ERROR_UNKNOWN (999)
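As a rough sketch of what such a minimal compute check could look like: the snippet below pushes a small array to the GPU through numba, runs a trivial kernel, and verifies the result. This is not CryoSPARC's actual test code. The function name check_numba_cuda and the add_one kernel are hypothetical, and it assumes numba and numpy are importable in the worker environment. A broken numba/CUDA pairing like the one above should surface here as a failure rather than in a later job.

```python
def check_numba_cuda(n=1024):
    """Hypothetical helper: return (ok, message) after running a tiny numba.cuda kernel."""
    try:
        # Imports are inside the try block so a missing or broken install
        # is reported as a failed check instead of crashing the caller.
        import numpy as np
        from numba import cuda

        @cuda.jit
        def add_one(arr):
            i = cuda.grid(1)      # global 1D thread index
            if i < arr.size:      # guard threads past the end of the array
                arr[i] += 1.0

        host = np.zeros(n, dtype=np.float32)
        dev = cuda.to_device(host)            # first call that needs the CUDA driver
        threads = 128
        blocks = (n + threads - 1) // threads
        add_one[blocks, threads](dev)         # compile and launch the kernel
        result = dev.copy_to_host()

        if not np.all(result == 1.0):
            return False, "kernel ran but produced wrong values"
        return True, "numba CUDA kernel ran correctly"
    except Exception as exc:
        # Includes numba's CudaSupportError raised when driver init fails.
        return False, f"{type(exc).__name__}: {exc}"
```

Calling check_numba_cuda() with the worker environment's Python and printing the returned message would be enough for a pass/fail line in the test output. Catching a broad Exception is deliberate in this sketch, since the goal is to report any failure in the numba/CUDA stack rather than to handle specific ones.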
Best,
Erich