Slogging thorough a series of issues after a cooler and motherboard failure and subsequent replacement on our workstation. Previously we were running CentOS7, but we couldn’t get a clean install to work so have upgraded to an install of Ubuntu on an internal SSD. The previous drives are mounted and we’re able to see the file system, including rescuing the latest cryoSPARC database backup (from 3/21).
We have installed a fresh instance of cryoSPARC 4.2.1 on Ubuntu 20.04.06. I have successfully carried over a backed up database into the new instance. We regularly keep our data and project directories on a cluster, which we mount via sshfs, to use this workstation’s GPUs. There was previously a remote worker attached to this station, though we haven’t gotten to the step of reconnecting that yet.
Starting a new project on the mounted cluster, I am able to import micrographs from a different directory on the mounted cluster and run exposure utilities to select a subset of 100 to continue troubleshooting with. Attempting to run CTF Estimation gives:
“[2023-04-14 12:21:48.62] License is valid.
[2023-04-14 12:21:48.71] Launching job on lane [workstation_name] target [workstation_name].[network_name].edu …
[2023-04-14 12:21:49.43] Running job on remote worker node hostname [workstation_name].[networkname].edu
[2023-04-14 12:21:58.67] Failed to launch! 255 Host key verification failed.”
I find it odd that cryoSPARC thinks it’s running the job from a remote worker node. The two lanes on which to queue I see after importing the db are the original lanes I had before this fresh install (i.e. [workstation] and [remote_workstation_that’s_not_reconnected_yet].
I wonder if for some reason importing the old database results in the old local worker being seen as a remote worker, and on top of that we have an issue with ssh? ~/.ssh/known_hosts is empty.
In a likely related issue, attempting to run any job in our previous workspaces gives the error:
“Job directory /path/to/cluster/project/job/ is not empty. found /path/to/cluster/project/job/job.log”
EDIT: restarting the workstation and cryoSPARC changes the behavior for attempting to run jobs in previous workspaces. The error for a NU refinement is now also “Failed to launch! 255 Host key verification failed.”
Any thoughts or suggestions would be appreciated,
Sergei