Job struck in launch state in UI

Hi Team,

while running jobs on cryosparc in UI i can see this error -
slurm_load_jobs error: Invalid job id specified

nothing in the .err or .out file but in the job.log i can see

/efs/sw/cryosparc/v4.4.1_240110/cryosparc_worker/cryosparc_tools/cryosparc/command.py:134: UserWarning: *** CommandClient: (server:39002/api) URL Error [Errno 111] Connection refused, attempt 1 of 3. Retrying in 30 seconds
system = self._get_callable(“system.describe”)()

Hi @wtempel, could you please help me on this.
Thank you

could occur if the slurm job ID is incorrectly parsed or the correctly parsed slurm job ID is no longer “known” to slurm because the job has exited.

Please ensure that

  • the command_core process is running on the CryoSPARC master host:
    cryosparcm status
    
  • the server hostname can be resolved by the worker node to the IP address of the CryoSPARC master host
  • access to port 39002 on the CryoSPARC master host from the worker host is not blocked by a firewall
1 Like

Hi @wtempel, Thank you for your inputs.
it is working now.

1 Like