My jobs on an HPC cluster often fail with the error message:
“Job is unresponsive - no heartbeat received in 30 seconds.”
I increased “CRYOSPARC_HEARTBEAT_SECONDS” to 180 then 3600, but I still get the same error.
gives “export CRYOSPARC_HEARTBEAT_SECONDS=3600” properly.
Did CS job ignore the info, or see no heartbeat for 3600 sec and export the “30 sec” error?
Let me know anything wrong with the line or if any log would be useful for diagnosis.
Thanks for your help!