Dynamically generated local scratch

DavidHoover · March 6, 2019, 3:57pm

On our cluster, the local /scratch space is generated dynamically, and the path is not defined until the job is submitted. Is there a variable that represents the slurm job id? Can environment variables be used within the cluster_info.json, such that they are defined at job runtime?

Juha · February 5, 2020, 1:45pm

I have the same issue. I would like to use an environment variable in cluster_info.json to define the SSD path, instead of a fixed path.

stephan · February 5, 2020, 7:46pm

Hey @DavidHoover, @Juha,

You can edit your cluster_script.sh to have the line
export CRYOSPARC_SSD_PATH=<function that creates ssd path>

Just make sure you delete the line from cryosparc2_master/config.sh if you have it set there as well.

Juha · February 6, 2020, 7:41pm

Thanks, this worked
.

Juha · February 23, 2020, 3:29pm

It seems that after updating to v2.14.2 the SSD path is not set anymore to CRYOSPARC_SSD_PATH.

For some reason, cryoSPARC remembers the CRYOSPARC_SSD_PATH from a previous run. It is set correctly in the submission script to the value of $LOCAL_SCRATCH (dynamically set path on our HPC system) - I have verified this by adding a print statement. Despite this a different path (matching an old run) is used (which makes the job crash as this path is not available anymore).

I have checked cryosparc2_master/config.sh and there is no path for the SSD there.

nfrasser · March 13, 2020, 6:27pm

Hi @Juha, we have a fix for this issue coming in our next release.

In the meantime, I’ve prepared a patched revision of the caching code for you to use. With this patched version, the jobs will always use the value of $CRYOSPARC_SSD_PATH from cryosparc2_worker/config.sh.

Download the file from here: https://structura-assets.s3.amazonaws.com/cache_override_v2.14_fix/cache.py

Or from the terminal with wget:

wget https://structura-assets.s3.amazonaws.com/cache_override_v2.14_fix/cache.py

Replace the file in the following folder on the cryoSPARC master node:
cryosparc2_master/cryosparc2_compute/jobs

Then try running the same job again. Let me know if you run into any trouble with this!

nfrasser · May 15, 2020, 5:02pm

FYI, this is now out in the latest cryoSPARC v2.15

thomaspv · December 17, 2020, 9:42pm

Is there something similar to

export CRYOSPARC_SSD_PATH=<function that creates ssd path>

to give each cryosparc job a specific size of scratch to use on a per-job level instead of having the ssdreserve or ssdquota on the per-worker level ?

stephan · December 21, 2020, 9:15pm

Hi @thomaspv,

It’s possible, but in the current implementation of the caching system, if you set a quota too small for the job, it may endlessly wait for the cache to become free.
To better understand how I can find a solution for you, can you elaborate on the reason you’d want something like this?

thomaspv · December 21, 2020, 9:30pm

Hi Stephan,

Currently, our cluster creates a folder on a scratch SSD if we request scratch space. However, it does not explicitly limit us to using the space we requested. I think this would make it possible to use more than the requested which could cause problems for other jobs on that node. If there is a way to provide ssdquota for the job, then I could set it to the space I request for the job through the scheduler.

Thanks.
Paul