I have added a multi gpu cluster as a lane but i am unable to submit to the cluster. I can pull the commands from cryosparc and submit it manually. It gives a [Errno 2] No such file or directory
error. Logs attached any help would be appreciated.
[cryosparc_user@pearl ~]$ cryosparcm log command_core
Launchable! -- Launching.
Changed job P2.J8 status launched
Running project UID P2 job UID J8
Running job on worker type cluster
cmd: source /admin/lsflilac/lsf/conf/profile.lsf; source /admin/lsflilac/lsf/conf/profile.lsf; /admin/lsflilac/lsf/10.1/linux3.10-glibc2.17-x86_64/bin/bsub < /data/hite/cryosparc/P2/J8/queue_sub_script.sh
[JSONRPC ERROR 2020-05-29 01:27:07.428737 at run_job ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 2092, in run_job
res = subprocess.check_output(shlex.split(cmd), stderr=subprocess.STDOUT)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 216, in check_output
process = Popen(stdout=PIPE, *popenargs, **kwargs)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 394, in __init__
errread, errwrite)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 1047, in _execute_child
raise child_exception
OSError: [Errno 2] No such file or directory
-----------------------------------------------------
[JSONRPC ERROR 2020-05-29 01:27:07.430072 at scheduler_run ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 1640, in scheduler_run
scheduler_run_core(do_run)
File "cryosparc2_command/command_core/__init__.py", line 1862, in scheduler_run_core
run_job(job['project_uid'], job['uid']) # takes care of the cluster case and the node case
File "cryosparc2_command/command_core/__init__.py", line 124, in wrapper
raise e
OSError: [Errno 2] No such file or directory
-----------------------------------------------------
[JSONRPC ERROR 2020-05-29 01:27:07.430237 at enqueue_job ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 4585, in enqueue_job
scheduler_run()
File "cryosparc2_command/command_core/__init__.py", line 124, in wrapper
raise e
OSError: [Errno 2] No such file or directory
-----------------------------------------------------
---- Killing project UID P2 job UID J8
Changed job P2.J8 status killed
[EXPORT_JOB] : Request to export P2 J8
[EXPORT_JOB] : Exporting job to /data/hite/cryosparc/P2/J8
[EXPORT_JOB] : Exporting all of job's images in the database to /data/hite/cryosparc/P2/J8/gridfs_data...
[EXPORT_JOB] : Done. Exported 0 images in 0.00s
[EXPORT_JOB] : Exporting all job's streamlog events...
[EXPORT_JOB] : Done. Exported 1 files in 0.00s
[EXPORT_JOB] : Exporting job metafile...
[EXPORT_JOB] : Done. Exported in 0.01s
[EXPORT_JOB] : Updating job manifest...
[EXPORT_JOB] : Done. Updated in 0.00s
[EXPORT_JOB] : Exported P2 J8 in 0.03s
[EXPORT_JOB] : Request to export P2 J8
[EXPORT_JOB] : Exporting job to /data/hite/cryosparc/P2/J8
[EXPORT_JOB] : Exporting all of job's images in the database to /data/hite/cryosparc/P2/J8/gridfs_data...
[EXPORT_JOB] : Done. Exported 0 images in 0.00s
[EXPORT_JOB] : Exporting all job's streamlog events...
[EXPORT_JOB] : Done. Exported 1 files in 0.00s
[EXPORT_JOB] : Exporting job metafile...
[EXPORT_JOB] : Done. Exported in 0.01s
[EXPORT_JOB] : Updating job manifest...
[EXPORT_JOB] : Done. Updated in 0.00s
[EXPORT_JOB] : Exported P2 J8 in 0.01s
---------- Scheduler running ---------------
Jobs Queued: [(u'P2', u'J8')]
Licenses currently active : 8
Now trying to schedule J8
Need slots : {u'GPU': 8, u'RAM': 3, u'CPU': 2}
Need fixed : {u'SSD': True}
Master direct : False
Scheduling job to lilac
Failed to connect link: HTTP Error 502: Bad Gateway
Not a commercial instance - heartbeat set to 12 hours.
Launchable! -- Launching.
Changed job P2.J8 status launched
Running project UID P2 job UID J8
Running job on worker type cluster
cmd: source /admin/lsflilac/lsf/conf/profile.lsf; source /admin/lsflilac/lsf/conf/profile.lsf; /admin/lsflilac/lsf/10.1/linux3.10-glibc2.17-x86_64/bin/bsub < /data/hite/cryosparc/P2/J8/queue_sub_script.sh
[JSONRPC ERROR 2020-05-29 01:30:42.034596 at run_job ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 2092, in run_job
res = subprocess.check_output(shlex.split(cmd), stderr=subprocess.STDOUT)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 216, in check_output
process = Popen(stdout=PIPE, *popenargs, **kwargs)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 394, in __init__
errread, errwrite)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 1047, in _execute_child
raise child_exception
OSError: [Errno 2] No such file or directory
-----------------------------------------------------
[JSONRPC ERROR 2020-05-29 01:30:42.035631 at scheduler_run ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 1640, in scheduler_run
scheduler_run_core(do_run)
File "cryosparc2_command/command_core/__init__.py", line 1862, in scheduler_run_core
run_job(job['project_uid'], job['uid']) # takes care of the cluster case and the node case
File "cryosparc2_command/command_core/__init__.py", line 124, in wrapper
raise e
OSError: [Errno 2] No such file or directory
-----------------------------------------------------
[JSONRPC ERROR 2020-05-29 01:30:42.035758 at enqueue_job ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 4585, in enqueue_job
scheduler_run()
File "cryosparc2_command/command_core/__init__.py", line 124, in wrapper
raise e
OSError: [Errno 2] No such file or directory
-----------------------------------------------------
---- Killing project UID P2 job UID J8
Changed job P2.J8 status killed
[EXPORT_JOB] : Request to export P2 J8
[EXPORT_JOB] : Exporting job to /data/hite/cryosparc/P2/J8
[EXPORT_JOB] : Exporting all of job's images in the database to /data/hite/cryosparc/P2/J8/gridfs_data...
[EXPORT_JOB] : Done. Exported 0 images in 0.00s
[EXPORT_JOB] : Exporting all job's streamlog events...
[EXPORT_JOB] : Done. Exported 1 files in 0.00s
[EXPORT_JOB] : Exporting job metafile...
[EXPORT_JOB] : Done. Exported in 0.01s
[EXPORT_JOB] : Updating job manifest...
[EXPORT_JOB] : Done. Updated in 0.05s
[EXPORT_JOB] : Exported P2 J8 in 0.09s
[EXPORT_JOB] : Request to export P2 J8
[EXPORT_JOB] : Exporting job to /data/hite/cryosparc/P2/J8
[EXPORT_JOB] : Exporting all of job's images in the database to /data/hite/cryosparc/P2/J8/gridfs_data...
[EXPORT_JOB] : Done. Exported 0 images in 0.00s
[EXPORT_JOB] : Exporting all job's streamlog events...
[EXPORT_JOB] : Done. Exported 1 files in 0.00s
[EXPORT_JOB] : Exporting job metafile...
[EXPORT_JOB] : Done. Exported in 0.01s
[EXPORT_JOB] : Updating job manifest...
[EXPORT_JOB] : Done. Updated in 0.00s
[EXPORT_JOB] : Exported P2 J8 in 0.02s
---------- Scheduler running ---------------
Jobs Queued: [(u'P2', u'J8')]
Licenses currently active : 8
Now trying to schedule J8
Need slots : {u'GPU': 8, u'RAM': 3, u'CPU': 2}
Need fixed : {u'SSD': True}
Master direct : False
Scheduling job to lilac
Failed to connect link: HTTP Error 502: Bad Gateway
Not a commercial instance - heartbeat set to 12 hours.
Launchable! -- Launching.
Changed job P2.J8 status launched
Running project UID P2 job UID J8
Running job on worker type cluster
cmd: source /admin/lsflilac/lsf/conf/profile.lsf; source /admin/lsflilac/lsf/conf/profile.lsf; /admin/lsflilac/lsf/10.1/linux3.10-glibc2.17-x86_64/bin/bsub < /data/hite/cryosparc/P2/J8/queue_sub_script.sh
[JSONRPC ERROR 2020-06-01 10:50:15.874862 at run_job ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 2092, in run_job
res = subprocess.check_output(shlex.split(cmd), stderr=subprocess.STDOUT)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 216, in check_output
process = Popen(stdout=PIPE, *popenargs, **kwargs)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 394, in __init__
errread, errwrite)
File "/admin/opt/common/cryosparc/software/cryosparc2_master/deps/anaconda/lib/python2.7/subprocess.py", line 1047, in _execute_child
raise child_exception
OSError: [Errno 2] No such file or directory
-----------------------------------------------------
[JSONRPC ERROR 2020-06-01 10:50:15.877183 at scheduler_run ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 1640, in scheduler_run
scheduler_run_core(do_run)
File "cryosparc2_command/command_core/__init__.py", line 1862, in scheduler_run_core
run_job(job['project_uid'], job['uid']) # takes care of the cluster case and the node case
File "cryosparc2_command/command_core/__init__.py", line 124, in wrapper
raise e
OSError: [Errno 2] No such file or directory
-----------------------------------------------------
[JSONRPC ERROR 2020-06-01 10:50:15.877454 at enqueue_job ]
-----------------------------------------------------
Traceback (most recent call last):
File "cryosparc2_command/command_core/__init__.py", line 115, in wrapper
res = func(*args, **kwargs)
File "cryosparc2_command/command_core/__init__.py", line 4585, in enqueue_job
scheduler_run()
File "cryosparc2_command/command_core/__init__.py", line 124, in wrapper
raise e
OSError: [Errno 2] No such file or directory
-----------------------------------------------------