Hi Harris,
Sorry for the confusion. Now I post the correct output from the problematic job:
========= monitor process now starting main process
MAINPROCESS PID 20714
========= monitor process now waiting for main process
MAIN PID 20714
motioncorrection.run_patch cryosparc2_compute.jobs.jobregister
Running job on hostname %s sangho-sparc.skku.edu
Allocated Resources : {u’lane’: u’default’, u’target’: {u’monitor_port’: None, u’lane’: u’default’, u’name’: u’sangho-sparc.skku.edu’, u’title’: u’Worker node sangho-sparc.skku.edu’, u’resource_slots’: {u’GPU’: [0, 1], u’RAM’: [0, 1], u’CPU’: [0, 1, 2, 3, 4, 5, 6, 7]}, u’hostname’: u’sangho-sparc.skku.edu’, u’worker_bin_path’: u’/home/sangho/cryosparc/cryosparc2_worker/bin/cryosparcw’, u’cache_path’: None, u’cache_quota_mb’: None, u’resource_fixed’: {u’SSD’: False}, u’gpus’: [{u’mem’: 3163422720, u’id’: 0, u’name’: u’GeForce GTX 1050’}, {u’mem’: 3166699520, u’id’: 1, u’name’: u’GeForce GTX 1050’}], u’cache_reserve_mb’: 10000, u’type’: u’node’, u’ssh_str’: u’sangho@sangho-sparc.skku.edu’, u’desc’: None}, u’license’: True, u’hostname’: u’sangho-sparc.skku.edu’, u’slots’: {u’GPU’: [0], u’RAM’: [0, 1], u’CPU’: [0, 1, 2, 3, 4, 5]}, u’fixed’: {u’SSD’: False}, u’lane_type’: u’default’, u’licenses_acquired’: 1}
/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/site-packages/skcuda/cublas.py:284: UserWarning: creating CUBLAS context to get version number
warnings.warn(‘creating CUBLAS context to get version number’)
Process Process-1:1:
Traceback (most recent call last):
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/multiprocessing/process.py”, line 267, in _bootstrap
self.run()
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/multiprocessing/process.py”, line 114, in run
self._target(*self._args, **self._kwargs)
File “cryosparc2_compute/jobs/pipeline.py”, line 155, in process_work_simple
process_setup(proc_idx) # do any setup you want on a per-process basis
File “cryosparc2_master/cryosparc2_compute/jobs/motioncorrection/run_patch.py”, line 80, in cryosparc2_compute.jobs.motioncorrection.run_patch.run_patch_motion_correction_multi.process_setup
File “cryosparc2_compute/engine/init.py”, line 8, in
from engine import *
File “cryosparc2_worker/cryosparc2_compute/engine/engine.py”, line 12, in init cryosparc2_compute.engine.engine
File “cryosparc2_worker/cryosparc2_compute/engine/gfourier.py”, line 6, in init cryosparc2_compute.engine.gfourier
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/site-packages/skcuda/fft.py”, line 20, in
from . import misc
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/site-packages/skcuda/misc.py”, line 25, in
from . import cublas
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/site-packages/skcuda/cublas.py”, line 292, in
_cublas_version = int(_get_cublas_version())
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/site-packages/skcuda/cublas.py”, line 285, in _get_cublas_version
h = cublasCreate()
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/site-packages/skcuda/cublas.py”, line 203, in cublasCreate
cublasCheckStatus(status)
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/site-packages/skcuda/cublas.py”, line 179, in cublasCheckStatus
raise e
cublasNotInitialized
Traceback (most recent call last):
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/multiprocessing/queues.py”, line 268, in _feed
send(obj)
IOError: [Errno 32] Broken pipe
Traceback (most recent call last):
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/multiprocessing/queues.py”, line 268, in _feed
send(obj)
IOError: [Errno 32] Broken pipe
Traceback (most recent call last):
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/multiprocessing/queues.py”, line 268, in _feed
send(obj)
IOError: [Errno 32] Broken pipe
Traceback (most recent call last):
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/multiprocessing/queues.py”, line 268, in _feed
send(obj)IOError: [Errno 32] Broken pipe
Traceback (most recent call last):
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/multiprocessing/queues.py”, line 268, in _feed
send(obj)
IOError: [Errno 32] Broken pipe
Traceback (most recent call last):
File “/home/sangho/cryosparc/cryosparc2_worker/deps/anaconda/lib/python2.7/multiprocessing/queues.py”, line 268, in _feed
send(obj)
IOError: [Errno 32] Broken pipe
========= sending heartbeat
========= sending heartbeat
========= sending heartbeat
**** handle exception rc
set status to failed
Traceback (most recent call last):
File “cryosparc2_worker/cryosparc2_compute/run.py”, line 85, in cryosparc2_compute.run.main
File “cryosparc2_master/cryosparc2_compute/jobs/motioncorrection/run_patch.py”, line 363, in cryosparc2_compute.jobs.motioncorrection.run_patch.run_patch_motion_correction_multi
AssertionError: Child process with PID 20746 has terminated unexpectedly!
========= main process now complete.
========= monitor process now complete.