cuMemcpyHtoDAsync failed

Hi again, Unfortunately I spoke too soon.

2D classification ran without problems.
However, the exact same error described above occurred again when extracting the same particles with a smaller box size.

originally with CUDA 11.5 the extraction job would consistently crash around 800 micrographs out of 4000. Now with CUDA 11.2 the job crashed around 3300 out of 4000. Why it completed successfully the first time with a larger boxsize. I do not know…

Other ideas? thanks again!
Ben