Restack particles before export yields tar file containing original particle stack

Trying to save space by consolidating good particles and throwing out bad particles from an initial local motion correction job. From NU refinement, I restack particles resulting in 1.9 GB file. Then reconstruct those particles, export, and tar with dereference option. The resulting file is >100 GB, same as the original local motion correction job.

@cryofun Please can you post the output of the command

find /path/to/project_dir/exports/jobs/job_dir -type d

and also list the job types of all job IDs shown in the find command’s output.

Here it is:

find /path/to/project_dir/exports/jobs/job_dir/ -type d
/path/to/project_dir/exports/jobs/job_dir/
/path/to/project_dir/exports/jobs/job_dir/J4155_volume
/path/to/project_dir/exports/jobs/job_dir/J4155_volume/J4155
/path/to/project_dir/exports/jobs/job_dir/J4155_particles
/path/to/project_dir/exports/jobs/job_dir/J4155_particles/J4153
/path/to/project_dir/exports/jobs/job_dir/J4155_particles/J4153/restack
/path/to/project_dir/exports/jobs/job_dir/J4155_particles/J1371
/path/to/project_dir/exports/jobs/job_dir/J4155_particles/J1371/motioncorrected
/path/to/project_dir/exports/jobs/job_dir/J4155_particles/J1410
/path/to/project_dir/exports/jobs/job_dir/J4155_particles/J1410/localmotioncorrected
/path/to/project_dir/exports/jobs/job_dir/J4155_mask
/path/to/project_dir/exports/jobs/job_dir/J4155_mask/J4155
/path/to/project_dir/exports/jobs/job_dir/gridfs_data

Thanks @cryofun. Please can you also post the outputs of

cd /path/to/project_dir/
ls J1371/motioncorrected | wc -l
ls J1410/localmotioncorrected | wc -l
find exports/jobs/job_dir -type d  -exec sh -c "du -shL {}; ls {} | wc -l" \;

Hi, I do not have a very good understanding of the way particle images are stored, but I had similar issues when trying to transfer particle sets between instances. What worked for me was to take the restacked particle set, and plug it into a Particle Sets job. There, you can open the input group and remove the location, which will result in your selected particles being written into a single .mrcs file, instead of just referencing their location on the micrograph.

Thanks for the help. Here is the output:

ls J1371/motioncorrected | wc -l
26705

ls J1410/localmotioncorrected | wc -l
10196

find /path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/ -type d -exec sh -c "du -shL {}; ls {} | wc -l" \;
235G	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/
6
463M	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_volume
3
463M	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_volume/J4155
6
235G	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_particles
5
2.3G	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_particles/J4153
1
2.3G	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_particles/J4153/restack
1
232G	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_particles/J1371
1
232G	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_particles/J1371/motioncorrected
4221
291M	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_particles/J1410
1
291M	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_particles/J1410/localmotioncorrected
4221
85M	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_mask
3
84M	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/J4155_mask/J4155
1
3.0M	/path/to/project_dir/exports/jobs/job_dir/J4155_homo_reconstruct/gridfs_data
1