The new beam shift clustering feature in Exposure Group Utilities is excellent!
However, with Leginon-generated ad hoc beam tilt values as the input, there can be some small residual clusters of only a few images (at least when using the agglomerative clustering method).
These groups can sometimes be too small to allow for reliable beam tilt estimation.
Would it be possible to set a minimum group size? The agglomerative clustering would first split the images into n clusters, then disperse any clusters below the minimum group size amongst their nearest neighbor groups.
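To illustrate the idea, here is a rough sketch of the post-processing step I have in mind. It is only an assumption of how this could work, not how Exposure Group Utilities is implemented: the function name `disperse_small_clusters` and its inputs (a list of per-image 2D coordinates and the cluster label assigned to each image) are hypothetical, and "nearest neighbor group" is taken here to mean the retained group with the closest centroid.

```python
from collections import Counter
from math import dist


def disperse_small_clusters(points, labels, min_size):
    """Reassign images in clusters smaller than min_size to the nearest
    retained cluster (nearest by Euclidean distance to its centroid).

    points:   list of (x, y) tuples, one per image (hypothetical input)
    labels:   list of cluster labels, one per image
    min_size: minimum number of images a group must contain
    """
    counts = Counter(labels)
    keep = [c for c, n in counts.items() if n >= min_size]
    if not keep:
        raise ValueError("no cluster reaches min_size")

    # Centroid of each retained cluster, computed before any reassignment.
    centroids = {}
    for c in keep:
        members = [p for p, lab in zip(points, labels) if lab == c]
        centroids[c] = tuple(sum(axis) / len(members) for axis in zip(*members))

    new_labels = []
    for p, lab in zip(points, labels):
        if lab in centroids:
            # Image already belongs to a large enough group.
            new_labels.append(lab)
        else:
            # Image belongs to an undersized group: move it to the closest one.
            new_labels.append(min(keep, key=lambda c: dist(p, centroids[c])))
    return new_labels
```

Each image from an undersized group is reassigned individually, so a small cluster sitting between two larger ones can be split between them rather than merged wholesale into one.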
Also, is there any data-driven way to optimize the number of clusters? There is an obvious trade-off between the size of the groups and the accuracy and precision of the beam tilt estimates. Would it be possible to optimize the grouping on the fly during Global CTF Refinement? I imagine this could make a difference for high-resolution structures.