I am working on two datasets (same sample, same data collection scheme). I started to process dataset1 and got to a good map after refinement (70k particles at the end). I then process dataset2 till 2D classification (only 30k particles) and I wanted to merge them.
When merging them, I run first a 2D classification and get these odd classes:
But then when trying to run a refinement, or to start over from ab initio, the maps look very odd (ab initio fails, in refinement, resolution and quality map get worse).
Global CTF refinement refines and corrects for optical aberrations which can vary from dataset to dataset (even if collected in the same way). I would try running that as a diagnostic at least.
Do both sets independently refine to high resolution, and things only get bad when they are combined?
I see. I have run the Global CTF ref and now using the particles for refinement.
Dataset 1 goes to 3.8A, for the dataset 2 (only 30k particles) I stopped at 2D classification and directly merged the particles. And the dataset 2 2D classification looks just as for dataset 1.
Yes although you want to look at the plots above it (the red and blue ones) to get an idea of whether the aberrations are being fit appropriately. If this is correct, this is a lot of tilt! And could make a significant difference to refinement at your current resolution.
Also there should be two sets of numbers in the log - one for each exposure group (which in this case should correspond to your two datasets)
Definitely there is some pretty strong beam tilt in the underlying data - correcting that should help somewhat even for the individual datasets, and may help during refinement of the combined set.
The first plot “Data for odd terms” shows if there are aberrations present in the underlying data - the red and blue indicates beam tilt in this case.
The second plot shows a model for this beam tilt, and the third plot shows the residual after applying this model. This third plot should just look like noise if beam tilt has been well corrected - in this case there is some residual at least for group 21, so estimates are not perfect yet (to be expected at this resolution).
You may want to play around with the resolution limits for the data to use in Global CTF (the defaults are quite conservative), and once you feel you have the best fit, use the resulting particle set for a fresh refinement (e.g. NU-refine).