Ab initio is giving more or less similar classes

Hi everyone,

I am working with a small membrane protein with a protruding extracellular domain.

After Blob picker, I did 2 rounds of 2D classification and ended up with about 360K particles. An example of my 2D classes:

I ran ab initio to create an initial model and get rid of junk particles. I changed the following parameters:

Number of ab initio classes: 7
Maximum resolution: 7 Å
Initial resolution: 10 Å
Initial mini batch size: 300
Final minibatch size: 1000
Class similarity: 0
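(Aside, in case anyone is scripting this: the same settings can be collected into a params dict and queued with cryosparc-tools instead of the GUI. This is only a sketch — the internal key names below (`abinit_K`, `abinit_max_res`, etc.) are my assumptions about how the GUI labels map to builder parameters, so check them against the job builder in your cryoSPARC version.)

```python
# Hedged sketch: the modified ab initio settings from the post, packed into
# one dict, e.g. for cryosparc-tools' project.create_job(). All key names
# here are ASSUMPTIONS about cryoSPARC's internal parameter names -- verify
# them in your own instance before queuing anything.

def abinit_params(n_classes=7, max_res=7, init_res=10,
                  minibatch_init=300, minibatch_final=1000,
                  class_similarity=0.0):
    """Collect the ab initio settings described above into a params dict."""
    return {
        "abinit_K": n_classes,                        # Number of ab initio classes
        "abinit_max_res": max_res,                    # Maximum resolution (Å)
        "abinit_init_res": init_res,                  # Initial resolution (Å)
        "abinit_minisize_init": minibatch_init,       # Initial minibatch size
        "abinit_minisize": minibatch_final,           # Final minibatch size
        "abinit_class_anneal_beta": class_similarity, # Class similarity
    }

params = abinit_params()
# Then, roughly (hypothetical usage, cryosparc-tools):
# job = project.create_job("W1", "homo_abinit", params=params)
# job.queue()
```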

When I run the ab initio job, I get 7 similar-looking classes: each resembles a particle or a micelle, like in this picture:

When I check them in Chimera, they seem to have density inside the micelle, so there should be some protein-containing particles among them.

When I read the literature, I notice that in every paper ab initio succeeds in separating the junk from the real particles, since they state something like: “we obtained junk classes, which we removed.”

I am wondering if my ab initio is converging too soon (if this makes sense) and if there is a parameter I can tweak to circumvent this problem. I am now running heterogeneous refinement with the volumes I got from ab initio to see if I can get rid of the junk particles.

I have also faced the same problem when I did 3D classification on a different set of particles.

Does anyone have any suggestions on how to get rid of junk during 3D ab initio reconstruction or any other way? I would be grateful.

Run again with 2, 3, 4, 5 classes. I use ab initio simply to generate a good reference for future jobs. I do not use it as a selection criterion for good/bad particles, though that seems to be what many do. Instead, do what is possible to get a good initial model, and clean up with heterogeneous refinement, providing 1 good and 5 bad models, or half/half, or 1 junk and 5 good models, etc. Often using several “good” models of varying preferred orientation or resolution will also succeed in tricking the het refine into securing 1 very good class.

You can also just take the best 2-3 particle sets and run a new ab initio with them to get a better model, and repeat. Once I got a good initial model with just 20k particles, I then used heterogeneous refinement to sort out particles that I had previously discarded, too.

To circumvent too early convergence, you can set “Number of initial iterations” to something like 1000.

Thank you @CryoEM2 for your response. I will give cleaning up with het-ref a try. Do you mind elaborating on what you meant by “using several “good” models of varying preferred orientation”? Did you mean deliberately choosing particles that pertain to a certain view, creating a model from them, and using it as a volume in het-ref?

Thank you @MLiziczai for responding. That was my previous strategy, but it didn’t help much either. I will try your suggestion about increasing the number of initial iterations.

Let me know how it goes.
I found that setting the nr. of initial iterations below 500 does nothing, and although I also raise the nr. of final iterations to 1000, I find that less important. I have seen far larger numbers than 1000 work too. Just be aware that it takes very long to compute (depending on the particle count and the number of final iterations).

Mostly yes. By running ab initio and het refine several times, I typically end up with a couple of classes that contain mostly just one preferred view, and the 3D volume of course has extreme anisotropy. Using this as a reference (or a few such references) can sometimes act as a sink to dial out the overrepresented view, giving the remaining particles a better chance of developing into a good 3D reconstruction.