Hi everyone,
I am working on a large single-particle cryo-EM dataset of a hexameric AAA+ complex with dynamic adaptor occupancy at the N-terminal domain (NTD). The main challenge appears to be strong compositional heterogeneity combined with intrinsic NTD flexibility.
Current workflow:
- Started with ~1 million particles
- After junk removal and hetero refinement: ~800k particles
- Initial 3DVA suggests significant NTD motion even without strong adaptor density
- Proceeded with 3D classification (hard classification) to separate adaptor-associated states
- Used 10 classes with filter resolution set to 10 Å
- Obtained ~6 distinct volumes/classes with varying adaptor-associated densities
I do not expect major conformational changes in the core hexamer, so I am trying to understand what exactly the classification is separating in this case.
My questions are:
1. If 10 classes produce ~6 meaningful volumes, does this likely represent true compositional heterogeneity or could it reflect continuous NTD motion/flexibility?
2. In systems where adaptor occupancy is weak and dynamic, how reliable is hard classification for separating biologically relevant states?
3. Would focused classification/local refinement around the NTD region be more effective than global classification?
4. Are there alternative pipelines people recommend for resolving subtle compositional heterogeneity in highly dynamic datasets (multi-body approaches, particle subtraction, cryoDRGN, iterative heterogeneous refinement, etc.)?
Would really appreciate suggestions or experiences from others working on resolving structural ensembles and compositional heterogeneity in highly dynamic cryo-EM datasets.