Opinions on composite reconstructions?

Hi there
I have been told that many strongly discourage the production of composite reconstructions; for clarity i mean merging of multiple local refinements using a greater conensus map to increase local resolution areas.

I am working on a large and flexible structure that demands use of quite a few local refinements. Individually these refinements are great in resolution, and can be fit together contiguously. In many cases, a single protein spans 2-3 local maps.

I am now thinking of how to deposit these maps and their corresponding models; the simplest solution is to use Phenix or ChimeraX to merge the maps (I have found phenix is best). But as stated above, I am frequently told to avoid this by colleagues, but i don’t really know the arguments against this? I can think of a few small issues personally, but nothing major off the top of my head.

Can someone illuminate me?

Cheers,
James.

2 Likes

For me, there are a range of reasons I am careful with composite maps. Others may agree or disagree. I’m not saying that they are not useful (they often can be) but that while caution is recommended with cryo-EM data in general (experienced eyes-on is king) composite maps need even more.

First, artefacting: stitching lines, density offsets, resampling loss are all a concern. Second, a lot of basic validations are difficult or impossible with composite maps. Third, there are a lot of different ways to “composite” a map - I saw a preprint around 2022 which is now published (Beton et al, 2024) and which might be of interest for what looks like a cleaner method compared to PHENIX or ChimeraX. Fourth, poorly resolved regions (in the focussed refinements) are usually where you’re stitching together, meaning that they tend to have increased noise. Given the enthusiasm some modellers have with putting water in every single noise speck, at higher resolutions I worry that interpretations of non-protein density might be… overly optimistic. So how much overlap do you use? A set amount, or case-by-case? Fifth, are you losing heterogeneity which is actually of value for interpretation? Cryo-EM is now very, very interested in protein dynamics/flexibility (latent space, CryoDRGN, TomoDRGN, CryoDRGN AI, 3D Flex, DynaMight, etc.) which is basically the exact opposite of composite map generation (therefore, question whether your composite map is biologically accurate/relevant*).

The first composite maps I saw had obvious stitching artefacts and no “component” maps were available (along with their corresponding half-maps) so sanity checking was functionally impossible. Now EMDB has improved support for deposition of composite maps dramatically.

*This is, of course, a concern for all purified or recombinant samples, but until we can hit <3Ang consistently and constantly with in situ work… :wink:

3 Likes

@rbs_sci Thankyou so much for your detailed reply :slight_smile: Id like to respond as a sanity check to each point :slight_smile:

First: I think all of these concerns are valid, and these are the issues i mentioned i can think of; while these concern me, i think in some of these cases there are mitigations, for example, using substantial overlap in the areas with quality density to minimize incorrect density offsets.

Second: Yes this is obviously an issue on the map front; in theory you can still model a protein and validate the geometry and model map fit (though this does pose the risk of introducing slight but potentially meaningful geometry errors if there are density offset errors), and individual component maps and concensus map are required as mandatory additional depositions on PDB nowadays so the maps in theory can be independently validated.

Third: Thankyou so much for the paper, i will read this :slight_smile: i have to date found that Phenix works best to minimize noise between component maps. For strictly visualization purposes, i have in past used chimeraX delete density tool to remove noisy periphery of component maps, preserving only the high resolution overlapping regions. This produces a composite density that looks far less noisy. I want to stress i have never deposited one of these maps as deleting parts of density even if its done rationally to remove noise seems dodgy. Do you agree this should not be deposited?

Fourth: this leads on a bit from the last comment. I agree this is a big problem; i have found it far less of a problem in phenix, as phenix essentially crops the best map around each model, meaning there is no noise added from the maps that are poor quality at those locations. Also i am working at 2.8-3.5 Ă… so i never model waters, and am careful with what i do model. The densities are very clean for this resolution so protein is pretty unambiguous. Also i normally use overlap on a case by case basis, but inspect the result to determine the quality of the overlap region.

Fifth: absolutely definitely i am! however, this is the only way to achieve high resolution of the region and does represent what i believe is the only stable conformation. The structure features a hinging mechanism, that appears to have a free range of motion when not “locked” so i don’t think it is possible for me to see the other conformations, however i can view the “locked” conformation as a composite map at high res which is better than no structure i figure.

Finally, i have deposited composite maps before i was informed of how critical people are of them. I would always deposit the maps with clear title indication of the fact that they are composite, all component maps linked, and the concensus map. The models is the difficult part i feel as i said some proteins cannot be resolved in a single map.

I could in theory generate several protein fragments in each map and deposit them seperately, but i feel this would be more confusing, and that a single model in a composite map composed from high quality overlapping maps is probably better and still fairly structurally accurate despite small introduced artefacts?

Sorry for making this so long, its just a complicated question and there are few people i can ask around me!
James.

1 Like

Honestly, you’ve addressed most of the point that I get twitchy on. :slight_smile: I would hope others chime in with their suggestions/opinions also. It’s pretty much unavoidable to lose resolution when resampling (just do it as little as possible :smiley: ) and stitching artefacts can still be a pain.

Sounds a fascinating complex, I look forward to seeing it public! :smiley:

If I remember correctly, there was a bit of a scandal last year regarding some… creative… map deletions in depositions…

Sounds reasonable, as all the focussed refinements and their half-maps will be available. :slight_smile:

Thanks again for your responses I really appreciate hearing other opinions on this :slight_smile: seems one of those things that has a hazy consensus!

Roger that sounds like i made the right call!

It is very very cool! shouldnt be terribly long if i can sort out these composites!
Cheers again,
James.

Hello!

Adding 3DVA to the list of tools in the first reply by rbs_sci.

From the top of my head, there might be cases where the movement of one domain does not interfere with the others, but what I see all the time is the opposite. It is usually much more interesting to have the full picture of several different conformations including all possible domains, than to have very high resolution of one small bit - unless that small bit is critical, of course, but why not having everything in that case? Add 3DFlex movies - you can have a lot of fun with custom meshes, and try different masks on 3DVA. Multiple conformations add up to the story and might even drive you to very interesting questions on the protein physiology. No problem with local refinements, but then you won’t need to composite anything. If the Phenix tool - which I am not familiar with - is not using information from the particles to improve your composite, then it is just modifying the map to make it look nicer, but what is the real usefulness of it?

@carlos Hi mate thanks for the reply :slight_smile:

In part we wanted the high res complex because none of the proteins were previously known and have not been modelled. We also weren’t certain which proteins made up the mechanism so this gave us the insight we needed to ID the component proteins. I agree its really interesting to see the different conformational states but we could not get any 3D classes of alternate states. We think the unlocked conformation has a very wide range of motion.

Flex 3D is an excellent idea i had not thought of! we will do this for sure!

Cheers again,
James.