What is the CTF estimation for this micrograph? If the CTF looks good, you can extract these particles using a larger box size and then bin them down (for example, extract at 600 Å and bin to 200 Å). What is the pixel size for this dataset?
We have faced a similar issue in our lab with a 0.74 Å/pixel dataset. When we extracted with a larger box size, both the 2D classes and the refinement improved significantly.
Another possible issue could be the number of particles. Do you have more particle classes available, or are these the only classes you used for ab-initio and subsequent refinement?