Hi,
Remove duplicates is very slow on large datasets (millions of symmetry-expanded particles) and takes multiple hours to complete.
It does not seem to be parallelized across multiple CPUs (please correct me if I am mistaken). Would it be possible to parallelize it to speed up processing of large datasets?
Cheers
Oli