Intel's x86-simd-sort 4.0 Delivers A 2x Boost For AVX-512 Performance, Adds AVX2 Code
Earlier this year Intel published x86-simd-sort as a very speedy sorting library that initially leveraged AVX-512 instructions for 10x to 17x faster sorts. Numpy was one of the first major projects to adopt x86-simd-sort and OpenJDK more recently adopted it. Since the initial release we've seen more features and performance optimizations added. Today marks the release of x86-simd-sort 4.0 and it's delivering even greater performance while also adding an AVX2 code path to help those without AVX-512...