Shuffle optimization for AVX/AVX2.