[X86][SSE] Improvements to byte shift shuffle matching