[x86] Recognize that we can use duplication to widen v16i8 shuffles due