add support for 128 bit inputs on both x86-64 and x86-32.