Add NEON 'laned' operations. This fixes another bunch of gcc testsuite failures and