Custom lower v4i32 multiplies into a cute sequence, instead of having legalize
authorChris Lattner <sabre@nondot.org>
Tue, 18 Apr 2006 03:24:30 +0000 (03:24 +0000)
committerChris Lattner <sabre@nondot.org>
Tue, 18 Apr 2006 03:24:30 +0000 (03:24 +0000)
commite7c768ea24027938b52abd6ee94755b6d17f3da3
tree9e65c68d141466e2679d512fd836bdfc94bd148e
parent843ecd647c9bcce841e113703ecd1bc57d248b82
Custom lower v4i32 multiplies into a cute sequence, instead of having legalize
scalarize the sequence into 4 mullw's and a bunch of load/store traffic.

This speeds up v4i32 multiplies 4.1x (measured) on a G5.  This implements
PowerPC/vec_mul.ll

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@27788 91177308-0d34-0410-b5e6-96231b3b80d8
lib/Target/PowerPC/PPCISelLowering.cpp