[x86] Teach the new vector shuffle lowering of v4f64 to prefer a direct
authorChandler Carruth <chandlerc@gmail.com>
Sun, 21 Sep 2014 11:17:55 +0000 (11:17 +0000)
committerChandler Carruth <chandlerc@gmail.com>
Sun, 21 Sep 2014 11:17:55 +0000 (11:17 +0000)
commit29720a4bad5f6ca271843f31cf3f03865ccb73cb
tree60523d08c755ea5e36fbf013369effbd8dcf8140
parent0dd52092d0d067f5e1c7bda3c5087315c7ede6dd
[x86] Teach the new vector shuffle lowering of v4f64 to prefer a direct
VBLENDPD over using VSHUFPD. While the 256-bit variant of VBLENDPD slows
down to the same speed as VSHUFPD on Sandy Bridge CPUs, it has twice the
reciprocal throughput on Ivy Bridge CPUs much like it does everywhere
for 128-bits. There isn't a downside, so just eagerly use this
instruction when it suffices.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@218208 91177308-0d34-0410-b5e6-96231b3b80d8
lib/Target/X86/X86ISelLowering.cpp
test/CodeGen/X86/vector-shuffle-256-v4.ll