Programming Languages Research Group: Git

author	Chandler Carruth <chandlerc@gmail.com>
	Fri, 3 Oct 2014 13:11:13 +0000 (13:11 +0000)
committer	Chandler Carruth <chandlerc@gmail.com>
	Fri, 3 Oct 2014 13:11:13 +0000 (13:11 +0000)
commit	dce98e67391a5d721b96fa2be85219c2f81bdce8
tree	5a2cb9816c492ed9ba566847ce65a838c30b4900	tree \| snapshot
parent	ea01fda5b399b9071de9b7c8b500e2d5a86729c7	commit \| diff

[x86] Teach the new vector shuffle lowering to aggressively form MOVSS
and MOVSD nodes for single element vector inserts.

This is particularly important because a number of patterns in the
backend detect these patterns and leverage them to simplify things. It
also fixes quite a few of the insertion bad code examples. However, it
regresses a specific area: when available, blendps and blendpd are
*dramatically* faster than movss and movsd respectively. But it doesn't
really work to form the blend logic first because the blends *aren't* as
crazy efficient when the data is coming from memory anyways, and thus
will have a movss or movsd regardless. Also, doing that would block
a bunch of the patterns that this is designed to hit.

So my plan is to go into the patterns for lowering MOVSS and MOVSD and
lower them via blends when available. However that's a pretty invasive
restructuring so it will need to be a follow-up patch.

I have already gone into the patterns to lower MOVSS and MOVSD from
memory using MOVLPD, etc. Without that, several of the test cases
I already have regress.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@218985 91177308-0d34-0410-b5e6-96231b3b80d8

lib/Target/X86/X86ISelLowering.cpp		diff \| blob \| history
lib/Target/X86/X86InstrSSE.td		diff \| blob \| history
test/CodeGen/X86/vector-shuffle-128-v16.ll		diff \| blob \| history
test/CodeGen/X86/vector-shuffle-128-v2.ll		diff \| blob \| history
test/CodeGen/X86/vector-shuffle-256-v4.ll		diff \| blob \| history
test/CodeGen/X86/vector-shuffle-512-v8.ll		diff \| blob \| history