Use movups to lower memcpy and memset even if it's not fast (like corei7).
authorEvan Cheng <evan.cheng@apple.com>
Thu, 6 Jan 2011 07:58:36 +0000 (07:58 +0000)
committerEvan Cheng <evan.cheng@apple.com>
Thu, 6 Jan 2011 07:58:36 +0000 (07:58 +0000)
commit461f1fc359dff438dad25e809499845b10a3d032
tree143a2a682ffdd84409d6bd1673e22630d42d565e
parentcce240d26bbf1c2bec9cfff4838d8d807b215586
Use movups to lower memcpy and memset even if it's not fast (like corei7).
The theory is it's still faster than a pair of movq / a quad of movl. This
will probably hurt older chips like P4 but should run faster on current
and future Intel processors. rdar://8817010

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@122955 91177308-0d34-0410-b5e6-96231b3b80d8
lib/Target/X86/X86ISelLowering.cpp
test/CodeGen/X86/2010-04-08-CoalescerBug.ll
test/CodeGen/X86/2010-06-25-CoalescerSubRegDefDead.ll
test/CodeGen/X86/2010-09-17-SideEffectsInChain.ll
test/CodeGen/X86/memcpy-2.ll
test/CodeGen/X86/memcpy.ll
test/CodeGen/X86/memset-2.ll
test/CodeGen/X86/memset64-on-x86-32.ll
test/CodeGen/X86/small-byval-memcpy.ll
test/CodeGen/X86/tlv-1.ll
test/CodeGen/X86/unaligned-load.ll