In the pre-RA scheduler, maintain cmp+br proximity.
author Andrew Trick <atrick@apple.com>
Thu, 14 Apr 2011 05:15:06 +0000 (05:15 +0000)
committer Andrew Trick <atrick@apple.com>
Thu, 14 Apr 2011 05:15:06 +0000 (05:15 +0000)
commit 12f0dc6bb556976f22d89ebcf42bce273c9e7d38
tree d3c41e0b3c32158742854b3a12ac286d59505e71
parent b57b06647ead66113a97d84c3b3d332afe807dd6

This is done by pushing physical register definitions close to their
use, which happens to handle flag definitions if they're not glued to
the branch. This seems to be generally a good thing though, so I
didn't need to add a target hook yet.
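
The heuristic can be illustrated with a toy bottom-up list scheduler.
This is only a sketch under stated assumptions, not the code in
ScheduleDAGRRList.cpp: Node, scheduleBottomUp, and exampleBlock are
hypothetical names, and the tie-break (prefer the later node in source
order) is a simplification chosen for the example. When two nodes are
ready, the one that defines a physical register is picked first, so it
lands directly above its use in the final order.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy scheduling unit. Field names are illustrative, not LLVM's SUnit.
struct Node {
  std::string name;
  std::vector<std::size_t> preds;  // indices of nodes this node depends on
  bool definesPhysReg = false;     // e.g. a compare that writes the flags
};

// Bottom-up list scheduling: repeatedly pick a node whose successors are
// all scheduled, then reverse the picks to get the top-down order.
std::vector<std::string> scheduleBottomUp(const std::vector<Node> &nodes,
                                          bool preferPhysRegDefs) {
  std::vector<std::size_t> unscheduledSuccs(nodes.size(), 0);
  for (const Node &n : nodes)
    for (std::size_t p : n.preds)
      ++unscheduledSuccs[p];

  std::vector<bool> done(nodes.size(), false);
  std::vector<std::string> reversed;
  for (std::size_t step = 0; step < nodes.size(); ++step) {
    bool found = false;
    std::size_t best = 0;
    for (std::size_t i = 0; i < nodes.size(); ++i) {
      if (done[i] || unscheduledSuccs[i] != 0)
        continue;  // not ready yet
      bool better = !found;
      if (found) {
        if (preferPhysRegDefs &&
            nodes[i].definesPhysReg != nodes[best].definesPhysReg)
          better = nodes[i].definesPhysReg;  // keep the def next to its use
        else
          better = i > best;  // simplified tie-break: later node first
      }
      if (better) {
        best = i;
        found = true;
      }
    }
    assert(found && "dependency graph must be acyclic");
    done[best] = true;
    reversed.push_back(nodes[best].name);
    for (std::size_t p : nodes[best].preds)
      --unscheduledSuccs[p];
  }
  return std::vector<std::string>(reversed.rbegin(), reversed.rend());
}

// A five-node block: the compare feeds the branch, and the store is
// chained before the branch so the branch stays last.
std::vector<Node> exampleBlock() {
  return {
      {"load", {}, false},
      {"cmp", {0}, true},   // defines the flags register
      {"add", {}, false},
      {"store", {2}, false},
      {"jne", {1, 3}, false},
  };
}
```

With the preference disabled, this toy scheduler keeps source order
(load, cmp, add, store, jne) and the compare ends up three instructions
away from the branch. With it enabled, the order becomes load, add,
store, cmp, jne.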

The primary motivation is to generate code closer to what people
expect and to rule out missed macro-op fusion opportunities. As a
side benefit, we get several 2-5% gains on x86 benchmarks. There is
one regression: SingleSource/Benchmarks/Shootout/lists slows down by
10%. But this is an independent scheduler bug that will be tracked
separately. See rdar://problem/9283108.

Incidentally, pre-RA scheduling is only half the solution. Fixing the
later passes is tracked by:
<rdar://problem/8932804> [pre-RA-sched] on x86, attempt to schedule CMP/TEST adjacent with condition jump

Fixes:
<rdar://problem/9262453> Scheduler unnecessary break of cmp/jump fusion

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@129508 91177308-0d34-0410-b5e6-96231b3b80d8
include/llvm/CodeGen/ScheduleDAG.h
lib/CodeGen/SelectionDAG/ScheduleDAGRRList.cpp
lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp
test/CodeGen/X86/2011-04-13-SchedCmpJmp.ll [new file with mode: 0644]
test/CodeGen/X86/lsr-loop-exit-cond.ll
test/CodeGen/X86/pr2659.ll
test/CodeGen/X86/tail-opts.ll
test/CodeGen/X86/test-nofold.ll