[AVX512] Pull pattern for subvector extract into the instruction definition
authorAdam Nemet <anemet@apple.com>
Thu, 25 Sep 2014 23:48:49 +0000 (23:48 +0000)
committerAdam Nemet <anemet@apple.com>
Thu, 25 Sep 2014 23:48:49 +0000 (23:48 +0000)
No functional change.

I initially thought that pulling the Pat<> into the instruction pattern was
not possible because it was doing a transform on the index in order to convert
it from a per-element (extract_subvector) index into a per-chunk (vextract*x4)
index.

Turns out this also works inside the pattern because the vextract_extract
PatFrag has an OperandTransform EXTRACT_get_vextract{128,256}_imm, so the
index in $idx goes through the same conversion.

The existing test CodeGen/X86/avx512-insert-extract.ll extended in the
previous commit provides coverage for this change.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@218480 91177308-0d34-0410-b5e6-96231b3b80d8

lib/Target/X86/X86InstrAVX512.td

index 047686fcdc64f872b3c6f4a418a95952959c7261..274dea90b9b33f0753c202b0fc31d3e22a4a2e50 100644 (file)
@@ -422,10 +422,12 @@ multiclass vextract_for_size<int Opcode,
                              SDNodeXForm EXTRACT_get_vextract_imm> {
   let hasSideEffects = 0, ExeDomain = To.ExeDomain in {
     def rr : AVX512AIi8<Opcode, MRMDestReg, (outs To.RC:$dst),
-            (ins VR512:$src1, i8imm:$src2),
-            "vextract" # To.EltTypeName # "x4\t{$src2, $src1, $dst|"
-                                               "$dst, $src1, $src2}",
-            []>, EVEX, EVEX_V512;
+            (ins VR512:$src1, i8imm:$idx),
+            "vextract" # To.EltTypeName # "x4\t{$idx, $src1, $dst|"
+                                               "$dst, $src1, $idx}",
+            [(set To.RC:$dst, (vextract_extract:$idx (From.VT VR512:$src1),
+                                                     (iPTR imm)))]>,
+            EVEX, EVEX_V512;
     let mayStore = 1 in
     def rm : AVX512AIi8<Opcode, MRMDestMem, (outs),
             (ins To.MemOp:$dst, VR512:$src1, i8imm:$src2),
@@ -434,11 +436,6 @@ multiclass vextract_for_size<int Opcode,
             []>, EVEX, EVEX_V512, EVEX_CD8<To.EltSize, CD8VT4>;
   }
 
-  // Codegen pattern, e.g. v16i32 -> v4i32 for vextracti32x4
-  def : Pat<(vextract_extract:$ext (From.VT VR512:$src1), (iPTR imm)),
-            (To.VT (!cast<Instruction>(NAME # To.EltSize # "x4rr") VR512:$src1,
-                      (EXTRACT_get_vextract_imm To.RC:$ext)))>;
-
   // Codegen pattern with the alternative types, e.g. v8i64 -> v2i64 for
   // vextracti32x4
   def : Pat<(vextract_extract:$ext (AltFrom.VT VR512:$src1), (iPTR imm)),