From: Nadav Rotem Date: Mon, 5 Aug 2013 04:27:34 +0000 (+0000) Subject: Update the docs. X-Git-Url: http://plrg.eecs.uci.edu/git/?a=commitdiff_plain;h=055028f6c805b089d10984ec691aa637cf21dc42;p=oota-llvm.git Update the docs. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@187713 91177308-0d34-0410-b5e6-96231b3b80d8 --- diff --git a/docs/Vectorizers.rst b/docs/Vectorizers.rst index 221fb2949f8..61ebca2bb52 100644 --- a/docs/Vectorizers.rst +++ b/docs/Vectorizers.rst @@ -13,6 +13,8 @@ The SLP vectorizer merges multiple scalars that are found in the code into vectors while the Loop Vectorizer widens instructions in loops to operate on multiple consecutive iterations. +Both the Loop Vectorizer and the SLP Vectorizer are enabled by default. + .. _loop-vectorizer: The Loop Vectorizer @@ -21,9 +23,8 @@ The Loop Vectorizer Usage ----- -LLVM's Loop Vectorizer is now enabled by default for -O3. -We plan to enable parts of the Loop Vectorizer on -O2 and -Os in future releases. -The vectorizer can be disabled using the command line: +The Loop Vectorizer is enabled by default, but it can be disabled +through clang using the command line flag: .. code-block:: console @@ -322,12 +323,12 @@ The SLP-vectorizer processes the code bottom-up, across basic blocks, in search Usage ------ -The SLP Vectorizer is not enabled by default, but it can be enabled +The SLP Vectorizer is enabled by default, but it can be disabled through clang using the command line flag: .. code-block:: console - $ clang -fslp-vectorize file.c + $ clang -fno-slp-vectorize file.c LLVM has a second basic block vectorization phase which is more compile-time intensive (The BB vectorizer). This optimization @@ -337,24 +338,3 @@ can be enabled through clang using the command line flag: $ clang -fslp-vectorize-aggressive file.c - -The SLP vectorizer is in early development stages but can already vectorize -and accelerate many programs in the LLVM test suite. - -======================= ============ -Benchmark Name Gain -======================= ============ -Misc/flops-7 -32.70% -Misc/matmul_f64_4x4 -23.23% -Olden/power -21.45% -Misc/flops-4 -14.90% -ASC_Sequoia/AMGmk -13.85% -TSVC/LoopRerolling-flt -11.76% -Misc/flops-6 -9.70% -Misc/flops-5 -8.54% -Misc/flops -8.12% -TSVC/NodeSplitting-dbl -6.96% -Misc-C++/sphereflake -6.74% -Ptrdist/yacr2 -6.31% -======================= ============ -