Don't include Operator.h from InstrTypes.h.

[oota-llvm.git] / docs / LangRef.html
diff --git a/docs/LangRef.html b/docs/LangRef.html

index c2ee7019b9601d76aeedb4b49a5a3d2a3b6133c8..90b200809c3345b573243d7ed36fa57e30076085 100644 (file)
--- a/docs/LangRef.html
+++ b/docs/LangRef.html
@@ -493,7 +493,7 @@
  
  <pre class="doc_code">
  <i>; Declare the string constant as a global constant.</i>&nbsp;
-<a href="#identifiers">@.LC0</a> = <a href="#linkage_internal">internal</a> <a href="#globalvars">constant</a> <a href="#t_array">[13 x i8]</a> c"hello world\0A\00"      <i>; [13 x i8]*</i>&nbsp;
+<a href="#identifiers">@.LC0</a> = <a href="#linkage_internal">internal</a>&nbsp;<a href="#globalvars">constant</a>&nbsp;<a href="#t_array">[13 x i8]</a> c"hello world\0A\00"      <i>; [13 x i8]*</i>&nbsp;
  
  <i>; External declaration of the puts function</i>&nbsp;
  <a href="#functionstructure">declare</a> i32 @puts(i8*)                                      <i>; i32 (i8*)* </i>&nbsp;
@@ -846,6 +846,13 @@ define i32 @main() {   <i>; i32()* </i>&nbsp;
     region of memory, and all memory objects in LLVM are accessed through
     pointers.</p>
  
+<p>Global variables can be marked with <tt>unnamed_addr</tt> which indicates
+  that the address is not significant, only the content. Constants marked
+  like this can be merged with other constants if they have the same
+  initializer. Note that a constant with significant address <em>can</em>
+  be merged with a <tt>unnamed_addr</tt> constant, the result being a
+  constant whose address is significant.</p>
+
  <p>A global variable may be declared to reside in a target-specific numbered
     address space. For targets that support them, address spaces may affect how
     optimizations are performed and/or what target instructions are used to
@@ -885,7 +892,8 @@ define i32 @main() {   <i>; i32()* </i>&nbsp;
  <p>LLVM function definitions consist of the "<tt>define</tt>" keyword, an
     optional <a href="#linkage">linkage type</a>, an optional
     <a href="#visibility">visibility style</a>, an optional
-   <a href="#callingconv">calling convention</a>, a return type, an optional
+   <a href="#callingconv">calling convention</a>,
+   an optional <tt>unnamed_addr</tt> attribute, a return type, an optional
     <a href="#paramattrs">parameter attribute</a> for the return type, a function
     name, a (possibly empty) argument list (each with optional
     <a href="#paramattrs">parameter attributes</a>), optional
@@ -896,7 +904,8 @@ define i32 @main() {   <i>; i32()* </i>&nbsp;
  <p>LLVM function declarations consist of the "<tt>declare</tt>" keyword, an
     optional <a href="#linkage">linkage type</a>, an optional
     <a href="#visibility">visibility style</a>, an optional
-   <a href="#callingconv">calling convention</a>, a return type, an optional
+   <a href="#callingconv">calling convention</a>,
+   an optional <tt>unnamed_addr</tt> attribute, a return type, an optional
     <a href="#paramattrs">parameter attribute</a> for the return type, a function
     name, a possibly empty list of arguments, an optional alignment, and an
     optional <a href="#gc">garbage collector name</a>.</p>
@@ -922,6 +931,9 @@ define i32 @main() {   <i>; i32()* </i>&nbsp;
     specified, the function is forced to have at least that much alignment.  All
     alignments must be a power of 2.</p>
  
+<p>If the <tt>unnamed_addr</tt> attribute is given, the address is know to not
+  be significant and two identical functions can be merged</p>.
+
  <h5>Syntax:</h5>
  <pre class="doc_code">
  define [<a href="#linkage">linkage</a>] [<a href="#visibility">visibility</a>]
@@ -1005,13 +1017,15 @@ declare signext i8 @returns_signed_char()
  <dl>
    <dt><tt><b>zeroext</b></tt></dt>
    <dd>This indicates to the code generator that the parameter or return value
-      should be zero-extended to a 32-bit value by the caller (for a parameter)
-      or the callee (for a return value).</dd>
+      should be zero-extended to the extent required by the target's ABI (which
+      is usually 32-bits, but is 8-bits for a i1 on x86-64) by the caller (for a
+      parameter) or the callee (for a return value).</dd>
  
    <dt><tt><b>signext</b></tt></dt>
    <dd>This indicates to the code generator that the parameter or return value
-      should be sign-extended to a 32-bit value by the caller (for a parameter)
-      or the callee (for a return value).</dd>
+      should be sign-extended to the extent required by the target's ABI (which
+      is usually 32-bits) by the caller (for a parameter) or the callee (for a
+      return value).</dd>
  
    <dt><tt><b>inreg</b></tt></dt>
    <dd>This indicates that this parameter or return value should be treated in a
@@ -1021,8 +1035,9 @@ declare signext i8 @returns_signed_char()
        registers).  Use of this attribute is target-specific.</dd>
  
    <dt><tt><b><a name="byval">byval</a></b></tt></dt>
-  <dd>This indicates that the pointer parameter should really be passed by value
-      to the function.  The attribute implies that a hidden copy of the pointee
+  <dd><p>This indicates that the pointer parameter should really be passed by
+      value to the function.  The attribute implies that a hidden copy of the
+      pointee
        is made between the caller and the callee, so the callee is unable to
        modify the value in the callee.  This attribute is only valid on LLVM
        pointer arguments.  It is generally used to pass structs and arrays by
@@ -1030,10 +1045,13 @@ declare signext i8 @returns_signed_char()
        to belong to the caller not the callee (for example,
        <tt><a href="#readonly">readonly</a></tt> functions should not write to
        <tt>byval</tt> parameters). This is not a valid attribute for return
-      values.  The byval attribute also supports specifying an alignment with
-      the align attribute.  This has a target-specific effect on the code
-      generator that usually indicates a desired alignment for the synthesized
-      stack slot.</dd>
+      values.</p>
+      
+      <p>The byval attribute also supports specifying an alignment with
+      the align attribute.  It indicates the alignment of the stack slot to
+      form and the known alignment of the pointer specified to the call site. If
+      the alignment is not specified, then the code generator makes a
+      target-specific assumption.</p></dd>
  
    <dt><tt><b><a name="sret">sret</a></b></tt></dt>
    <dd>This indicates that the pointer parameter specifies the address of a
@@ -1132,9 +1150,12 @@ define void @f() optsize { ... }
        threshold for this caller.</dd>
  
    <dt><tt><b>hotpatch</b></tt></dt>
-  <dd>This attribute indicates that the prologue should contain a 'hotpatch'
-      sequence at the beginning. This is the same sequence used in the
-      system DLLs in Microsoft Windows XP Service Pack 2 and higher.</dd>
+  <dd>This attribute indicates that the function should be 'hotpatchable',
+      meaning the function can be patched and/or hooked even while it is
+      loaded into memory. On x86, the function prologue will be preceded
+      by six bytes of padding and will begin with a two-byte instruction.
+      Most of the functions in the Windows system DLLs in Windows XP SP2 or
+      higher were compiled in this fashion.</dd>
  
    <dt><tt><b>inlinehint</b></tt></dt>
    <dd>This attribute indicates that the source code contained a hint that inlining
@@ -1489,6 +1510,7 @@ Classifications</a> </div>
        <td><a href="#t_primitive">primitive</a></td>
        <td><a href="#t_label">label</a>,
            <a href="#t_void">void</a>,
+          <a href="#t_integer">integer</a>,
            <a href="#t_floating">floating point</a>,
            <a href="#t_x86mmx">x86mmx</a>,
            <a href="#t_metadata">metadata</a>.</td>
@@ -2159,8 +2181,8 @@ Classifications</a> </div>
  
  <p>The string '<tt>undef</tt>' can be used anywhere a constant is expected, and
     indicates that the user of the value may receive an unspecified bit-pattern.
-   Undefined values may be of any type (other than label or void) and be used
-   anywhere a constant is permitted.</p>
+   Undefined values may be of any type (other than '<tt>label</tt>'
+   or '<tt>void</tt>') and be used anywhere a constant is permitted.</p>
  
  <p>Undefined values are useful because they indicate to the compiler that the
     program is well defined no matter what value is used.  This gives the
@@ -2179,7 +2201,7 @@ Safe:
  </pre>
  
  <p>This is safe because all of the output bits are affected by the undef bits.
-Any output bit can have a zero or one depending on the input bits.</p>
+   Any output bit can have a zero or one depending on the input bits.</p>
  
  <pre class="doc_code">
    %A = or %X, undef
@@ -2193,13 +2215,14 @@ Unsafe:
  </pre>
  
  <p>These logical operations have bits that are not always affected by the input.
-For example, if "%X" has a zero bit, then the output of the 'and' operation will
-always be a zero, no matter what the corresponding bit from the undef is.  As
-such, it is unsafe to optimize or assume that the result of the and is undef.
-However, it is safe to assume that all bits of the undef could be 0, and
-optimize the and to 0.  Likewise, it is safe to assume that all the bits of
-the undef operand to the or could be set, allowing the or to be folded to
--1.</p>
+   For example, if <tt>%X</tt> has a zero bit, then the output of the
+   '<tt>and</tt>' operation will always be a zero for that bit, no matter what
+   the corresponding bit from the '<tt>undef</tt>' is. As such, it is unsafe to
+   optimize or assume that the result of the '<tt>and</tt>' is '<tt>undef</tt>'.
+   However, it is safe to assume that all bits of the '<tt>undef</tt>' could be
+   0, and optimize the '<tt>and</tt>' to 0. Likewise, it is safe to assume that
+   all the bits of the '<tt>undef</tt>' operand to the '<tt>or</tt>' could be
+   set, allowing the '<tt>or</tt>' to be folded to -1.</p>
  
  <pre class="doc_code">
    %A = select undef, %X, %Y
@@ -2215,13 +2238,14 @@ Unsafe:
    %C = undef
  </pre>
  
-<p>This set of examples show that undefined select (and conditional branch)
-conditions can go "either way" but they have to come from one of the two
-operands.  In the %A example, if %X and %Y were both known to have a clear low
-bit, then %A would have to have a cleared low bit.  However, in the %C example,
-the optimizer is allowed to assume that the undef operand could be the same as
-%Y, allowing the whole select to be eliminated.</p>
-
+<p>This set of examples shows that undefined '<tt>select</tt>' (and conditional
+   branch) conditions can go <em>either way</em>, but they have to come from one
+   of the two operands.  In the <tt>%A</tt> example, if <tt>%X</tt> and
+   <tt>%Y</tt> were both known to have a clear low bit, then <tt>%A</tt> would
+   have to have a cleared low bit. However, in the <tt>%C</tt> example, the
+   optimizer is allowed to assume that the '<tt>undef</tt>' operand could be the
+   same as <tt>%Y</tt>, allowing the whole '<tt>select</tt>' to be
+   eliminated.</p>
  
  <pre class="doc_code">
    %A = xor undef, undef
@@ -2242,16 +2266,17 @@ Safe:
    %F = undef
  </pre>
  
-<p>This example points out that two undef operands are not necessarily the same.
-This can be surprising to people (and also matches C semantics) where they
-assume that "X^X" is always zero, even if X is undef.  This isn't true for a
-number of reasons, but the short answer is that an undef "variable" can
-arbitrarily change its value over its "live range".  This is true because the
-"variable" doesn't actually <em>have a live range</em>.  Instead, the value is
-logically read from arbitrary registers that happen to be around when needed,
-so the value is not necessarily consistent over time.  In fact, %A and %C need
-to have the same semantics or the core LLVM "replace all uses with" concept
-would not hold.</p>
+<p>This example points out that two '<tt>undef</tt>' operands are not
+   necessarily the same. This can be surprising to people (and also matches C
+   semantics) where they assume that "<tt>X^X</tt>" is always zero, even
+   if <tt>X</tt> is undefined. This isn't true for a number of reasons, but the
+   short answer is that an '<tt>undef</tt>' "variable" can arbitrarily change
+   its value over its "live range".  This is true because the variable doesn't
+   actually <em>have a live range</em>. Instead, the value is logically read
+   from arbitrary registers that happen to be around when needed, so the value
+   is not necessarily consistent over time. In fact, <tt>%A</tt> and <tt>%C</tt>
+   need to have the same semantics or the core LLVM "replace all uses with"
+   concept would not hold.</p>
  
  <pre class="doc_code">
    %A = fdiv undef, %X
@@ -2262,17 +2287,17 @@ b: unreachable
  </pre>
  
  <p>These examples show the crucial difference between an <em>undefined
-value</em> and <em>undefined behavior</em>.  An undefined value (like undef) is
-allowed to have an arbitrary bit-pattern.  This means that the %A operation
-can be constant folded to undef because the undef could be an SNaN, and fdiv is
-not (currently) defined on SNaN's.  However, in the second example, we can make
-a more aggressive assumption: because the undef is allowed to be an arbitrary
-value, we are allowed to assume that it could be zero.  Since a divide by zero
-has <em>undefined behavior</em>, we are allowed to assume that the operation
-does not execute at all.  This allows us to delete the divide and all code after
-it: since the undefined operation "can't happen", the optimizer can assume that
-it occurs in dead code.
-</p>
+  value</em> and <em>undefined behavior</em>. An undefined value (like
+  '<tt>undef</tt>') is allowed to have an arbitrary bit-pattern. This means that
+  the <tt>%A</tt> operation can be constant folded to '<tt>undef</tt>', because
+  the '<tt>undef</tt>' could be an SNaN, and <tt>fdiv</tt> is not (currently)
+  defined on SNaN's. However, in the second example, we can make a more
+  aggressive assumption: because the <tt>undef</tt> is allowed to be an
+  arbitrary value, we are allowed to assume that it could be zero. Since a
+  divide by zero has <em>undefined behavior</em>, we are allowed to assume that
+  the operation does not execute at all. This allows us to delete the divide and
+  all code after it. Because the undefined operation "can't happen", the
+  optimizer can assume that it occurs in dead code.</p>
  
  <pre class="doc_code">
  a:  store undef -> %X
@@ -2282,11 +2307,11 @@ a: &lt;deleted&gt;
  b: unreachable
  </pre>
  
-<p>These examples reiterate the fdiv example: a store "of" an undefined value
-can be assumed to not have any effect: we can assume that the value is
-overwritten with bits that happen to match what was already there.  However, a
-store "to" an undefined location could clobber arbitrary memory, therefore, it
-has undefined behavior.</p>
+<p>These examples reiterate the <tt>fdiv</tt> example: a store <em>of</em> an
+   undefined value can be assumed to not have any effect; we can assume that the
+   value is overwritten with bits that happen to match what was already there.
+   However, a store <em>to</em> an undefined location could clobber arbitrary
+   memory, therefore, it has undefined behavior.</p>
  
  </div>
  
@@ -2407,18 +2432,17 @@ end:
     the address of the entry block is illegal.</p>
  
  <p>This value only has defined behavior when used as an operand to the
-   '<a href="#i_indirectbr"><tt>indirectbr</tt></a>' instruction or for comparisons
-   against null.  Pointer equality tests between labels addresses is undefined
-   behavior - though, again, comparison against null is ok, and no label is
-   equal to the null pointer.  This may also be passed around as an opaque
-   pointer sized value as long as the bits are not inspected.  This allows
-   <tt>ptrtoint</tt> and arithmetic to be performed on these values so long as
-   the original value is reconstituted before the <tt>indirectbr</tt>.</p>
+   '<a href="#i_indirectbr"><tt>indirectbr</tt></a>' instruction, or for
+   comparisons against null. Pointer equality tests between labels addresses
+   results in undefined behavior &mdash; though, again, comparison against null
+   is ok, and no label is equal to the null pointer. This may be passed around
+   as an opaque pointer sized value as long as the bits are not inspected. This
+   allows <tt>ptrtoint</tt> and arithmetic to be performed on these values so
+   long as the original value is reconstituted before the <tt>indirectbr</tt>
+   instruction.</p>
  
-<p>Finally, some targets may provide defined semantics when
-   using the value as the operand to an inline assembly, but that is target
-   specific.
-   </p>
+<p>Finally, some targets may provide defined semantics when using the value as
+   the operand to an inline assembly, but that is target specific.</p>
  
  </div>
  
@@ -2433,7 +2457,7 @@ end:
     to be used as constants.  Constant expressions may be of
     any <a href="#t_firstclass">first class</a> type and may involve any LLVM
     operation that does not have side effects (e.g. load and call are not
-   supported).  The following is the syntax for constant expressions:</p>
+   supported). The following is the syntax for constant expressions:</p>
  
  <dl>
    <dt><b><tt>trunc (CST to TYPE)</tt></b></dt>
@@ -2620,8 +2644,8 @@ call void asm alignstack "eieio", ""()
  <div class="doc_text">
  
  <p>The call instructions that wrap inline asm nodes may have a "!srcloc" MDNode
-   attached to it that contains a constant integer.  If present, the code
-   generator will use the integer as the location cookie value when report
+   attached to it that contains a list of constant integers.  If present, the
+  code generator will use the integer as the location cookie value when report
     errors through the LLVMContext error reporting mechanisms.  This allows a
     front-end to correlate backend errors that occur with inline asm back to the
     source code that produced it.  For example:</p>
@@ -2633,7 +2657,8 @@ call void asm sideeffect "something bad", ""()<b>, !srcloc !42</b>
  </pre>
  
  <p>It is up to the front-end to make sense of the magic numbers it places in the
-   IR.</p>
+   IR.  If the MDNode contains multiple constants, the code generator will use
+   the one that corresponds to the line of the asm that the error occurs on.</p>
  
  </div>
  
@@ -2667,16 +2692,21 @@ call void asm sideeffect "something bad", ""()<b>, !srcloc !42</b>
  <p>Metadata can be used as function arguments. Here <tt>llvm.dbg.value</tt> 
     function is using two metadata arguments.</p>
  
-     <pre class="doc_code">
-       call void @llvm.dbg.value(metadata !24, i64 0, metadata !25)
-     </pre>
+<div class="doc_code">
+<pre>
+call void @llvm.dbg.value(metadata !24, i64 0, metadata !25)
+</pre>
+</div>
  
  <p>Metadata can be attached with an instruction. Here metadata <tt>!21</tt> is
     attached with <tt>add</tt> instruction using <tt>!dbg</tt> identifier.</p>
  
-    <pre class="doc_code">
-      %indvar.next = add i64 %indvar, 1, !dbg !21
-    </pre>
+<div class="doc_code">
+<pre>
+%indvar.next = add i64 %indvar, 1, !dbg !21
+</pre>
+</div>
+
  </div>
  
  
@@ -3418,7 +3448,8 @@ Instruction</a> </div>
  
  <h5>Syntax:</h5>
  <pre>
-  &lt;result&gt; = udiv &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;   <i>; yields {ty}:result</i>
+  &lt;result&gt; = udiv &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;         <i>; yields {ty}:result</i>
+  &lt;result&gt; = udiv exact &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;   <i>; yields {ty}:result</i>
  </pre>
  
  <h5>Overview:</h5>
@@ -3437,6 +3468,11 @@ Instruction</a> </div>
  
  <p>Division by zero leads to undefined behavior.</p>
  
+<p>If the <tt>exact</tt> keyword is present, the result value of the
+   <tt>udiv</tt> is a <a href="#trapvalues">trap value</a> if %op1 is not a
+  multiple of %op2 (as such, "((a udiv exact b) mul b) == a").</p>
+
+
  <h5>Example:</h5>
  <pre>
    &lt;result&gt; = udiv i32 4, %var          <i>; yields {i32}:result = 4 / %var</i>
@@ -3577,9 +3613,10 @@ Instruction</a> </div>
  
  <h5>Semantics:</h5>
  <p>This instruction returns the <i>remainder</i> of a division (where the result
-   has the same sign as the dividend, <tt>op1</tt>), not the <i>modulo</i>
-   operator (where the result has the same sign as the divisor, <tt>op2</tt>) of
-   a value.  For more information about the difference,
+   is either zero or has the same sign as the dividend, <tt>op1</tt>), not the
+   <i>modulo</i> operator (where the result is either zero or has the same sign
+   as the divisor, <tt>op2</tt>) of a value.
+   For more information about the difference,
     see <a href="http://mathforum.org/dr.math/problems/anne.4.28.99.html">The
     Math Forum</a>. For a table of how this is implemented in various languages,
     please see <a href="http://en.wikipedia.org/wiki/Modulo_operation">
@@ -3655,7 +3692,10 @@ Instruction</a> </div>
  
  <h5>Syntax:</h5>
  <pre>
-  &lt;result&gt; = shl &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;   <i>; yields {ty}:result</i>
+  &lt;result&gt; = shl &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;           <i>; yields {ty}:result</i>
+  &lt;result&gt; = shl nuw &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;       <i>; yields {ty}:result</i>
+  &lt;result&gt; = shl nsw &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;       <i>; yields {ty}:result</i>
+  &lt;result&gt; = shl nuw nsw &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;   <i>; yields {ty}:result</i>
  </pre>
  
  <h5>Overview:</h5>
@@ -3675,6 +3715,14 @@ Instruction</a> </div>
     vectors, each vector element of <tt>op1</tt> is shifted by the corresponding
     shift amount in <tt>op2</tt>.</p>
  
+<p>If the <tt>nuw</tt> keyword is present, then the shift produces a 
+   <a href="#trapvalues">trap value</a> if it shifts out any non-zero bits.  If
+   the <tt>nsw</tt> keyword is present, then the shift produces a
+   <a href="#trapvalues">trap value</a> if it shifts out any bits that disagree
+   with the resultant sign bit.  As such, NUW/NSW have the same semantics as
+   they would if the shift were expressed as a mul instruction with the same
+   nsw/nuw bits in (mul %op1, (shl 1, %op2)).</p>
+
  <h5>Example:</h5>
  <pre>
    &lt;result&gt; = shl i32 4, %var   <i>; yields {i32}: 4 &lt;&lt; %var</i>
@@ -3694,7 +3742,8 @@ Instruction</a> </div>
  
  <h5>Syntax:</h5>
  <pre>
-  &lt;result&gt; = lshr &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;   <i>; yields {ty}:result</i>
+  &lt;result&gt; = lshr &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;         <i>; yields {ty}:result</i>
+  &lt;result&gt; = lshr exact &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;   <i>; yields {ty}:result</i>
  </pre>
  
  <h5>Overview:</h5>
@@ -3714,6 +3763,11 @@ Instruction</a> </div>
     vectors, each vector element of <tt>op1</tt> is shifted by the corresponding
     shift amount in <tt>op2</tt>.</p>
  
+<p>If the <tt>exact</tt> keyword is present, the result value of the
+   <tt>lshr</tt> is a <a href="#trapvalues">trap value</a> if any of the bits
+   shifted out are non-zero.</p>
+
+
  <h5>Example:</h5>
  <pre>
    &lt;result&gt; = lshr i32 4, 1   <i>; yields {i32}:result = 2</i>
@@ -3733,7 +3787,8 @@ Instruction</a> </div>
  
  <h5>Syntax:</h5>
  <pre>
-  &lt;result&gt; = ashr &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;   <i>; yields {ty}:result</i>
+  &lt;result&gt; = ashr &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;         <i>; yields {ty}:result</i>
+  &lt;result&gt; = ashr exact &lt;ty&gt; &lt;op1&gt;, &lt;op2&gt;   <i>; yields {ty}:result</i>
  </pre>
  
  <h5>Overview:</h5>
@@ -3754,6 +3809,10 @@ Instruction</a> </div>
     the arguments are vectors, each vector element of <tt>op1</tt> is shifted by
     the corresponding shift amount in <tt>op2</tt>.</p>
  
+<p>If the <tt>exact</tt> keyword is present, the result value of the
+   <tt>ashr</tt> is a <a href="#trapvalues">trap value</a> if any of the bits
+   shifted out are non-zero.</p>
+
  <h5>Example:</h5>
  <pre>
    &lt;result&gt; = ashr i32 4, 1   <i>; yields {i32}:result = 2</i>
@@ -4121,6 +4180,14 @@ Instruction</a> </div>
     <a href="#t_array">array</a> type.  The operands are constant indices to
     specify which value to extract in a similar manner as indices in a
     '<tt><a href="#i_getelementptr">getelementptr</a></tt>' instruction.</p>
+   <p>The major differences to <tt>getelementptr</tt> indexing are:</p>
+     <ul>
+       <li>Since the value being indexed is not a pointer, the first index is
+           omitted and assumed to be zero.</li>
+       <li>At least one index must be specified.</li>
+       <li>Not only struct indices but also array indices must be in
+           bounds.</li>
+     </ul>
  
  <h5>Semantics:</h5>
  <p>The result is the value at the position in the aggregate specified by the
@@ -4155,7 +4222,7 @@ Instruction</a> </div>
     <a href="#t_array">array</a> type.  The second operand is a first-class
     value to insert.  The following operands are constant indices indicating
     the position at which to insert the value in a similar manner as indices in a
-   '<tt><a href="#i_getelementptr">getelementptr</a></tt>' instruction.  The
+   '<tt><a href="#i_extractvalue">extractvalue</a></tt>' instruction.  The
     value to insert must have the same type as the value identified by the
     indices.</p>
  
@@ -4516,12 +4583,12 @@ entry:
     type <tt>ty2</tt>.</p>
  
  <h5>Arguments:</h5>
-<p>The '<tt>trunc</tt>' instruction takes a <tt>value</tt> to trunc, which must
-   be an <a href="#t_integer">integer</a> type, and a type that specifies the
-   size and type of the result, which must be
-   an <a href="#t_integer">integer</a> type. The bit size of <tt>value</tt> must
-   be larger than the bit size of <tt>ty2</tt>. Equal sized types are not
-   allowed.</p>
+<p>The '<tt>trunc</tt>' instruction takes a value to trunc, and a type to trunc it to.
+   Both types must be of <a href="#t_integer">integer</a> types, or vectors
+   of the same number of integers.
+   The bit size of the <tt>value</tt> must be larger than
+   the bit size of the destination type, <tt>ty2</tt>.
+   Equal sized types are not allowed.</p>
  
  <h5>Semantics:</h5>
  <p>The '<tt>trunc</tt>' instruction truncates the high order bits
@@ -4531,9 +4598,10 @@ entry:
  
  <h5>Example:</h5>
  <pre>
-  %X = trunc i32 257 to i8              <i>; yields i8:1</i>
-  %Y = trunc i32 123 to i1              <i>; yields i1:true</i>
-  %Z = trunc i32 122 to i1              <i>; yields i1:false</i>
+  %X = trunc i32 257 to i8                        <i>; yields i8:1</i>
+  %Y = trunc i32 123 to i1                        <i>; yields i1:true</i>
+  %Z = trunc i32 122 to i1                        <i>; yields i1:false</i>
+  %W = trunc &lt;2 x i16&gt; &lt;i16 8, i16 7&gt; to &lt;2 x i8&gt; <i>; yields &lt;i8 8, i8 7&gt;</i>
  </pre>
  
  </div>
@@ -4555,10 +4623,11 @@ entry:
  
  
  <h5>Arguments:</h5>
-<p>The '<tt>zext</tt>' instruction takes a value to cast, which must be of
-   <a href="#t_integer">integer</a> type, and a type to cast it to, which must
-   also be of <a href="#t_integer">integer</a> type. The bit size of the
-   <tt>value</tt> must be smaller than the bit size of the destination type,
+<p>The '<tt>zext</tt>' instruction takes a value to cast, and a type to cast it to.
+   Both types must be of <a href="#t_integer">integer</a> types, or vectors
+   of the same number of integers.
+   The bit size of the <tt>value</tt> must be smaller than
+   the bit size of the destination type,
     <tt>ty2</tt>.</p>
  
  <h5>Semantics:</h5>
@@ -4571,6 +4640,7 @@ entry:
  <pre>
    %X = zext i32 257 to i64              <i>; yields i64:257</i>
    %Y = zext i1 true to i32              <i>; yields i32:1</i>
+  %Z = zext &lt;2 x i16&gt; &lt;i16 8, i16 7&gt; to &lt;2 x i32&gt; <i>; yields &lt;i32 8, i32 7&gt;</i>
  </pre>
  
  </div>
@@ -4590,10 +4660,11 @@ entry:
  <p>The '<tt>sext</tt>' sign extends <tt>value</tt> to the type <tt>ty2</tt>.</p>
  
  <h5>Arguments:</h5>
-<p>The '<tt>sext</tt>' instruction takes a value to cast, which must be of
-   <a href="#t_integer">integer</a> type, and a type to cast it to, which must
-   also be of <a href="#t_integer">integer</a> type.  The bit size of the
-   <tt>value</tt> must be smaller than the bit size of the destination type,
+<p>The '<tt>sext</tt>' instruction takes a value to cast, and a type to cast it to.
+   Both types must be of <a href="#t_integer">integer</a> types, or vectors
+   of the same number of integers.
+   The bit size of the <tt>value</tt> must be smaller than
+   the bit size of the destination type,
     <tt>ty2</tt>.</p>
  
  <h5>Semantics:</h5>
@@ -4607,6 +4678,7 @@ entry:
  <pre>
    %X = sext i8  -1 to i16              <i>; yields i16   :65535</i>
    %Y = sext i1 true to i32             <i>; yields i32:-1</i>
+  %Z = sext &lt;2 x i16&gt; &lt;i16 8, i16 7&gt; to &lt;2 x i32&gt; <i>; yields &lt;i32 8, i32 7&gt;</i>
  </pre>
  
  </div>
@@ -4679,8 +4751,8 @@ entry:
  
  <h5>Example:</h5>
  <pre>
-  %X = fpext float 3.1415 to double        <i>; yields double:3.1415</i>
-  %Y = fpext float 1.0 to float            <i>; yields float:1.0 (no-op)</i>
+  %X = fpext float 3.125 to double         <i>; yields double:3.125000e+00</i>
+  %Y = fpext double %X to fp128            <i>; yields fp128:0xL00000000000000004000900000000000</i>
  </pre>
  
  </div>
@@ -7496,7 +7568,7 @@ LLVM</a>.</p>
  
  <h5>Syntax:</h5>
  <pre>
-  declare {}* @llvm.invariant.start(i64 &lt;size&gt;, i8* nocapture &lt;ptr&gt;) readonly
+  declare {}* @llvm.invariant.start(i64 &lt;size&gt;, i8* nocapture &lt;ptr&gt;)
  </pre>
  
  <h5>Overview:</h5>
@@ -7671,7 +7743,7 @@ LLVM</a>.</p>
     the <tt>AllocaInst</tt> stack slot to be before local variables on the
     stack. This is to ensure that if a local variable on the stack is
     overwritten, it will destroy the value of the guard. When the function exits,
-   the guard on the stack is checked against the original guard. If they're
+   the guard on the stack is checked against the original guard. If they are
     different, then the program aborts by calling the <tt>__stack_chk_fail()</tt>
     function.</p>
  
@@ -7691,25 +7763,24 @@ LLVM</a>.</p>
  </pre>
  
  <h5>Overview:</h5>
-<p>The <tt>llvm.objectsize</tt> intrinsic is designed to provide information
-   to the optimizers to discover at compile time either a) when an
-   operation like memcpy will either overflow a buffer that corresponds to
-   an object, or b) to determine that a runtime check for overflow isn't
-   necessary. An object in this context means an allocation of a
-   specific class, structure, array, or other object.</p>
+<p>The <tt>llvm.objectsize</tt> intrinsic is designed to provide information to
+   the optimizers to determine at compile time whether a) an operation (like
+   memcpy) will overflow a buffer that corresponds to an object, or b) that a
+   runtime check for overflow isn't necessary. An object in this context means
+   an allocation of a specific class, structure, array, or other object.</p>
  
  <h5>Arguments:</h5>
-<p>The <tt>llvm.objectsize</tt> intrinsic takes two arguments.  The first
+<p>The <tt>llvm.objectsize</tt> intrinsic takes two arguments. The first
     argument is a pointer to or into the <tt>object</tt>. The second argument
-   is a boolean 0 or 1.  This argument determines whether you want the 
-   maximum (0) or minimum (1) bytes remaining.  This needs to be a literal 0 or
+   is a boolean 0 or 1. This argument determines whether you want the 
+   maximum (0) or minimum (1) bytes remaining. This needs to be a literal 0 or
     1, variables are not allowed.</p>
     
  <h5>Semantics:</h5>
  <p>The <tt>llvm.objectsize</tt> intrinsic is lowered to either a constant
-   representing the size of the object concerned or <tt>i32/i64 -1 or 0</tt>
-   (depending on the <tt>type</tt> argument if the size cannot be determined
-   at compile time.</p>
+   representing the size of the object concerned, or <tt>i32/i64 -1 or 0</tt>,
+   depending on the <tt>type</tt> argument, if the size cannot be determined at
+   compile time.</p>
  
  </div>
  
@@ -7722,7 +7793,7 @@ LLVM</a>.</p>
    src="http://www.w3.org/Icons/valid-html401-blue" alt="Valid HTML 4.01"></a>
  
    <a href="mailto:sabre@nondot.org">Chris Lattner</a><br>
-  <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+  <a href="http://llvm.org/">The LLVM Compiler Infrastructure</a><br>
    Last modified: $Date$
  </address>