X-Git-Url: http://plrg.eecs.uci.edu/git/?a=blobdiff_plain;f=docs%2FLangRef.html;h=a781992f89826c027120e325a3d3ad6ac8e9a799;hb=908a831a9a1fb043bc4758d6712d78255099ae51;hp=bf834c211a07bfc82dadd5f0865cad3d4628cfdf;hpb=e865c035fd5db0daee1807b2c5b6d79c9dd4062b;p=oota-llvm.git diff --git a/docs/LangRef.html b/docs/LangRef.html index bf834c211a0..a781992f898 100644 --- a/docs/LangRef.html +++ b/docs/LangRef.html @@ -7,7 +7,7 @@ - + @@ -35,7 +35,7 @@
  • 'extern_weak' Linkage
  • 'linkonce_odr' Linkage
  • 'weak_odr' Linkage
  • -
  • 'externally visible' Linkage
  • +
  • 'external' Linkage
  • 'dllimport' Linkage
  • 'dllexport' Linkage
  • @@ -92,7 +92,7 @@
  • Complex Constants
  • Global Variable and Function Addresses
  • Undefined Values
  • -
  • Trap Values
  • +
  • Poison Values
  • Addresses of Basic Blocks
  • Constant Expressions
  • @@ -100,7 +100,18 @@
  • Other Values
    1. Inline Assembler Expressions
    2. -
    3. Metadata Nodes and Metadata Strings
    4. +
    5. Metadata Nodes and Metadata Strings +
        +
      1. 'tbaa' Metadata
      2. +
      3. 'fpmath' Metadata
      4. +
      5. 'range' Metadata
      6. +
      +
    6. +
    +
  • +
  • Module Flags Metadata +
      +
    1. Objective-C Garbage Collection Module Flags Metadata
  • Intrinsic Global Variables @@ -123,7 +134,6 @@
  • 'switch' Instruction
  • 'indirectbr' Instruction
  • 'invoke' Instruction
  • -
  • 'unwind' Instruction
  • 'resume' Instruction
  • 'unreachable' Instruction
  • @@ -275,34 +285,18 @@
  • Debugger intrinsics
  • Exception Handling intrinsics
  • -
  • Trampoline Intrinsic +
  • Trampoline Intrinsics
    1. 'llvm.init.trampoline' Intrinsic
    2. -
    -
  • -
  • Atomic intrinsics -
      -
    1. llvm.memory_barrier
    2. -
    3. llvm.atomic.cmp.swap
    4. -
    5. llvm.atomic.swap
    6. -
    7. llvm.atomic.load.add
    8. -
    9. llvm.atomic.load.sub
    10. -
    11. llvm.atomic.load.and
    12. -
    13. llvm.atomic.load.nand
    14. -
    15. llvm.atomic.load.or
    16. -
    17. llvm.atomic.load.xor
    18. -
    19. llvm.atomic.load.max
    20. -
    21. llvm.atomic.load.min
    22. -
    23. llvm.atomic.load.umax
    24. -
    25. llvm.atomic.load.umin
    26. +
    27. 'llvm.adjust.trampoline' Intrinsic
  • Memory Use Markers
      -
    1. llvm.lifetime.start
    2. -
    3. llvm.lifetime.end
    4. -
    5. llvm.invariant.start
    6. -
    7. llvm.invariant.end
    8. +
    9. 'llvm.lifetime.start' Intrinsic
    10. +
    11. 'llvm.lifetime.end' Intrinsic
    12. +
    13. 'llvm.invariant.start' Intrinsic
    14. +
    15. 'llvm.invariant.end' Intrinsic
  • General intrinsics @@ -313,10 +307,14 @@ 'llvm.annotation.*' Intrinsic
  • 'llvm.trap' Intrinsic
  • +
  • + 'llvm.debugtrap' Intrinsic
  • 'llvm.stackprotector' Intrinsic
  • 'llvm.objectsize' Intrinsic
  • +
  • + 'llvm.expect' Intrinsic
  • @@ -495,43 +493,43 @@
    -

    LLVM programs are composed of "Module"s, each of which is a translation unit - of the input programs. Each module consists of functions, global variables, - and symbol table entries. Modules may be combined together with the LLVM - linker, which merges function (and global variable) definitions, resolves - forward declarations, and merges symbol table entries. Here is an example of - the "hello world" module:

    +

    LLVM programs are composed of Modules, each of which is a + translation unit of the input programs. Each module consists of functions, + global variables, and symbol table entries. Modules may be combined together + with the LLVM linker, which merges function (and global variable) + definitions, resolves forward declarations, and merges symbol table + entries. Here is an example of the "hello world" module:

     ; Declare the string constant as a global constant. 
    -@.LC0 = internal constant [13 x i8] c"hello world\0A\00"      ; [13 x i8]* 
    +@.str = private unnamed_addr constant [13 x i8] c"hello world\0A\00" 
     
     ; External declaration of the puts function 
    -declare i32 @puts(i8*)                                      ; i32 (i8*)*  
    +declare i32 @puts(i8* nocapture) nounwind 
     
     ; Definition of main function
     define i32 @main() {   ; i32()*  
       ; Convert [13 x i8]* to i8  *... 
    -  %cast210 = getelementptr [13 x i8]* @.LC0, i64 0, i64 0   ; i8* 
    +  %cast210 = getelementptr [13 x i8]* @.str, i64 0, i64 0
     
       ; Call puts function to write out the string to stdout. 
    -  call i32 @puts(i8* %cast210)           ; i32 
    +  call i32 @puts(i8* %cast210)
       ret i32 0 
     }
     
     ; Named metadata
    -!1 = metadata !{i32 41}
    +!1 = metadata !{i32 42}
     !foo = !{!1, null}
     

    This example is made up of a global variable named - ".LC0", an external declaration of the "puts" function, + ".str", an external declaration of the "puts" function, a function definition for "main" and named metadata - "foo".

    + "foo".

    -

    In general, a module is made up of a list of global values, where both - functions and global variables are global values. Global values are +

    In general, a module is made up of a list of global values (where both + functions and global variables are global values). Global values are represented by a pointer to a memory location (in this case, a pointer to an array of char, and a pointer to a function), and have one of the following linkage types.

    @@ -646,7 +644,7 @@ define i32 @main() { ; i32()*   be merged with equivalent globals. These linkage types are otherwise the same as their non-odr versions. -
    externally visible:
    +
    external
    If none of the above identifiers are used, the global is externally visible, meaning that it participates in linkage and can be used to resolve external symbol references.
    @@ -679,8 +677,8 @@ define i32 @main() { ; i32()*   declarations), they are accessible outside of the current module.

    It is illegal for a function declaration to have any linkage type - other than "externally visible", dllimport - or extern_weak.

    + other than external, dllimport + or extern_weak.

    Aliases can have only external, internal, weak or weak_odr linkages.

    @@ -943,7 +941,7 @@ define i32 @main() { ; i32()*   alignments must be a power of 2.

    If the unnamed_addr attribute is given, the address is know to not - be significant and two identical functions can be merged

    . + be significant and two identical functions can be merged.

    Syntax:
    @@ -1152,6 +1150,10 @@ define void @f() optsize { ... }
     
    +
    address_safety
    +
    This attribute indicates that the address safety analysis + is enabled for this function.
    +
    alignstack(<n>)
    This attribute indicates that, when emitting the prologue and epilogue, the backend should forcibly align the stack pointer. Specify the @@ -1162,14 +1164,6 @@ define void @f() optsize { ... } function into callers whenever possible, ignoring any active inlining size threshold for this caller.
    -
    hotpatch
    -
    This attribute indicates that the function should be 'hotpatchable', - meaning the function can be patched and/or hooked even while it is - loaded into memory. On x86, the function prologue will be preceded - by six bytes of padding and will begin with a two-byte instruction. - Most of the functions in the Windows system DLLs in Windows XP SP2 or - higher were compiled in this fashion.
    -
    nonlazybind
    This attribute suppresses lazy symbol binding for the function. This may make calls to the function faster, at the cost of extra program @@ -1219,8 +1213,7 @@ define void @f() optsize { ... } It does not write through any pointer arguments (including byval arguments) and never changes any state visible to callers. This means that it cannot unwind - exceptions by calling the C++ exception throwing methods, but - could use the unwind instruction.
    + exceptions by calling the C++ exception throwing methods.
    readonly
    This attribute indicates that the function does not write through any @@ -1230,8 +1223,13 @@ define void @f() optsize { ... } and read state that may be set in the caller. A readonly function always returns the same value (or unwinds an exception identically) when called with the same set of arguments and global state. It cannot unwind an - exception by calling the C++ exception throwing methods, but may - use the unwind instruction.
    + exception by calling the C++ exception throwing methods. + +
    returns_twice
    +
    This attribute indicates that this function can return twice. The + C setjmp is an example of such a function. The compiler + disables some optimizations (like tail calls) in the caller of these + functions.
    ssp
    This attribute indicates that the function should emit a stack smashing @@ -1260,7 +1258,6 @@ define void @f() optsize { ... } show that no exceptions passes by it. This is normally the case for the ELF x86-64 abi, but it can be disabled for some compilation units.
    -
    @@ -1321,6 +1318,13 @@ target datalayout = "layout specification" the bits with the least significance have the lowest address location. +
    Ssize
    +
    Specifies the natural alignment of the stack in bits. Alignment promotion + of stack variables is limited to the natural stack alignment to avoid + dynamic stack realignment. The stack alignment must be a multiple of + 8-bits. If omitted, the natural stack alignment defaults to "unspecified", + which does not prevent any alignment promotions.
    +
    p:size:abi:pref
    This specifies the size of a pointer and its abi and preferred alignments. All sizes are in bits. Specifying @@ -1401,6 +1405,22 @@ target datalayout = "layout specification" implemented in terms of 64 <2 x double>, for example. +

    The function of the data layout string may not be what you expect. Notably, + this is not a specification from the frontend of what alignment the code + generator should use.

    + +

    Instead, if specified, the target data layout is required to match what the + ultimate code generator expects. This string is used by the + mid-level optimizers to + improve code, and this only works if it matches what the ultimate code + generator uses. If you would like to generate IR that does not embed this + target-specific detail into the IR, then you don't have to specify the + string. This will disable some optimizations that require precise layout + information, but this also prevents those optimizations from introducing + target specificity into the IR.

    + + + @@ -1497,6 +1517,9 @@ or to register signal handlers. Nonetheless, there are platform-specific ways to create them, and we define LLVM IR's behavior in their presence. This model is inspired by the C++0x memory model.

    +

    For a more informal introduction to this model, see the +LLVM Atomic Instructions and Concurrency Guide. +

    We define a happens-before partial order as the least partial order that

    acquire
    -
    In addition to the guarantees of monotonic, if this operation -reads a value written by a release atomic operation, it -synchronizes-with that operation.
    -
    release
    In addition to the guarantees of monotonic, -a synchronizes-with edge may be formed by an acquire -operation.
    +a synchronizes-with edge may be formed with a release +operation. This is intended to model C++'s memory_order_acquire. +
    release
    +
    In addition to the guarantees of monotonic, if this operation +writes a value which is subsequently read by an acquire operation, +it synchronizes-with that operation. (This isn't a complete +description; see the C++0x definition of a release sequence.) This corresponds +to the C++0x/C1x memory_order_release.
    acq_rel (acquire+release)
    Acts as both an -acquire and release operation on its address.
    +acquire and release operation on its address. +This corresponds to the C++0x/C1x memory_order_acq_rel.
    seq_cst (sequentially consistent)
    In addition to the guarantees of acq_rel (acquire for an operation which only reads, release @@ -1637,9 +1673,8 @@ for an operation which only writes), there is a global total order on all sequentially-consistent operations on all addresses, which is consistent with the happens-before partial order and with the modification orders of all the affected addresses. Each sequentially-consistent read sees the last -preceding write to the same address in this global order. This is intended -to model C++'s sequentially-consistent atomic variables and Java's volatile -shared variables.
    +preceding write to the same address in this global order. This corresponds +to the C++0x/C1x memory_order_seq_cst and Java volatile.

    If an atomic operation is marked singlethread, @@ -1683,7 +1718,7 @@ in signal handlers).

    floating point - float, double, x86_fp80, fp128, ppc_fp128 + half, float, double, x86_fp80, fp128, ppc_fp128 first class @@ -1783,6 +1818,7 @@ in signal handlers).

    + @@ -1879,9 +1915,6 @@ in signal handlers).

    possible to have a two dimensional array, using an array as the element type of another array.

    - - -

    Aggregate Types @@ -1890,9 +1923,9 @@ in signal handlers).

    Aggregate Types are a subset of derived types that can contain multiple - member types. Arrays, - structs, and vectors are - aggregate types.

    + member types. Arrays and + structs are aggregate types. + Vectors are not considered to be aggregate types.

    @@ -2033,7 +2066,7 @@ in signal handlers).

    alignment of the struct is one byte, and that there is no padding between the elements. In non-packed structs, padding between field types is inserted as defined by the TargetData string in the module, which is required to match - what the underlying processor expects.

    + what the underlying code generator expects.

    Structures can either be "literal" or "identified". A literal structure is defined inline with other types (e.g. {i32, i32}*) whereas identified @@ -2166,8 +2199,8 @@ in signal handlers).

    The number of elements is a constant integer value larger than 0; elementtype - may be any integer or floating point type. Vectors of size zero are not - allowed, and pointers are not allowed as the element type.

    + may be any integer or floating point type, or a pointer to these types. + Vectors of size zero are not allowed.

    Examples:
    TypeDescription
    half16-bit floating point value
    float32-bit floating point value
    double64-bit floating point value
    fp128128-bit floating point value (112-bit mantissa)
    @@ -2183,12 +2216,18 @@ in signal handlers).

    + + + +
    <2 x i64> Vector of 2 64-bit integer values.
    <4 x i64*>Vector of 4 pointers to 64-bit integer values.
    + +

    Constants

    @@ -2239,18 +2278,20 @@ in signal handlers).

    represented in their IEEE hexadecimal format so that assembly and disassembly do not cause any bits to change in the constants.

    -

    When using the hexadecimal form, constants of types float and double are +

    When using the hexadecimal form, constants of types half, float, and double are represented using the 16-digit form shown above (which matches the IEEE754 - representation for double); float values must, however, be exactly - representable as IEE754 single precision. Hexadecimal format is always used + representation for double); half and float values must, however, be exactly + representable as IEE754 half and single precision, respectively. + Hexadecimal format is always used for long double, and there are three forms of long double. The 80-bit format used by x86 is represented as 0xK followed by 20 hexadecimal digits. The 128-bit format used by PowerPC (two adjacent doubles) is represented by 0xM followed by 32 hexadecimal digits. The IEEE 128-bit format is represented by 0xL followed by 32 hexadecimal digits; no currently supported target uses this format. Long doubles will only work if - they match the long double format on your target. All hexadecimal formats - are big-endian (sign bit at the left).

    + they match the long double format on your target. The IEEE 16-bit format + (half precision) is represented by 0xH followed by 4 hexadecimal + digits. All hexadecimal formats are big-endian (sign bit at the left).

    There are no constants of type x86mmx.

    @@ -2477,22 +2518,21 @@ b: unreachable

    - Trap Values + Poison Values

    -

    Trap values are similar to undef values, however - instead of representing an unspecified bit pattern, they represent the - fact that an instruction or constant expression which cannot evoke side - effects has nevertheless detected a condition which results in undefined - behavior.

    +

    Poison values are similar to undef values, however + they also represent the fact that an instruction or constant expression which + cannot evoke side effects has nevertheless detected a condition which results + in undefined behavior.

    -

    There is currently no way of representing a trap value in the IR; they +

    There is currently no way of representing a poison value in the IR; they only exist when produced by operations such as add with the nsw flag.

    -

    Trap value behavior is defined in terms of value dependence:

    +

    Poison value behavior is defined in terms of value dependence:

    -

    Whenever a trap value is generated, all values which depend on it evaluate - to trap. If they have side effects, the evoke their side effects as if each - operand with a trap value were undef. If they have externally-visible side - effects, the behavior is undefined.

    +

    Poison Values have the same behavior as undef values, + with the additional affect that any instruction which has a dependence + on a poison value has undefined behavior.

    Here are some examples:

     entry:
    -  %trap = sub nuw i32 0, 1           ; Results in a trap value.
    -  %still_trap = and i32 %trap, 0     ; Whereas (and i32 undef, 0) would return 0.
    -  %trap_yet_again = getelementptr i32* @h, i32 %still_trap
    -  store i32 0, i32* %trap_yet_again  ; undefined behavior
    +  %poison = sub nuw i32 0, 1           ; Results in a poison value.
    +  %still_poison = and i32 %poison, 0   ; 0, but also poison.
    +  %poison_yet_again = getelementptr i32* @h, i32 %still_poison
    +  store i32 0, i32* %poison_yet_again  ; memory at @h[0] is poisoned
     
    -  store i32 %trap, i32* @g           ; Trap value conceptually stored to memory.
    -  %trap2 = load i32* @g              ; Returns a trap value, not just undef.
    +  store i32 %poison, i32* @g           ; Poison value stored to memory.
    +  %poison2 = load i32* @g              ; Poison value loaded back from memory.
     
    -  volatile store i32 %trap, i32* @g  ; External observation; undefined behavior.
    +  store volatile i32 %poison, i32* @g  ; External observation; undefined behavior.
     
       %narrowaddr = bitcast i32* @g to i16*
       %wideaddr = bitcast i32* @g to i64*
    -  %trap3 = load i16* %narrowaddr     ; Returns a trap value.
    -  %trap4 = load i64* %wideaddr       ; Returns a trap value.
    +  %poison3 = load i16* %narrowaddr     ; Returns a poison value.
    +  %poison4 = load i64* %wideaddr       ; Returns a poison value.
     
    -  %cmp = icmp slt i32 %trap, 0       ; Returns a trap value.
    -  br i1 %cmp, label %true, label %end ; Branch to either destination.
    +  %cmp = icmp slt i32 %poison, 0       ; Returns a poison value.
    +  br i1 %cmp, label %true, label %end  ; Branch to either destination.
     
     true:
    -  volatile store i32 0, i32* @g      ; This is control-dependent on %cmp, so
    -                                     ; it has undefined behavior.
    +  store volatile i32 0, i32* @g        ; This is control-dependent on %cmp, so
    +                                       ; it has undefined behavior.
       br label %end
     
     end:
       %p = phi i32 [ 0, %entry ], [ 1, %true ]
    -                                     ; Both edges into this PHI are
    -                                     ; control-dependent on %cmp, so this
    -                                     ; always results in a trap value.
    +                                       ; Both edges into this PHI are
    +                                       ; control-dependent on %cmp, so this
    +                                       ; always results in a poison value.
     
    -  volatile store i32 0, i32* @g      ; This would depend on the store in %true
    -                                     ; if %cmp is true, or the store in %entry
    -                                     ; otherwise, so this is undefined behavior.
    +  store volatile i32 0, i32* @g        ; This would depend on the store in %true
    +                                       ; if %cmp is true, or the store in %entry
    +                                       ; otherwise, so this is undefined behavior.
     
       br i1 %cmp, label %second_true, label %second_end
    -                                     ; The same branch again, but this time the
    -                                     ; true block doesn't have side effects.
    +                                       ; The same branch again, but this time the
    +                                       ; true block doesn't have side effects.
     
     second_true:
       ; No side effects!
       ret void
     
     second_end:
    -  volatile store i32 0, i32* @g      ; This time, the instruction always depends
    -                                     ; on the store in %end. Also, it is
    -                                     ; control-equivalent to %end, so this is
    -                                     ; well-defined (again, ignoring earlier
    -                                     ; undefined behavior in this example).
    +  store volatile i32 0, i32* @g        ; This time, the instruction always depends
    +                                       ; on the store in %end. Also, it is
    +                                       ; control-equivalent to %end, so this is
    +                                       ; well-defined (ignoring earlier undefined
    +                                       ; behavior in this example).
     
    @@ -2777,7 +2816,7 @@ second_end:

    LLVM supports inline assembler expressions (as opposed - to Module-Level Inline Assembly) through the use of + to Module-Level Inline Assembly) through the use of a special value. This value represents the inline assembler as a string (containing the instructions to emit), a list of operand constraints (stored as a string), a flag that indicates whether or not the inline asm @@ -2819,23 +2858,27 @@ call void asm alignstack "eieio", ""()

    If both keywords appear the 'sideeffect' keyword must come first.

    + +

    -Inline Asm Metadata + Inline Asm Metadata

    -

    The call instructions that wrap inline asm nodes may have a "!srcloc" MDNode - attached to it that contains a list of constant integers. If present, the - code generator will use the integer as the location cookie value when report - errors through the LLVMContext error reporting mechanisms. This allows a - front-end to correlate backend errors that occur with inline asm back to the - source code that produced it. For example:

    +

    The call instructions that wrap inline asm nodes may have a + "!srcloc" MDNode attached to it that contains a list of constant + integers. If present, the code generator will use the integer as the + location cookie value when report errors through the LLVMContext + error reporting mechanisms. This allows a front-end to correlate backend + errors that occur with inline asm back to the source code that produced it. + For example:

     call void asm sideeffect "something bad", ""(), !srcloc !42
    @@ -2844,7 +2887,7 @@ call void asm sideeffect "something bad", ""(), !srcloc !42
     

    It is up to the front-end to make sense of the magic numbers it places in the - IR. If the MDNode contains multiple constants, the code generator will use + IR. If the MDNode contains multiple constants, the code generator will use the one that corresponds to the line of the asm that the error occurs on.

    @@ -2866,20 +2909,33 @@ call void asm sideeffect "something bad", ""(), !srcloc !42 preceding exclamation point ('!').

    A metadata string is a string surrounded by double quotes. It can contain - any character by escaping non-printable characters with "\xx" where "xx" is - the two digit hex code. For example: "!"test\00"".

    + any character by escaping non-printable characters with "\xx" where + "xx" is the two digit hex code. For example: + "!"test\00"".

    Metadata nodes are represented with notation similar to structure constants (a comma separated list of elements, surrounded by braces and preceded by an - exclamation point). For example: "!{ metadata !"test\00", i32 - 10}". Metadata nodes can have any values as their operand.

    + exclamation point). Metadata nodes can have any values as their operand. For + example:

    + +
    +
    +!{ metadata !"test\00", i32 10}
    +
    +

    A named metadata is a collection of metadata nodes, which can be looked up in the module symbol table. For - example: "!foo = metadata !{!4, !3}". + example:

    + +
    +
    +!foo =  metadata !{!4, !3}
    +
    +

    Metadata can be used as function arguments. Here llvm.dbg.value - function is using two metadata arguments.

    + function is using two metadata arguments:

    @@ -2888,7 +2944,8 @@ call void @llvm.dbg.value(metadata !24, i64 0, metadata !25)
     

    Metadata can be attached with an instruction. Here metadata !21 is - attached with add instruction using !dbg identifier.

    + attached to the add instruction using the !dbg + identifier:

    @@ -2896,6 +2953,324 @@ call void @llvm.dbg.value(metadata !24, i64 0, metadata !25)
     
    +

    More information about specific metadata nodes recognized by the optimizers + and code generator is found below.

    + + +

    + 'tbaa' Metadata +

    + +
    + +

    In LLVM IR, memory does not have types, so LLVM's own type system is not + suitable for doing TBAA. Instead, metadata is added to the IR to describe + a type system of a higher level language. This can be used to implement + typical C/C++ TBAA, but it can also be used to implement custom alias + analysis behavior for other languages.

    + +

    The current metadata format is very simple. TBAA metadata nodes have up to + three fields, e.g.:

    + +
    +
    +!0 = metadata !{ metadata !"an example type tree" }
    +!1 = metadata !{ metadata !"int", metadata !0 }
    +!2 = metadata !{ metadata !"float", metadata !0 }
    +!3 = metadata !{ metadata !"const float", metadata !2, i64 1 }
    +
    +
    + +

    The first field is an identity field. It can be any value, usually + a metadata string, which uniquely identifies the type. The most important + name in the tree is the name of the root node. Two trees with + different root node names are entirely disjoint, even if they + have leaves with common names.

    + +

    The second field identifies the type's parent node in the tree, or + is null or omitted for a root node. A type is considered to alias + all of its descendants and all of its ancestors in the tree. Also, + a type is considered to alias all types in other trees, so that + bitcode produced from multiple front-ends is handled conservatively.

    + +

    If the third field is present, it's an integer which if equal to 1 + indicates that the type is "constant" (meaning + pointsToConstantMemory should return true; see + other useful + AliasAnalysis methods).

    + +
    + + +

    + 'fpmath' Metadata +

    + +
    + +

    fpmath metadata may be attached to any instruction of floating point + type. It can be used to express the maximum acceptable error in the result of + that instruction, in ULPs, thus potentially allowing the compiler to use a + more efficient but less accurate method of computing it. ULP is defined as + follows:

    + +
    + +

    If x is a real number that lies between two finite consecutive + floating-point numbers a and b, without being equal to one + of them, then ulp(x) = |b - a|, otherwise ulp(x) is the + distance between the two non-equal finite floating-point numbers nearest + x. Moreover, ulp(NaN) is NaN.

    + +
    + +

    The metadata node shall consist of a single positive floating point number + representing the maximum relative error, for example:

    + +
    +
    +!0 = metadata !{ float 2.5 } ; maximum acceptable inaccuracy is 2.5 ULPs
    +
    +
    + +
    + + +

    + 'range' Metadata +

    + +
    +

    range metadata may be attached only to loads of integer types. It + expresses the possible ranges the loaded value is in. The ranges are + represented with a flattened list of integers. The loaded value is known to + be in the union of the ranges defined by each consecutive pair. Each pair + has the following properties:

    + + +

    Examples:

    +
    +
    +  %a = load i8* %x, align 1, !range !0 ; Can only be 0 or 1
    +  %b = load i8* %y, align 1, !range !1 ; Can only be 255 (-1), 0 or 1
    +  %c = load i8* %z, align 1, !range !2 ; Can only be 0, 1, 3, 4 or 5
    +...
    +!0 = metadata !{ i8 0, i8 2 }
    +!1 = metadata !{ i8 255, i8 2 }
    +!2 = metadata !{ i8 0, i8 2, i8 3, i8 6 }
    +
    +
    +
    +
    + + + + +

    + Module Flags Metadata +

    + + +
    + +

    Information about the module as a whole is difficult to convey to LLVM's + subsystems. The LLVM IR isn't sufficient to transmit this + information. The llvm.module.flags named metadata exists in order to + facilitate this. These flags are in the form of key / value pairs — + much like a dictionary — making it easy for any subsystem who cares + about a flag to look it up.

    + +

    The llvm.module.flags metadata contains a list of metadata + triplets. Each triplet has the following form:

    + + + +

    When two (or more) modules are merged together, the resulting + llvm.module.flags metadata is the union of the + modules' llvm.module.flags metadata. The only exception being a flag + with the Override behavior, which may override another flag's value + (see below).

    + +

    The following behaviors are supported:

    + + + + + + + + + + + + + + + + + + + + + + + + +
    ValueBehavior
    1 +
    +
    Error
    +
    Emits an error if two values disagree. It is an error to have an ID + with both an Error and a Warning behavior.
    +
    +
    2 +
    +
    Warning
    +
    Emits a warning if two values disagree.
    +
    +
    3 +
    +
    Require
    +
    Emits an error when the specified value is not present or doesn't + have the specified value. It is an error for two (or more) + llvm.module.flags with the same ID to have the Require + behavior but different values. There may be multiple Require flags + per ID.
    +
    +
    4 +
    +
    Override
    +
    Uses the specified value if the two values disagree. It is an + error for two (or more) llvm.module.flags with the same + ID to have the Override behavior but different values.
    +
    +
    + +

    An example of module flags:

    + +
    +!0 = metadata !{ i32 1, metadata !"foo", i32 1 }
    +!1 = metadata !{ i32 4, metadata !"bar", i32 37 }
    +!2 = metadata !{ i32 2, metadata !"qux", i32 42 }
    +!3 = metadata !{ i32 3, metadata !"qux",
    +  metadata !{
    +    metadata !"foo", i32 1
    +  }
    +}
    +!llvm.module.flags = !{ !0, !1, !2, !3 }
    +
    + + + + + +

    +Objective-C Garbage Collection Module Flags Metadata +

    + +
    + +

    On the Mach-O platform, Objective-C stores metadata about garbage collection + in a special section called "image info". The metadata consists of a version + number and a bitmask specifying what types of garbage collection are + supported (if any) by the file. If two or more modules are linked together + their garbage collection metadata needs to be merged rather than appended + together.

    + +

    The Objective-C garbage collection module flags metadata consists of the + following key-value pairs:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    KeyValue
    Objective-C Version[Required] — The Objective-C ABI + version. Valid values are 1 and 2.
    Objective-C Image Info Version[Required] — The version of the image info + section. Currently always 0.
    Objective-C Image Info Section[Required] — The section to place the + metadata. Valid values are "__OBJC, __image_info, regular" for + Objective-C ABI version 1, and "__DATA,__objc_imageinfo, regular, + no_dead_strip" for Objective-C ABI version 2.
    Objective-C Garbage Collection[Required] — Specifies whether garbage + collection is supported or not. Valid values are 0, for no garbage + collection, and 2, for garbage collection supported.
    Objective-C GC Only[Optional] — Specifies that only garbage + collection is supported. If present, its value must be 6. This flag + requires that the Objective-C Garbage Collection flag have the + value 2.
    + +

    Some important flag interactions:

    + + +
    @@ -2924,26 +3299,29 @@ href="#linkage_appending">appending linkage. This array contains a list of pointers to global variables and functions which may optionally have a pointer cast formed of bitcast or getelementptr. For example, a legal use of it is:

    +
    -  @X = global i8 4
    -  @Y = global i32 123
    +@X = global i8 4
    +@Y = global i32 123
     
    -  @llvm.used = appending global [2 x i8*] [
    -     i8* @X,
    -     i8* bitcast (i32* @Y to i8*)
    -  ], section "llvm.metadata"
    +@llvm.used = appending global [2 x i8*] [
    +   i8* @X,
    +   i8* bitcast (i32* @Y to i8*)
    +], section "llvm.metadata"
     
    +

    If a global variable appears in the @llvm.used list, then the -compiler, assembler, and linker are required to treat the symbol as if there is -a reference to the global that it cannot see. For example, if a variable has -internal linkage and no references other than that from the @llvm.used -list, it cannot be deleted. This is commonly used to represent references from -inline asms and other things the compiler cannot "see", and corresponds to -"attribute((used))" in GNU C.

    + compiler, assembler, and linker are required to treat the symbol as if there + is a reference to the global that it cannot see. For example, if a variable + has internal linkage and no references other than that from + the @llvm.used list, it cannot be deleted. This is commonly used to + represent references from inline asms and other things the compiler cannot + "see", and corresponds to "attribute((used))" in GNU C.

    On some targets, the code generator must emit a directive to the assembler or -object file to prevent the assembler and linker from molesting the symbol.

    + object file to prevent the assembler and linker from molesting the + symbol.

    @@ -2957,13 +3335,13 @@ object file to prevent the assembler and linker from molesting the symbol.

    The @llvm.compiler.used directive is the same as the -@llvm.used directive, except that it only prevents the compiler from -touching the symbol. On targets that support it, this allows an intelligent -linker to optimize references to the symbol without being impeded as it would be -by @llvm.used.

    + @llvm.used directive, except that it only prevents the compiler from + touching the symbol. On targets that support it, this allows an intelligent + linker to optimize references to the symbol without being impeded as it would + be by @llvm.used.

    This is a rare construct that should only be used in rare circumstances, and -should not be exposed to source languages.

    + should not be exposed to source languages.

    @@ -2973,12 +3351,19 @@ should not be exposed to source languages.

    + +
     %0 = type { i32, void ()* }
     @llvm.global_ctors = appending global [1 x %0] [%0 { i32 65535, void ()* @ctor }]
     
    -

    The @llvm.global_ctors array contains a list of constructor functions and associated priorities. The functions referenced by this array will be called in ascending order of priority (i.e. lowest first) when the module is loaded. The order of functions with the same priority is not defined. -

    +
    + +

    The @llvm.global_ctors array contains a list of constructor + functions and associated priorities. The functions referenced by this array + will be called in ascending order of priority (i.e. lowest first) when the + module is loaded. The order of functions with the same priority is not + defined.

    @@ -2988,13 +3373,18 @@ should not be exposed to source languages.

    + +
     %0 = type { i32, void ()* }
     @llvm.global_dtors = appending global [1 x %0] [%0 { i32 65535, void ()* @dtor }]
     
    +
    -

    The @llvm.global_dtors array contains a list of destructor functions and associated priorities. The functions referenced by this array will be called in descending order of priority (i.e. highest first) when the module is loaded. The order of functions with the same priority is not defined. -

    +

    The @llvm.global_dtors array contains a list of destructor functions + and associated priorities. The functions referenced by this array will be + called in descending order of priority (i.e. highest first) when the module + is loaded. The order of functions with the same priority is not defined.

    @@ -3033,7 +3423,6 @@ should not be exposed to source languages.

    'switch', 'indirectbr', 'invoke', - 'unwind', 'resume', and 'unreachable'.

    @@ -3253,15 +3642,15 @@ IfUnequal: 'normal' label or the 'exception' label. If the callee function returns with the "ret" instruction, control flow will return to the "normal" label. If the callee (or any - indirect callees) returns with the "unwind" - instruction, control is interrupted and continued at the dynamically nearest - "exception" label.

    + indirect callees) returns via the "resume" + instruction or other exception handling mechanism, control is interrupted and + continued at the dynamically nearest "exception" label.

    The 'exception' label is a landing pad for the exception. As such, 'exception' label is required to have the "landingpad" instruction, which contains - the information about about the behavior of the program after unwinding + the information about the behavior of the program after unwinding happens, as its first non-PHI instruction. The restrictions on the "landingpad" instruction's tightly couples it to the "invoke" instruction, so that the important information contained @@ -3297,8 +3686,9 @@ IfUnequal:

  • 'normal label': the label reached when the called function executes a 'ret' instruction.
  • -
  • 'exception label': the label reached when a callee returns with - the unwind instruction.
  • +
  • 'exception label': the label reached when a callee returns via + the resume instruction or other exception + handling mechanism.
  • The optional function attributes list. Only 'noreturn', 'nounwind', 'readonly' and @@ -3321,9 +3711,6 @@ IfUnequal: block to the "normal" label. If the callee unwinds then no return value is available.

    -

    Note that the code generator does not yet completely support unwind, and -that the invoke/unwind semantics are likely to change in future versions.

    -
    Example:
       %retval = invoke i32 @Test(i32 15) to label %Continue
    @@ -3332,38 +3719,6 @@ that the invoke/unwind semantics are likely to change in future versions.

    unwind label %TestCleanup ; {i32}:retval set
    - - - - -

    - 'unwind' Instruction -

    - -
    - -
    Syntax:
    -
    -  unwind
    -
    - -
    Overview:
    -

    The 'unwind' instruction unwinds the stack, continuing control flow - at the first callee in the dynamic call stack which used - an invoke instruction to perform the call. - This is primarily used to implement exception handling.

    - -
    Semantics:
    -

    The 'unwind' instruction causes execution of the current function to - immediately halt. The dynamic call stack is then searched for the - first invoke instruction on the call stack. - Once found, execution continues at the "exceptional" destination block - specified by the invoke instruction. If there is no invoke - instruction in the dynamic call chain, undefined behavior results.

    - -

    Note that the code generator does not yet completely support unwind, and -that the invoke/unwind semantics are likely to change in future versions.

    -
    @@ -3476,7 +3831,7 @@ that the invoke/unwind semantics are likely to change in future versions.

    nuw and nsw stand for "No Unsigned Wrap" and "No Signed Wrap", respectively. If the nuw and/or nsw keywords are present, the result value of the add - is a trap value if unsigned and/or signed overflow, + is a poison value if unsigned and/or signed overflow, respectively, occurs.

    Example:
    @@ -3557,7 +3912,7 @@ that the invoke/unwind semantics are likely to change in future versions.

    nuw and nsw stand for "No Unsigned Wrap" and "No Signed Wrap", respectively. If the nuw and/or nsw keywords are present, the result value of the sub - is a trap value if unsigned and/or signed overflow, + is a poison value if unsigned and/or signed overflow, respectively, occurs.

    Example:
    @@ -3644,7 +3999,7 @@ that the invoke/unwind semantics are likely to change in future versions.

    nuw and nsw stand for "No Unsigned Wrap" and "No Signed Wrap", respectively. If the nuw and/or nsw keywords are present, the result value of the mul - is a trap value if unsigned and/or signed overflow, + is a poison value if unsigned and/or signed overflow, respectively, occurs.

    Example:
    @@ -3714,7 +4069,7 @@ that the invoke/unwind semantics are likely to change in future versions.

    Division by zero leads to undefined behavior.

    If the exact keyword is present, the result value of the - udiv is a trap value if %op1 is not a + udiv is a poison value if %op1 is not a multiple of %op2 (as such, "((a udiv exact b) mul b) == a").

    @@ -3758,7 +4113,7 @@ that the invoke/unwind semantics are likely to change in future versions.

    a 32-bit division of -2147483648 by -1.

    If the exact keyword is present, the result value of the - sdiv is a trap value if the result would + sdiv is a poison value if the result would be rounded.

    Example:
    @@ -3967,9 +4322,9 @@ that the invoke/unwind semantics are likely to change in future versions.

    shift amount in op2.

    If the nuw keyword is present, then the shift produces a - trap value if it shifts out any non-zero bits. If + poison value if it shifts out any non-zero bits. If the nsw keyword is present, then the shift produces a - trap value if it shifts out any bits that disagree + poison value if it shifts out any bits that disagree with the resultant sign bit. As such, NUW/NSW have the same semantics as they would if the shift were expressed as a mul instruction with the same nsw/nuw bits in (mul %op1, (shl 1, %op2)).

    @@ -4016,7 +4371,7 @@ that the invoke/unwind semantics are likely to change in future versions.

    shift amount in op2.

    If the exact keyword is present, the result value of the - lshr is a trap value if any of the bits + lshr is a poison value if any of the bits shifted out are non-zero.

    @@ -4064,7 +4419,7 @@ that the invoke/unwind semantics are likely to change in future versions.

    the corresponding shift amount in op2.

    If the exact keyword is present, the result value of the - ashr is a trap value if any of the bits + ashr is a poison value if any of the bits shifted out are non-zero.

    Example:
    @@ -4106,9 +4461,9 @@ that the invoke/unwind semantics are likely to change in future versions.

    - - - + + + @@ -4167,9 +4522,9 @@ that the invoke/unwind semantics are likely to change in future versions.

    In0In1OutIn0In1Out
    0
    - - - + + + @@ -4231,9 +4586,9 @@ that the invoke/unwind semantics are likely to change in future versions.

    In0In1OutIn0In1Out
    0
    - - - + + + @@ -4550,8 +4905,12 @@ that the invoke/unwind semantics are likely to change in future versions.

    'alloca' instruction is commonly used to represent automatic variables that must have an address available. When the function returns (either with the ret - or unwind instructions), the memory is - reclaimed. Allocating zero bytes is legal, but the result is undefined.

    + or resume instructions), the memory is + reclaimed. Allocating zero bytes is legal, but the result is undefined. + The order in which memory is allocated (ie., which way the stack grows) is + not specified.

    + +

    Example:
    @@ -4572,7 +4931,7 @@ that the invoke/unwind semantics are likely to change in future versions.

    Syntax:
    -  <result> = load [volatile] <ty>* <pointer>[, align <alignment>][, !nontemporal !<index>]
    +  <result> = load [volatile] <ty>* <pointer>[, align <alignment>][, !nontemporal !<index>][, !invariant.load !<index>]
       <result> = load atomic [volatile] <ty>* <pointer> [singlethread] <ordering>, align <alignment>
       !<index> = !{ i32 1 }
     
    @@ -4617,6 +4976,14 @@ that the invoke/unwind semantics are likely to change in future versions.

    The code generator may select special instructions to save cache bandwidth, such as the MOVNT instruction on x86.

    +

    The optional !invariant.load metadata must reference a single + metatadata name <index> corresponding to a metadata node with no + entries. The existence of the !invariant.load metatadata on the + instruction tells the optimizer and code generator that this load address + points to memory which does not change value during program execution. + The optimizer may then move this load around, for example, by hoisting it + out of loops using loop invariant code motion.

    +
    Semantics:

    The location of memory pointed to is loaded. If the value being loaded is of scalar type then the number of bytes read does not exceed the minimum number @@ -4644,8 +5011,8 @@ that the invoke/unwind semantics are likely to change in future versions.

    Syntax:
    -  store [volatile] <ty> <value>, <ty>* <pointer>[, align <alignment>][, !nontemporal !<index>]                   ; yields {void}
    -  store atomic [volatile] <ty> <value>, <ty>* <pointer> [singlethread] <ordering>, align <alignment>             ; yields {void}
    +  store [volatile] <ty> <value>, <ty>* <pointer>[, align <alignment>][, !nontemporal !<index>]        ; yields {void}
    +  store atomic [volatile] <ty> <value>, <ty>* <pointer> [singlethread] <ordering>, align <alignment>  ; yields {void}
     
    Overview:
    @@ -4774,7 +5141,7 @@ thread. (This is useful for interacting with signal handlers.)

    Syntax:
    -  cmpxchg [volatile] <ty>* <pointer>, <ty> <cmp>, <ty> <new> [singlethread] <ordering>                   ; yields {ty}
    +  cmpxchg [volatile] <ty>* <pointer>, <ty> <cmp>, <ty> <new> [singlethread] <ordering>  ; yields {ty}
     
    Overview:
    @@ -4832,13 +5199,13 @@ FIXME: Is a weaker ordering constraint on failure helpful in practice?
    Example:
     entry:
    -  %orig = atomic load i32* %ptr unordered                       ; yields {i32}
    +  %orig = atomic load i32* %ptr unordered                   ; yields {i32}
       br label %loop
     
     loop:
       %cmp = phi i32 [ %orig, %entry ], [%old, %loop]
       %squared = mul i32 %cmp, %cmp
    -  %old = cmpxchg i32* %ptr, i32 %cmp, i32 %squared                       ; yields {i32}
    +  %old = cmpxchg i32* %ptr, i32 %cmp, i32 %squared          ; yields {i32}
       %success = icmp eq i32 %cmp, %old
       br i1 %success, label %done, label %loop
     
    @@ -4930,6 +5297,7 @@ specified by the operation argument:

       <result> = getelementptr <pty>* <ptrval>{, <ty> <idx>}*
       <result> = getelementptr inbounds <pty>* <ptrval>{, <ty> <idx>}*
    +  <result> = getelementptr <ptr vector> ptrval, <vector index type> idx 
     
    Overview:
    @@ -4938,7 +5306,8 @@ specified by the operation argument:

    It performs address calculation only and does not access memory.

    Arguments:
    -

    The first argument is always a pointer, and forms the basis of the +

    The first argument is always a pointer or a vector of pointers, + and forms the basis of the calculation. The remaining arguments are indices that indicate which of the elements of the aggregate object are indexed. The interpretation of each index is dependent on the type being indexed into. The first index always @@ -4976,54 +5345,57 @@ int *foo(struct ST *s) { }

    -

    The LLVM code generated by the GCC frontend is:

    +

    The LLVM code generated by Clang is:

    -%RT = type { i8 , [10 x [20 x i32]], i8  }
    -%ST = type { i32, double, %RT }
    +%struct.RT = type { i8, [10 x [20 x i32]], i8 }
    +%struct.ST = type { i32, double, %struct.RT }
     
    -define i32* @foo(%ST* %s) {
    +define i32* @foo(%struct.ST* %s) nounwind uwtable readnone optsize ssp {
     entry:
    -  %reg = getelementptr %ST* %s, i32 1, i32 2, i32 1, i32 5, i32 13
    -  ret i32* %reg
    +  %arrayidx = getelementptr inbounds %struct.ST* %s, i64 1, i32 2, i32 1, i64 5, i64 13
    +  ret i32* %arrayidx
     }
     
    Semantics:
    -

    In the example above, the first index is indexing into the '%ST*' - type, which is a pointer, yielding a '%ST' = '{ i32, double, %RT - }' type, a structure. The second index indexes into the third element - of the structure, yielding a '%RT' = '{ i8 , [10 x [20 x i32]], - i8 }' type, another structure. The third index indexes into the second - element of the structure, yielding a '[10 x [20 x i32]]' type, an - array. The two dimensions of the array are subscripted into, yielding an - 'i32' type. The 'getelementptr' instruction returns a - pointer to this element, thus computing a value of 'i32*' type.

    +

    In the example above, the first index is indexing into the + '%struct.ST*' type, which is a pointer, yielding a + '%struct.ST' = '{ i32, double, %struct.RT }' type, a + structure. The second index indexes into the third element of the structure, + yielding a '%struct.RT' = '{ i8 , [10 x [20 x i32]], i8 }' + type, another structure. The third index indexes into the second element of + the structure, yielding a '[10 x [20 x i32]]' type, an array. The + two dimensions of the array are subscripted into, yielding an 'i32' + type. The 'getelementptr' instruction returns a pointer to this + element, thus computing a value of 'i32*' type.

    Note that it is perfectly legal to index partially through a structure, returning a pointer to an inner element. Because of this, the LLVM code for the given testcase is equivalent to:

    -
    -  define i32* @foo(%ST* %s) {
    -    %t1 = getelementptr %ST* %s, i32 1                        ; yields %ST*:%t1
    -    %t2 = getelementptr %ST* %t1, i32 0, i32 2                ; yields %RT*:%t2
    -    %t3 = getelementptr %RT* %t2, i32 0, i32 1                ; yields [10 x [20 x i32]]*:%t3
    -    %t4 = getelementptr [10 x [20 x i32]]* %t3, i32 0, i32 5  ; yields [20 x i32]*:%t4
    -    %t5 = getelementptr [20 x i32]* %t4, i32 0, i32 13        ; yields i32*:%t5
    -    ret i32* %t5
    -  }
    +
    +define i32* @foo(%struct.ST* %s) {
    +  %t1 = getelementptr %struct.ST* %s, i32 1                 ; yields %struct.ST*:%t1
    +  %t2 = getelementptr %struct.ST* %t1, i32 0, i32 2         ; yields %struct.RT*:%t2
    +  %t3 = getelementptr %struct.RT* %t2, i32 0, i32 1         ; yields [10 x [20 x i32]]*:%t3
    +  %t4 = getelementptr [10 x [20 x i32]]* %t3, i32 0, i32 5  ; yields [20 x i32]*:%t4
    +  %t5 = getelementptr [20 x i32]* %t4, i32 0, i32 13        ; yields i32*:%t5
    +  ret i32* %t5
    +}
     

    If the inbounds keyword is present, the result value of the - getelementptr is a trap value if the + getelementptr is a poison value if the base pointer is not an in bounds address of an allocated object, or if any of the addresses that would be formed by successive addition of the offsets implied by the indices to the base address with infinitely precise signed arithmetic are not an in bounds address of that allocated object. The in bounds addresses for an allocated object are all the addresses that point into the object, plus the address one - byte past the end.

    + byte past the end. + In cases where the base is a vector of pointers the inbounds keyword + applies to each of the computations element-wise.

    If the inbounds keyword is not present, the offsets are added to the base address with silently-wrapping two's complement arithmetic. If the @@ -5050,6 +5422,13 @@ entry: %iptr = getelementptr [10 x i32]* @arr, i16 0, i16 0

    +

    In cases where the pointer argument is a vector of pointers, only a + single index may be used, and the number of vector elements has to be + the same. For example:

    +
    + %A = getelementptr <4 x i8*> %ptrs, <4 x i64> %offsets,
    +
    + @@ -5422,13 +5801,16 @@ entry:
    Overview:
    -

    The 'ptrtoint' instruction converts the pointer value to - the integer type ty2.

    +

    The 'ptrtoint' instruction converts the pointer or a vector of + pointers value to + the integer (or vector of integers) type ty2.

    Arguments:

    The 'ptrtoint' instruction takes a value to cast, which - must be a pointer value, and a type to cast it to - ty2, which must be an integer type.

    + must be a a value of type pointer or a vector of + pointers, and a type to cast it to + ty2, which must be an integer or a vector + of integers type.

    Semantics:

    The 'ptrtoint' instruction converts value to integer type @@ -5441,8 +5823,9 @@ entry:

    Example:
    -  %X = ptrtoint i32* %X to i8           ; yields truncation on 32-bit architecture
    -  %Y = ptrtoint i32* %x to i64          ; yields zero extension on 32-bit architecture
    +  %X = ptrtoint i32* %P to i8                         ; yields truncation on 32-bit architecture
    +  %Y = ptrtoint i32* %P to i64                        ; yields zero extension on 32-bit architecture
    +  %Z = ptrtoint <4 x i32*> %P to <4 x i64>; yields vector zero extension for a vector of addresses on 32-bit architecture
     
    @@ -5481,6 +5864,7 @@ entry: %X = inttoptr i32 255 to i32* ; yields zero extension on 64-bit architecture %Y = inttoptr i32 255 to i32* ; yields no-op on 32-bit architecture %Z = inttoptr i64 0 to i32* ; yields truncation on 32-bit architecture + %Z = inttoptr <4 x i32> %G to <4 x i8*>; yields truncation of vector G to four pointers @@ -5515,8 +5899,9 @@ entry:

    The 'bitcast' instruction converts value to type ty2. It is always a no-op cast because no bits change with this conversion. The conversion is done as if the value had been - stored to memory and read back as type ty2. Pointer types may only - be converted to other pointer types with this instruction. To convert + stored to memory and read back as type ty2. + Pointer (or vector of pointers) types may only be converted to other pointer + (or vector of pointers) types with this instruction. To convert pointers to other types, use the inttoptr or ptrtoint instructions first.

    @@ -5524,7 +5909,8 @@ entry:
       %X = bitcast i8 255 to i8              ; yields i8 :-1
       %Y = bitcast i32* %x to sint*          ; yields sint*:%x
    -  %Z = bitcast <2 x int> %V to i64;      ; yields i64: %V
    +  %Z = bitcast <2 x int> %V to i64;        ; yields i64: %V
    +  %Z = bitcast <2 x i32*> %V to <2 x i64*> ; yields <2 x i64*>
     
    @@ -5555,8 +5941,8 @@ entry:
    Overview:

    The 'icmp' instruction returns a boolean value or a vector of - boolean values based on comparison of its two integer, integer vector, or - pointer operands.

    + boolean values based on comparison of its two integer, integer vector, + pointer, or pointer vector operands.

    Arguments:

    The 'icmp' instruction takes three operands. The first operand is @@ -5851,9 +6237,6 @@ Loop: ; Infinite loop that counts from 0 on up... %X = select i1 true, i8 17, i8 42 ; yields i8:17 -

    Note that the code generator does not yet support conditions - with vector type.

    - @@ -6020,8 +6403,8 @@ freestanding environments and non-C-based languages.

    Syntax:
    -  <resultval> = landingpad <somety> personality <type> <pers_fn> <clause>+
    -  <resultval> = landingpad <somety> personality <type> <pers_fn> cleanup <clause>*
    +  <resultval> = landingpad <resultty> personality <type> <pers_fn> <clause>+
    +  <resultval> = landingpad <resultty> personality <type> <pers_fn> cleanup <clause>*
     
       <clause> := catch <type> <value>
       <clause> := filter <array constant type> <array constant>
    @@ -6035,7 +6418,7 @@ freestanding environments and non-C-based languages.

    catch portion of a try/catch sequence. It defines values supplied by the personality function (pers_fn) upon re-entry to the function. The resultval has the - type somety.

    + type resultty.

    Arguments:

    This instruction takes a pers_fn value. This is the personality @@ -6059,7 +6442,11 @@ freestanding environments and non-C-based languages.

    The clauses are applied in order from top to bottom. If two landingpad instructions are merged together through inlining, the - clauses from the calling function are appended to the list of clauses.

    + clauses from the calling function are appended to the list of clauses. + When the call stack is being unwound due to an exception being thrown, the + exception is compared against each clause in turn. If it doesn't + match any of the clauses, and the cleanup flag is not set, then + unwinding continues further up the call stack.

    The landingpad instruction has several restrictions:

    @@ -6285,8 +6672,6 @@ declare void @llvm.va_end(i8*) - -

    Accurate Garbage Collection Intrinsics @@ -6982,8 +7367,6 @@ LLVM.

    - -

    'llvm.exp.*' Intrinsic @@ -7048,6 +7431,9 @@ LLVM.

    This function returns the same values as the libm log functions would, and handles error conditions in the same way.

    + + +

    'llvm.fma.*' Intrinsic

    @@ -7081,6 +7467,8 @@ LLVM.

    + +

    Bit Manipulation Intrinsics @@ -7175,12 +7563,12 @@ LLVM.

    targets support all bit widths or vector types, however.

    -  declare i8 @llvm.ctlz.i8 (i8  <src>)
    -  declare i16 @llvm.ctlz.i16(i16 <src>)
    -  declare i32 @llvm.ctlz.i32(i32 <src>)
    -  declare i64 @llvm.ctlz.i64(i64 <src>)
    -  declare i256 @llvm.ctlz.i256(i256 <src>)
    -  declare <2 x i32> @llvm.ctlz.v2i32(<2 x i32> <src;gt)
    +  declare i8   @llvm.ctlz.i8  (i8   <src>, i1 <is_zero_undef>)
    +  declare i16  @llvm.ctlz.i16 (i16  <src>, i1 <is_zero_undef>)
    +  declare i32  @llvm.ctlz.i32 (i32  <src>, i1 <is_zero_undef>)
    +  declare i64  @llvm.ctlz.i64 (i64  <src>, i1 <is_zero_undef>)
    +  declare i256 @llvm.ctlz.i256(i256 <src>, i1 <is_zero_undef>)
    +  declase <2 x i32> @llvm.ctlz.v2i32(<2 x i32> <src>, i1 <is_zero_undef>)
     

    Overview:
    @@ -7188,15 +7576,22 @@ LLVM.

    leading zeros in a variable.

    Arguments:
    -

    The only argument is the value to be counted. The argument may be of any - integer type, or any vector type with integer element type. - The return type must match the argument type.

    +

    The first argument is the value to be counted. This argument may be of any + integer type, or a vectory with integer element type. The return type + must match the first argument type.

    + +

    The second argument must be a constant and is a flag to indicate whether the + intrinsic should ensure that a zero as the first argument produces a defined + result. Historically some architectures did not provide a defined result for + zero values as efficiently, and many algorithms are now predicated on + avoiding zero-value inputs.

    Semantics:

    The 'llvm.ctlz' intrinsic counts the leading (most significant) - zeros in a variable, or within each element of the vector if the operation - is of vector type. If the src == 0 then the result is the size in bits of - the type of src. For example, llvm.ctlz(i32 2) = 30.

    + zeros in a variable, or within each element of the vector. + If src == 0 then the result is the size in bits of the type of + src if is_zero_undef == 0 and undef otherwise. + For example, llvm.ctlz(i32 2) = 30.

    @@ -7213,12 +7608,12 @@ LLVM.

    support all bit widths or vector types, however.

    -  declare i8 @llvm.cttz.i8 (i8  <src>)
    -  declare i16 @llvm.cttz.i16(i16 <src>)
    -  declare i32 @llvm.cttz.i32(i32 <src>)
    -  declare i64 @llvm.cttz.i64(i64 <src>)
    -  declare i256 @llvm.cttz.i256(i256 <src>)
    -  declase <2 x i32> @llvm.cttz.v2i32(<2 x i32> <src>)
    +  declare i8   @llvm.cttz.i8  (i8   <src>, i1 <is_zero_undef>)
    +  declare i16  @llvm.cttz.i16 (i16  <src>, i1 <is_zero_undef>)
    +  declare i32  @llvm.cttz.i32 (i32  <src>, i1 <is_zero_undef>)
    +  declare i64  @llvm.cttz.i64 (i64  <src>, i1 <is_zero_undef>)
    +  declare i256 @llvm.cttz.i256(i256 <src>, i1 <is_zero_undef>)
    +  declase <2 x i32> @llvm.cttz.v2i32(<2 x i32> <src>, i1 <is_zero_undef>)
     
    Overview:
    @@ -7226,15 +7621,22 @@ LLVM.

    trailing zeros.

    Arguments:
    -

    The only argument is the value to be counted. The argument may be of any - integer type, or a vectory with integer element type.. The return type - must match the argument type.

    +

    The first argument is the value to be counted. This argument may be of any + integer type, or a vectory with integer element type. The return type + must match the first argument type.

    + +

    The second argument must be a constant and is a flag to indicate whether the + intrinsic should ensure that a zero as the first argument produces a defined + result. Historically some architectures did not provide a defined result for + zero values as efficiently, and many algorithms are now predicated on + avoiding zero-value inputs.

    Semantics:

    The 'llvm.cttz' intrinsic counts the trailing (least significant) zeros in a variable, or within each element of a vector. - If the src == 0 then the result is the size in bits of - the type of src. For example, llvm.cttz(2) = 1.

    + If src == 0 then the result is the size in bits of the type of + src if is_zero_undef == 0 and undef otherwise. + For example, llvm.cttz(2) = 1.

    @@ -7546,7 +7948,8 @@ LLVM.

    -

    Half precision floating point is a storage-only format. This means that it is +

    For most target platforms, half precision floating point is a storage-only + format. This means that it is a dense encoding (in memory) but does not support computation in the format.

    @@ -7665,12 +8068,12 @@ LLVM.

    - Trampoline Intrinsic + Trampoline Intrinsics

    -

    This intrinsic makes it possible to excise one parameter, marked with +

    These intrinsics make it possible to excise one parameter, marked with the nest attribute, from a function. The result is a callable function pointer lacking the nest parameter - the caller does not need to @@ -7687,7 +8090,8 @@ LLVM.

       %tramp = alloca [10 x i8], align 4 ; size and alignment only correct for X86
       %tramp1 = getelementptr [10 x i8]* %tramp, i32 0, i32 0
    -  %p = call i8* @llvm.init.trampoline(i8* %tramp1, i8* bitcast (i32 (i8* nest , i32, i32)* @f to i8*), i8* %nval)
    +  call i8* @llvm.init.trampoline(i8* %tramp1, i8* bitcast (i32 (i8*, i32, i32)* @f to i8*), i8* %nval)
    +  %p = call i8* @llvm.adjust.trampoline(i8* %tramp1)
       %fp = bitcast i8* %p to i32 (i32, i32)*
     
    @@ -7705,12 +8109,12 @@ LLVM.

    Syntax:
    -  declare i8* @llvm.init.trampoline(i8* <tramp>, i8* <func>, i8* <nval>)
    +  declare void @llvm.init.trampoline(i8* <tramp>, i8* <func>, i8* <nval>)
     
    Overview:
    -

    This fills the memory pointed to by tramp with code and returns a - function pointer suitable for executing it.

    +

    This fills the memory pointed to by tramp with executable code, + turning it into a trampoline.

    Arguments:

    The llvm.init.trampoline intrinsic takes three arguments, all @@ -7724,514 +8128,50 @@ LLVM.

    Semantics:

    The block of memory pointed to by tramp is filled with target - dependent code, turning it into a function. A pointer to this function is - returned, but needs to be bitcast to an appropriate - function pointer type before being called. The new function's signature - is the same as that of func with any arguments marked with - the nest attribute removed. At most one such nest argument - is allowed, and it must be of pointer type. Calling the new function is - equivalent to calling func with the same argument list, but - with nval used for the missing nest argument. If, after - calling llvm.init.trampoline, the memory pointed to - by tramp is modified, then the effect of any later call to the - returned function pointer is undefined.

    - -
    - -
    - - -

    - Atomic Operations and Synchronization Intrinsics -

    - -
    - -

    These intrinsic functions expand the "universal IR" of LLVM to represent - hardware constructs for atomic operations and memory synchronization. This - provides an interface to the hardware, not an interface to the programmer. It - is aimed at a low enough level to allow any programming models or APIs - (Application Programming Interfaces) which need atomic behaviors to map - cleanly onto it. It is also modeled primarily on hardware behavior. Just as - hardware provides a "universal IR" for source languages, it also provides a - starting point for developing a "universal" atomic operation and - synchronization IR.

    - -

    These do not form an API such as high-level threading libraries, - software transaction memory systems, atomic primitives, and intrinsic - functions as found in BSD, GNU libc, atomic_ops, APR, and other system and - application libraries. The hardware interface provided by LLVM should allow - a clean implementation of all of these APIs and parallel programming models. - No one model or paradigm should be selected above others unless the hardware - itself ubiquitously does so.

    - - -

    - 'llvm.memory.barrier' Intrinsic -

    - -
    -
    Syntax:
    -
    -  declare void @llvm.memory.barrier(i1 <ll>, i1 <ls>, i1 <sl>, i1 <ss>, i1 <device>)
    -
    - -
    Overview:
    -

    The llvm.memory.barrier intrinsic guarantees ordering between - specific pairs of memory access types.

    - -
    Arguments:
    -

    The llvm.memory.barrier intrinsic requires five boolean arguments. - The first four arguments enables a specific barrier as listed below. The - fifth argument specifies that the barrier applies to io or device or uncached - memory.

    - -
      -
    • ll: load-load barrier
    • -
    • ls: load-store barrier
    • -
    • sl: store-load barrier
    • -
    • ss: store-store barrier
    • -
    • device: barrier applies to device and uncached memory also.
    • -
    - -
    Semantics:
    -

    This intrinsic causes the system to enforce some ordering constraints upon - the loads and stores of the program. This barrier does not - indicate when any events will occur, it only enforces - an order in which they occur. For any of the specified pairs of load - and store operations (f.ex. load-load, or store-load), all of the first - operations preceding the barrier will complete before any of the second - operations succeeding the barrier begin. Specifically the semantics for each - pairing is as follows:

    - -
      -
    • ll: All loads before the barrier must complete before any load - after the barrier begins.
    • -
    • ls: All loads before the barrier must complete before any - store after the barrier begins.
    • -
    • ss: All stores before the barrier must complete before any - store after the barrier begins.
    • -
    • sl: All stores before the barrier must complete before any - load after the barrier begins.
    • -
    - -

    These semantics are applied with a logical "and" behavior when more than one - is enabled in a single memory barrier intrinsic.

    - -

    Backends may implement stronger barriers than those requested when they do - not support as fine grained a barrier as requested. Some architectures do - not need all types of barriers and on such architectures, these become - noops.

    - -
    Example:
    -
    -%mallocP  = tail call i8* @malloc(i32 ptrtoint (i32* getelementptr (i32* null, i32 1) to i32))
    -%ptr      = bitcast i8* %mallocP to i32*
    -            store i32 4, %ptr
    -
    -%result1  = load i32* %ptr      ; yields {i32}:result1 = 4
    -            call void @llvm.memory.barrier(i1 false, i1 true, i1 false, i1 false, i1 true)
    -                                ; guarantee the above finishes
    -            store i32 8, %ptr   ; before this begins
    -
    - -
    - - -

    - 'llvm.atomic.cmp.swap.*' Intrinsic -

    - -
    - -
    Syntax:
    -

    This is an overloaded intrinsic. You can use llvm.atomic.cmp.swap on - any integer bit width and for different address spaces. Not all targets - support all bit widths however.

    - -
    -  declare i8 @llvm.atomic.cmp.swap.i8.p0i8(i8* <ptr>, i8 <cmp>, i8 <val>)
    -  declare i16 @llvm.atomic.cmp.swap.i16.p0i16(i16* <ptr>, i16 <cmp>, i16 <val>)
    -  declare i32 @llvm.atomic.cmp.swap.i32.p0i32(i32* <ptr>, i32 <cmp>, i32 <val>)
    -  declare i64 @llvm.atomic.cmp.swap.i64.p0i64(i64* <ptr>, i64 <cmp>, i64 <val>)
    -
    - -
    Overview:
    -

    This loads a value in memory and compares it to a given value. If they are - equal, it stores a new value into the memory.

    - -
    Arguments:
    -

    The llvm.atomic.cmp.swap intrinsic takes three arguments. The result - as well as both cmp and val must be integer values with the - same bit width. The ptr argument must be a pointer to a value of - this integer type. While any bit width integer may be used, targets may only - lower representations they support in hardware.

    - -
    Semantics:
    -

    This entire intrinsic must be executed atomically. It first loads the value - in memory pointed to by ptr and compares it with the - value cmp. If they are equal, val is stored into the - memory. The loaded value is yielded in all cases. This provides the - equivalent of an atomic compare-and-swap operation within the SSA - framework.

    - -
    Examples:
    -
    -%mallocP  = tail call i8* @malloc(i32 ptrtoint (i32* getelementptr (i32* null, i32 1) to i32))
    -%ptr      = bitcast i8* %mallocP to i32*
    -            store i32 4, %ptr
    -
    -%val1     = add i32 4, 4
    -%result1  = call i32 @llvm.atomic.cmp.swap.i32.p0i32(i32* %ptr, i32 4, %val1)
    -                                          ; yields {i32}:result1 = 4
    -%stored1  = icmp eq i32 %result1, 4       ; yields {i1}:stored1 = true
    -%memval1  = load i32* %ptr                ; yields {i32}:memval1 = 8
    -
    -%val2     = add i32 1, 1
    -%result2  = call i32 @llvm.atomic.cmp.swap.i32.p0i32(i32* %ptr, i32 5, %val2)
    -                                          ; yields {i32}:result2 = 8
    -%stored2  = icmp eq i32 %result2, 5       ; yields {i1}:stored2 = false
    -
    -%memval2  = load i32* %ptr                ; yields {i32}:memval2 = 8
    -
    - -
    - - -

    - 'llvm.atomic.swap.*' Intrinsic -

    - -
    -
    Syntax:
    - -

    This is an overloaded intrinsic. You can use llvm.atomic.swap on any - integer bit width. Not all targets support all bit widths however.

    - -
    -  declare i8 @llvm.atomic.swap.i8.p0i8(i8* <ptr>, i8 <val>)
    -  declare i16 @llvm.atomic.swap.i16.p0i16(i16* <ptr>, i16 <val>)
    -  declare i32 @llvm.atomic.swap.i32.p0i32(i32* <ptr>, i32 <val>)
    -  declare i64 @llvm.atomic.swap.i64.p0i64(i64* <ptr>, i64 <val>)
    -
    - -
    Overview:
    -

    This intrinsic loads the value stored in memory at ptr and yields - the value from memory. It then stores the value in val in the memory - at ptr.

    - -
    Arguments:
    -

    The llvm.atomic.swap intrinsic takes two arguments. Both - the val argument and the result must be integers of the same bit - width. The first argument, ptr, must be a pointer to a value of this - integer type. The targets may only lower integer representations they - support.

    - -
    Semantics:
    -

    This intrinsic loads the value pointed to by ptr, yields it, and - stores val back into ptr atomically. This provides the - equivalent of an atomic swap operation within the SSA framework.

    - -
    Examples:
    -
    -%mallocP  = tail call i8* @malloc(i32 ptrtoint (i32* getelementptr (i32* null, i32 1) to i32))
    -%ptr      = bitcast i8* %mallocP to i32*
    -            store i32 4, %ptr
    -
    -%val1     = add i32 4, 4
    -%result1  = call i32 @llvm.atomic.swap.i32.p0i32(i32* %ptr, i32 %val1)
    -                                        ; yields {i32}:result1 = 4
    -%stored1  = icmp eq i32 %result1, 4     ; yields {i1}:stored1 = true
    -%memval1  = load i32* %ptr              ; yields {i32}:memval1 = 8
    -
    -%val2     = add i32 1, 1
    -%result2  = call i32 @llvm.atomic.swap.i32.p0i32(i32* %ptr, i32 %val2)
    -                                        ; yields {i32}:result2 = 8
    -
    -%stored2  = icmp eq i32 %result2, 8     ; yields {i1}:stored2 = true
    -%memval2  = load i32* %ptr              ; yields {i32}:memval2 = 2
    -
    - -
    - - -

    - 'llvm.atomic.load.add.*' Intrinsic -

    - -
    - -
    Syntax:
    -

    This is an overloaded intrinsic. You can use llvm.atomic.load.add on - any integer bit width. Not all targets support all bit widths however.

    - -
    -  declare i8 @llvm.atomic.load.add.i8.p0i8(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.add.i16.p0i16(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.add.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.add.i64.p0i64(i64* <ptr>, i64 <delta>)
    -
    - -
    Overview:
    -

    This intrinsic adds delta to the value stored in memory - at ptr. It yields the original value at ptr.

    - -
    Arguments:
    -

    The intrinsic takes two arguments, the first a pointer to an integer value - and the second an integer value. The result is also an integer value. These - integer types can have any bit width, but they must all have the same bit - width. The targets may only lower integer representations they support.

    - -
    Semantics:
    -

    This intrinsic does a series of operations atomically. It first loads the - value stored at ptr. It then adds delta, stores the result - to ptr. It yields the original value stored at ptr.

    - -
    Examples:
    -
    -%mallocP  = tail call i8* @malloc(i32 ptrtoint (i32* getelementptr (i32* null, i32 1) to i32))
    -%ptr      = bitcast i8* %mallocP to i32*
    -            store i32 4, %ptr
    -%result1  = call i32 @llvm.atomic.load.add.i32.p0i32(i32* %ptr, i32 4)
    -                                ; yields {i32}:result1 = 4
    -%result2  = call i32 @llvm.atomic.load.add.i32.p0i32(i32* %ptr, i32 2)
    -                                ; yields {i32}:result2 = 8
    -%result3  = call i32 @llvm.atomic.load.add.i32.p0i32(i32* %ptr, i32 5)
    -                                ; yields {i32}:result3 = 10
    -%memval1  = load i32* %ptr      ; yields {i32}:memval1 = 15
    -
    - -
    - - -

    - 'llvm.atomic.load.sub.*' Intrinsic -

    - -
    - -
    Syntax:
    -

    This is an overloaded intrinsic. You can use llvm.atomic.load.sub on - any integer bit width and for different address spaces. Not all targets - support all bit widths however.

    - -
    -  declare i8 @llvm.atomic.load.sub.i8.p0i32(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.sub.i16.p0i32(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.sub.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.sub.i64.p0i32(i64* <ptr>, i64 <delta>)
    -
    - -
    Overview:
    -

    This intrinsic subtracts delta to the value stored in memory at - ptr. It yields the original value at ptr.

    - -
    Arguments:
    -

    The intrinsic takes two arguments, the first a pointer to an integer value - and the second an integer value. The result is also an integer value. These - integer types can have any bit width, but they must all have the same bit - width. The targets may only lower integer representations they support.

    - -
    Semantics:
    -

    This intrinsic does a series of operations atomically. It first loads the - value stored at ptr. It then subtracts delta, stores the - result to ptr. It yields the original value stored - at ptr.

    - -
    Examples:
    -
    -%mallocP  = tail call i8* @malloc(i32 ptrtoint (i32* getelementptr (i32* null, i32 1) to i32))
    -%ptr      = bitcast i8* %mallocP to i32*
    -            store i32 8, %ptr
    -%result1  = call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %ptr, i32 4)
    -                                ; yields {i32}:result1 = 8
    -%result2  = call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %ptr, i32 2)
    -                                ; yields {i32}:result2 = 4
    -%result3  = call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %ptr, i32 5)
    -                                ; yields {i32}:result3 = 2
    -%memval1  = load i32* %ptr      ; yields {i32}:memval1 = -3
    -
    - -
    - - -

    - - 'llvm.atomic.load.and.*' Intrinsic - -
    - - 'llvm.atomic.load.nand.*' Intrinsic - -
    - - 'llvm.atomic.load.or.*' Intrinsic - -
    - - 'llvm.atomic.load.xor.*' Intrinsic - -

    - -
    - -
    Syntax:
    -

    These are overloaded intrinsics. You can - use llvm.atomic.load_and, llvm.atomic.load_nand, - llvm.atomic.load_or, and llvm.atomic.load_xor on any integer - bit width and for different address spaces. Not all targets support all bit - widths however.

    - -
    -  declare i8 @llvm.atomic.load.and.i8.p0i8(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.and.i16.p0i16(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.and.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.and.i64.p0i64(i64* <ptr>, i64 <delta>)
    -
    - -
    -  declare i8 @llvm.atomic.load.or.i8.p0i8(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.or.i16.p0i16(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.or.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.or.i64.p0i64(i64* <ptr>, i64 <delta>)
    -
    - -
    -  declare i8 @llvm.atomic.load.nand.i8.p0i32(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.nand.i16.p0i32(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.nand.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.nand.i64.p0i32(i64* <ptr>, i64 <delta>)
    -
    - -
    -  declare i8 @llvm.atomic.load.xor.i8.p0i32(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.xor.i16.p0i32(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.xor.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.xor.i64.p0i32(i64* <ptr>, i64 <delta>)
    -
    - -
    Overview:
    -

    These intrinsics bitwise the operation (and, nand, or, xor) delta to - the value stored in memory at ptr. It yields the original value - at ptr.

    - -
    Arguments:
    -

    These intrinsics take two arguments, the first a pointer to an integer value - and the second an integer value. The result is also an integer value. These - integer types can have any bit width, but they must all have the same bit - width. The targets may only lower integer representations they support.

    - -
    Semantics:
    -

    These intrinsics does a series of operations atomically. They first load the - value stored at ptr. They then do the bitwise - operation delta, store the result to ptr. They yield the - original value stored at ptr.

    - -
    Examples:
    -
    -%mallocP  = tail call i8* @malloc(i32 ptrtoint (i32* getelementptr (i32* null, i32 1) to i32))
    -%ptr      = bitcast i8* %mallocP to i32*
    -            store i32 0x0F0F, %ptr
    -%result0  = call i32 @llvm.atomic.load.nand.i32.p0i32(i32* %ptr, i32 0xFF)
    -                                ; yields {i32}:result0 = 0x0F0F
    -%result1  = call i32 @llvm.atomic.load.and.i32.p0i32(i32* %ptr, i32 0xFF)
    -                                ; yields {i32}:result1 = 0xFFFFFFF0
    -%result2  = call i32 @llvm.atomic.load.or.i32.p0i32(i32* %ptr, i32 0F)
    -                                ; yields {i32}:result2 = 0xF0
    -%result3  = call i32 @llvm.atomic.load.xor.i32.p0i32(i32* %ptr, i32 0F)
    -                                ; yields {i32}:result3 = FF
    -%memval1  = load i32* %ptr      ; yields {i32}:memval1 = F0
    -
    - + dependent code, turning it into a function. Then tramp needs to be + passed to llvm.adjust.trampoline to get a pointer + which can be bitcast (to a new function) and + called. The new function's signature is the same as that of + func with any arguments marked with the nest attribute + removed. At most one such nest argument is allowed, and it must be of + pointer type. Calling the new function is equivalent to calling func + with the same argument list, but with nval used for the missing + nest argument. If, after calling llvm.init.trampoline, the + memory pointed to by tramp is modified, then the effect of any later call + to the returned function pointer is undefined.

    - - 'llvm.atomic.load.max.*' Intrinsic - -
    - - 'llvm.atomic.load.min.*' Intrinsic - -
    - - 'llvm.atomic.load.umax.*' Intrinsic - -
    - - 'llvm.atomic.load.umin.*' Intrinsic + + 'llvm.adjust.trampoline' Intrinsic

    Syntax:
    -

    These are overloaded intrinsics. You can use llvm.atomic.load_max, - llvm.atomic.load_min, llvm.atomic.load_umax, and - llvm.atomic.load_umin on any integer bit width and for different - address spaces. Not all targets support all bit widths however.

    -
    -  declare i8 @llvm.atomic.load.max.i8.p0i8(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.max.i16.p0i16(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.max.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.max.i64.p0i64(i64* <ptr>, i64 <delta>)
    -
    - -
    -  declare i8 @llvm.atomic.load.min.i8.p0i8(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.min.i16.p0i16(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.min.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.min.i64.p0i64(i64* <ptr>, i64 <delta>)
    -
    - -
    -  declare i8 @llvm.atomic.load.umax.i8.p0i8(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.umax.i16.p0i16(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.umax.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.umax.i64.p0i64(i64* <ptr>, i64 <delta>)
    -
    - -
    -  declare i8 @llvm.atomic.load.umin.i8.p0i8(i8* <ptr>, i8 <delta>)
    -  declare i16 @llvm.atomic.load.umin.i16.p0i16(i16* <ptr>, i16 <delta>)
    -  declare i32 @llvm.atomic.load.umin.i32.p0i32(i32* <ptr>, i32 <delta>)
    -  declare i64 @llvm.atomic.load.umin.i64.p0i64(i64* <ptr>, i64 <delta>)
    +  declare i8* @llvm.adjust.trampoline(i8* <tramp>)
     
    Overview:
    -

    These intrinsics takes the signed or unsigned minimum or maximum of - delta and the value stored in memory at ptr. It yields the - original value at ptr.

    +

    This performs any required machine-specific adjustment to the address of a + trampoline (passed as tramp).

    Arguments:
    -

    These intrinsics take two arguments, the first a pointer to an integer value - and the second an integer value. The result is also an integer value. These - integer types can have any bit width, but they must all have the same bit - width. The targets may only lower integer representations they support.

    +

    tramp must point to a block of memory which already has trampoline code + filled in by a previous call to llvm.init.trampoline + .

    Semantics:
    -

    These intrinsics does a series of operations atomically. They first load the - value stored at ptr. They then do the signed or unsigned min or - max delta and the value, store the result to ptr. They - yield the original value stored at ptr.

    - -
    Examples:
    -
    -%mallocP  = tail call i8* @malloc(i32 ptrtoint (i32* getelementptr (i32* null, i32 1) to i32))
    -%ptr      = bitcast i8* %mallocP to i32*
    -            store i32 7, %ptr
    -%result0  = call i32 @llvm.atomic.load.min.i32.p0i32(i32* %ptr, i32 -2)
    -                                ; yields {i32}:result0 = 7
    -%result1  = call i32 @llvm.atomic.load.max.i32.p0i32(i32* %ptr, i32 8)
    -                                ; yields {i32}:result1 = -2
    -%result2  = call i32 @llvm.atomic.load.umin.i32.p0i32(i32* %ptr, i32 10)
    -                                ; yields {i32}:result2 = 8
    -%result3  = call i32 @llvm.atomic.load.umax.i32.p0i32(i32* %ptr, i32 30)
    -                                ; yields {i32}:result3 = 8
    -%memval1  = load i32* %ptr      ; yields {i32}:memval1 = 30
    -
    +

    On some architectures the address of the code to be executed needs to be + different to the address where the trampoline is actually stored. This + intrinsic returns the executable address corresponding to tramp + after performing the required machine specific adjustments. + The pointer returned can then be bitcast and + executed. +

    @@ -8456,9 +8396,33 @@ LLVM.

    None.

    Semantics:
    -

    This intrinsics is lowered to the target dependent trap instruction. If the +

    This intrinsic is lowered to the target dependent trap instruction. If the target does not have a trap instruction, this intrinsic will be lowered to - the call of the abort() function.

    + a call of the abort() function.

    + +
    + + +

    + 'llvm.debugtrap' Intrinsic +

    + +
    + +
    Syntax:
    +
    +  declare void @llvm.debugtrap()
    +
    + +
    Overview:
    +

    The 'llvm.debugtrap' intrinsic.

    + +
    Arguments:
    +

    None.

    + +
    Semantics:
    +

    This intrinsic is lowered to code which is intended to cause an execution + trap with the intention of requesting the attention of a debugger.

    @@ -8505,8 +8469,8 @@ LLVM.

    Syntax:
    -  declare i32 @llvm.objectsize.i32(i8* <object>, i1 <type>)
    -  declare i64 @llvm.objectsize.i64(i8* <object>, i1 <type>)
    +  declare i32 @llvm.objectsize.i32(i8* <object>, i1 <min>)
    +  declare i64 @llvm.objectsize.i64(i8* <object>, i1 <min>)
     
    Overview:
    @@ -8519,22 +8483,46 @@ LLVM.

    Arguments:

    The llvm.objectsize intrinsic takes two arguments. The first argument is a pointer to or into the object. The second argument - is a boolean 0 or 1. This argument determines whether you want the - maximum (0) or minimum (1) bytes remaining. This needs to be a literal 0 or - 1, variables are not allowed.

    + is a boolean and determines whether llvm.objectsize returns 0 (if + true) or -1 (if false) when the object size is unknown. + The second argument only accepts constants.

    Semantics:
    -

    The llvm.objectsize intrinsic is lowered to either a constant - representing the size of the object concerned, or i32/i64 -1 or 0, - depending on the type argument, if the size cannot be determined at - compile time.

    +

    The llvm.objectsize intrinsic is lowered to a constant representing + the size of the object concerned. If the size cannot be determined at compile + time, llvm.objectsize returns i32/i64 -1 or 0 + (depending on the min argument).

    + +

    + 'llvm.expect' Intrinsic +

    + +
    + +
    Syntax:
    +
    +  declare i32 @llvm.expect.i32(i32 <val>, i32 <expected_val>)
    +  declare i64 @llvm.expect.i64(i64 <val>, i64 <expected_val>)
    +
    + +
    Overview:
    +

    The llvm.expect intrinsic provides information about expected (the + most probable) value of val, which can be used by optimizers.

    +
    Arguments:
    +

    The llvm.expect intrinsic takes two arguments. The first + argument is a value. The second argument is an expected value, this needs to + be a constant value, variables are not allowed.

    + +
    Semantics:
    +

    This intrinsic is lowered to the val.

    +
    In0In1OutIn0In1Out
    0