dwww Home | Manual pages | Find package

LLI(1)                           User Commands                          LLI(1)

NAME
       lli - manual page for lli 11

DESCRIPTION
       OVERVIEW: llvm interpreter & dynamic compiler

       USAGE: lli [options] <input bitcode> <program arguments>...

       OPTIONS:

       Color Options:

       --color                                                               -
              Use colors in output (default=autodetect)

       General options:

       -O=<char>                                                             -
              Optimization level. [-O0, -O1, -O2, or -O3] (default = '-O2')

       --aarch64-neon-syntax=<value>                                         -
              Choose style of NEON code to emit from AArch64 backend:

       =generic
              -   Emit generic NEON assembly

       =apple -   Emit Apple-style NEON assembly

       --addrsig                                                             -
              Emit an address-significance table

       --amdgpu-bypass-slow-div                                              -
              Skip 64-bit divide for dynamic 32-bit values

       --amdgpu-disable-loop-alignment                                       -
              Do not align and prefetch loops

       --amdgpu-disable-power-sched                                          -
              Disable scheduling to minimize mAI power bursts

       --amdgpu-dpp-combine                                                  -
              Enable DPP combiner

       --amdgpu-dump-hsa-metadata                                            -
              Dump AMDGPU HSA Metadata

       --amdgpu-enable-global-sgpr-addr                                      -
              Enable use of SGPR regs for GLOBAL LOAD/STORE instructions

       --amdgpu-enable-merge-m0                                              -
              Merge and hoist M0 initializations

       --amdgpu-promote-alloca-to-vector-limit=<uint>                        -
              Maximum byte size to consider promote alloca to vector

       --amdgpu-reserve-vgpr-for-sgpr-spill                                  -
              Allocates one VGPR for future SGPR Spill

       --amdgpu-sdwa-peephole                                                -
              Enable SDWA peepholer

       --amdgpu-verify-hsa-metadata                                          -
              Verify AMDGPU HSA Metadata

       --amdgpu-vgpr-index-mode                                              -
              Use GPR indexing mode instead of movrel for vector indexing

       --arm-add-build-attributes                                           -

       --arm-implicit-it=<value>                                             -
              Allow conditional instructions outdside of an IT block

       =always
              -   Accept in both ISAs, emit implicit ITs in Thumb

       =never -   Warn in ARM, reject in Thumb

       =arm   -   Accept in ARM, reject in Thumb

       =thumb -   Warn in ARM, emit implicit ITs in Thumb

       --asm-show-inst                                                       -
              Emit internal instruction representation to assembly file

       --atomic-counter-update-promoted                                      -
       Do counter update using atomic fetch add
              for promoted counters only

       --atomic-first-counter                                                -
              Use  atomic  fetch  add for first counter in a function (usually
              the entry counter)

       --basicblock-sections=<all | <function list (file)> | labels | none>  -
              Emit basic blocks into separate sections

       --bounds-checking-single-trap                                         -
              Use one trap block per function

       --cfg-hide-deoptimize-paths                                          -

       --cfg-hide-unreachable-paths                                         -

       --code-model=<value>                                                  -
              Choose code model

       =tiny  -   Tiny code model

       =small -   Small code model

       =kernel
              -   Kernel code model

       =medium
              -   Medium code model

       =large -   Large code model

       --compile-threads=<uint>                                              -
              Choose the number of compile threads (jit-kind=orc-lazy only)

       --cost-kind=<value>                                                   -
              Target cost kind

       =throughput
              -   Reciprocal throughput

       =latency
              -   Instruction latency

       =code-size
              -   Code size

       --cvp-dont-add-nowrap-flags                                          -

       --data-sections                                                       -
              Emit data into separate sections

       --debug-entry-values                                                  -
              Enable debug info for the debug entry values.

       --debugger-tune=<value>                                               -
              Tune debug info for a particular debugger

       =gdb   -   gdb

       =lldb  -   lldb

       =sce   -   SCE targets (e.g. PS4)

       --debugify-level=<value>                                              -
              Kind of debug info to add

       =locations
              -   Locations only

       =location+variables
              -   Locations and Variables

       --debugify-quiet                                                      -
              Suppress verbose debugify output

       --denormal-fp-math=<value>                                            -
              Select which denormal numbers the code is permitted to require

       =ieee  -   IEEE 754 denormal numbers

       =preserve-sign
              -    the  sign  of a  flushed-to-zero number is preserved in the
              sign of 0

       =positive-zero
              -   denormals are flushed to positive zero

       --denormal-fp-math-f32=<value>                                        -
              Select  which  denormal numbers the code is permitted to require
              for float

       =ieee  -   IEEE 754 denormal numbers

       =preserve-sign
              -   the sign of a  flushed-to-zero number is  preserved  in  the
              sign of 0

       =positive-zero
              -   denormals are flushed to positive zero

       --disable-lazy-compilation                                            -
              Disable JIT lazy compilation

       --disable-promote-alloca-to-lds                                       -
              Disable promote alloca to LDS

       --disable-promote-alloca-to-vector                                    -
              Disable promote alloca to vector

       --disable-tail-calls                                                  -
              Never emit tail calls

       --dlopen=<string>                                                     -
              Dynamic libraries to load before linking

       --do-counter-promotion                                                -
              Do counter register promotion

       --dwarf-version=<int>                                                 -
              Dwarf version

       --dwarf64                                                             -
              Generate debugging info in the 64-bit DWARF format

       --emit-call-site-info                                                 -
              Emit call site debug information, if debug  information  is  en-
              abled.

       --emscripten-cxx-exceptions-allowed=<string>                          -
              The list of function names in which  Emscripten-style  exception
              handling  is enabled (see emscripten EMSCRIPTEN_CATCHING_ALLOWED
              options)

       --emulated-tls                                                        -
              Use emulated TLS model

       --enable-cache-manager                                                -
              Use cache manager to save/load modules

       --enable-cse-in-irtranslator                                          -
              Should enable CSE in irtranslator

       --enable-cse-in-legalizer                                             -
              Should enable CSE in Legalizer

       --enable-emscripten-cxx-exceptions                                    -
              WebAssembly Emscripten-style exception handling

       --enable-emscripten-sjlj                                              -
              WebAssembly Emscripten-style setjmp/longjmp handling

       --enable-gvn-hoist                                                    -
              Enable the GVN hoisting pass (default = off)

       --enable-gvn-memdep                                                  -

       --enable-gvn-sink                                                     -
              Enable the GVN sinking pass (default = off)

       --enable-load-in-loop-pre                                            -

       --enable-load-pre                                                    -

       --enable-loop-simplifycfg-term-folding                               -

       --enable-name-compression                                             -
              Enable name/filename string compression

       --enable-no-infs-fp-math                                              -
              Enable FP math optimizations that assume no +-Infs

       --enable-no-nans-fp-math                                              -
              Enable FP math optimizations that assume no NaNs

       --enable-no-signed-zeros-fp-math                                      -
              Enable FP math optimizations that assume the sign of  0  is  in-
              significant

       --enable-no-trapping-fp-math                                          -
              Enable setting the FP exceptions build attribute not to use  ex-
              ceptions

       --enable-unsafe-fp-math                                               -
              Enable optimizations that may decrease FP precision

       --entry-function=<function>                                           -
              Specify the entry function (default = 'main') of the executable

       --exception-model=<value>                                             -
              exception model

       =default
              -   default exception handling model

       =dwarf -   DWARF-like CFI based exception handling

       =sjlj  -   SjLj exception handling

       =arm   -   ARM EHABI exceptions

       =wineh -   Windows exception model

       =wasm  -   WebAssembly exception handling

       --extra-archive=<input archive>                                       -
              Extra archive files to be loaded

       --extra-module=<input  bitcode>                                       -
              Extra modules to be loaded

       --extra-object=<input object>                                         -
              Extra object files to be loaded

       --fake-argv0=<executable>                                             -
              Override the 'argv[0]' value passed into the executing program

       --fatal-warnings                                                      -
              Treat warnings as errors

       --filetype=<value>                                                    -
              Choose a file type (not all types are supported by all targets):

       =asm   -   Emit an assembly ('.s') file

       =obj   -   Emit a native object ('.o') file

       =null  -   Emit nothing, for performance testing

       --float-abi=<value>                                                   -
              Choose float ABI type

       =default
              -   Target default float ABI type

       =soft  -   Soft float ABI (implied by -soft-float)

       =hard  -   Hard float ABI (uses FP registers)

       --force-dwarf-frame-section                                           -
              Always emit a debug frame section.

       --force-interpreter                                                   -
              Force interpretation: disable JIT

       --fp-contract=<value>                                                 -
              Enable aggressive formation of fused FP ops

       =fast  -   Fuse FP ops whenever profitable

       =on    -   Only fuse 'blessed' FP ops.

       =off   -   Only fuse FP ops when the result won't be affected.

       --frame-pointer=<value>                                               -
              Specify frame pointer elimination optimization

       =all   -   Disable frame pointer elimination

       =non-leaf
              -   Disable frame pointer elimination for non-leaf frame

       =none  -   Enable frame pointer elimination

       --function-sections                                                   -
              Emit functions into separate sections

       --gpsize=<uint>                                                       -
       Global Pointer Addressing Size.
              The default size is 8.

       --hash-based-counter-split                                            -
              Rename counter variable of a comdat function based on cfg hash

       --hot-cold-split                                                      -
              Enable hot-cold splitting pass

       --import-all-index                                                    -
              Import all external functions in index.

       --incremental-linker-compatible                                       -
              When  used  with  filetype=obj, emit an object file which can be
              used with an incremental linker

       --instcombine-code-sinking                                            -
              Enable code sinking

       --instcombine-guard-widening-window=<uint>                            -
              How wide an instruction window to  bypass  looking  for  another
              guard

       --instcombine-max-iterations=<uint>                                   -
              Limit the maximum number of instruction combining iterations

       --instcombine-max-num-phis=<uint>                                     -
              Maximum number phis to handle in intptr/ptrint folding

       --instcombine-maxarray-size=<uint>                                    -
              Maximum array size considered when doing a combine

       --instcombine-negator-enabled                                         -
              Should we attempt to sink negations?

       --instcombine-negator-max-depth=<uint>                                -
              What is the maximal lookup depth when trying to check  for  via-
              bility of negation sinking.

       --instrprof-atomic-counter-update-all                                 -
              Make all profile counter updates atomic (for testing only)

       --internalize-public-api-file=<filename>                              -
              A file containing list of symbol names to preserve

       --internalize-public-api-list=<list>                                  -
              A list of symbol names to preserve

       --iterative-counter-promotion                                         -
              Allow counter promotion across the whole loop nest.

       --jd=<string>                                                         -
              Specifies the JITDylib to be used for any subsequent -extra-mod-
              ule arguments.

       --jit-kind=<value>                                                    -
              Choose underlying JIT kind.

       =mcjit
              -   MCJIT

       =orc-mcjit
              -   Orc-based MCJIT replacement (deprecated)

       =orc-lazy
              -   Orc-based lazy JIT.

       --load=<pluginfilename>                                               -
              Load the specified plugin

       --lto-embed-bitcode                                                   -
              Embed LLVM bitcode in object files produced by LTO

       --lto-pass-remarks-filter=<regex>                                     -
              Only  record  optimization remarks from passes whose names match
              the given regular expression

       --lto-pass-remarks-format=<format>                                    -
              The format used for serializing remarks (default: YAML)

       --lto-pass-remarks-output=<filename>                                  -
              Output filename for pass remarks

       --march=<string>                                                      -
              Architecture to generate code for (see --version)

       --matrix-default-layout=<value>                                       -
              Sets the default matrix layout

       =column-major
              -   Use column-major layout

       =row-major
              -   Use row-major layout

       --mattr=<a1,+a2,-a3,...>                                              -
              Target specific attributes (-mattr=help for details)

       --max-counter-promotions=<int>                                        -
              Max number of allowed counter promotions

       --max-counter-promotions-per-loop=<uint>                              -
              Max  number counter promotions per loop to avoid increasing reg-
              ister pressure too much

       --mc-relax-all                                                        -
              When used with filetype=obj, relax all fixups in the emitted ob-
              ject file

       --mcjit-remote-process=<filename>                                     -
       Specify  the  filename of the process to launch for remote MCJIT execu-
       tion.
              If  none  is  specified,  remote  execution  will  be  simulated
              in-process.

       --mcpu=<cpu-name>                                                     -
              Target a specific cpu type (-mcpu=help for details)

       --meabi=<value>                                                       -
              Set EABI type (default depends on triple):

       =default
              -   Triple default EABI version

       =4     -   EABI version 4

       =5     -   EABI version 5

       =gnu   -   EABI GNU

       --memop-size-large=<uint>                                             -
              Set large value thresthold in memory intrinsic  size  profiling.
              Value of 0 disables the large value profiling.

       --memop-size-range=<string>                                           -
              Set the range of size in memory intrinsic calls to  be  profiled
              precisely, in a format of <start_val>:<end_val>

       --merror-missing-parenthesis                                          -
              Error for missing parenthesis around predicate registers

       --merror-noncontigious-register                                       -
              Error for register names that aren't contigious

       --mhvx                                                                -
              Enable Hexagon Vector eXtensions

       --mhvx=<value>                                                        -
              Enable Hexagon Vector eXtensions

       =v60   -   Build for HVX v60

       =v62   -   Build for HVX v62

       =v65   -   Build for HVX v65

       =v66   -   Build for HVX v66

       =v67   -   Build for HVX v67

       --mips-compact-branches=<value>                                       -
              MIPS Specific: Compact branch policy.

       =never
              -   Do not use compact branches if possible.

       =optimal
              -   Use compact branches where appropriate (default).

       =always
              -   Always use compact branches if possible.

       --mips16-constant-islands                                             -
              Enable mips16 constant islands.

       --mips16-hard-float                                                   -
              Enable mips16 hard float.

       --mir-strip-debugify-only                                             -
              Should  mir-strip-debug  only  strip  debug info from debugified
              modules by default

       --mno-compound                                                        -
              Disable looking for compound instructions for Hexagon

       --mno-fixup                                                           -
              Disable fixing up resolved relocations for Hexagon

       --mno-ldc1-sdc1                                                       -
              Expand  double precision loads and stores to their single preci-
              sion counterparts

       --mno-pairing                                                         -
              Disable looking for duplex instructions for Hexagon

       --mtriple=<string>                                                    -
              Override target triple for module

       --mwarn-missing-parenthesis                                           -
              Warn for missing parenthesis around predicate registers

       --mwarn-noncontigious-register                                        -
              Warn for register names that arent contigious

       --mwarn-sign-mismatch                                                 -
              Warn for mismatching a signed and unsigned value

       --no-deprecated-warn                                                  -
              Suppress all deprecated warnings

       --no-discriminators                                                   -
              Disable generation of discriminator information.

       --no-process-syms                                                     -
              Do not resolve lli process symbols in JIT'd code

       --no-warn                                                             -
              Suppress all warnings

       --no-xray-index                                                       -
              Don't emit xray_fn_idx section

       --nozero-initialized-in-bss                                           -
              Don't place zero-initialized symbols into bss section

       --nvptx-sched4reg                                                     -
              NVPTX Specific: schedule for register pressue

       --object-cache-dir=<string>                                           -
              Directory to store cached object files (must be user writable)

       --per-module-lazy                                                     -
              Performs lazy compilation on whole module boundaries rather than
              individual functions

       --poison-checking-function-local                                      -
              Check that returns are non-poison (for testing)

       --r600-ir-structurize                                                 -
              Use StructurizeCFG IR pass

       --rdf-dump                                                           -

       --rdf-limit=<uint>                                                   -

       --relax-elf-relocations                                               -
              Emit GOTPCRELX/REX_GOTPCRELX instead of GOTPCREL on x86-64 ELF

       --relocation-model=<value>                                            -
              Choose relocation model

       =static
              -   Non-relocatable code

       =pic   -   Fully relocatable, position independent code

       =dynamic-no-pic
              -   Relocatable external references, non-relocatable code

       =ropi  -   Code and read-only data relocatable, accessed PC-relative

       =rwpi  -    Read-write  data  relocatable,  accessed relative to static
              base

       =ropi-rwpi
              -   Combination of ropi and rwpi

       --remote-mcjit                                                        -
              Execute MCJIT'ed code in a separate process.

       --runtime-counter-relocation                                          -
              Enable relocating counters at runtime.

       --safepoint-ir-verifier-print-only                                   -

       --sample-profile-check-record-coverage=<N>                            -
              Emit  a  warning if less than N% of records in the input profile
              are matched to the IR.

       --sample-profile-check-sample-coverage=<N>                            -
              Emit  a  warning if less than N% of samples in the input profile
              are matched to the IR.

       --sample-profile-max-propagate-iterations=<uint>                      -
              Maximum number of iterations to go through when propagating sam-
              ple block/edge weights through the CFG.

       --soft-float                                                          -
              Generate software floating point library calls

       --speculative-counter-promotion-max-exiting=<uint>                    -
       The max number of exiting blocks of a loop to allow
              speculative counter promotion

       --speculative-counter-promotion-to-loop                               -
       When  the option is false, if the target block is in a loop, the promo-
       tion will be disallowed unless the promoted counter
              update can be further/iteratively promoted into an acyclic   re-
              gion.

       --stack-alignment=<uint>                                              -
              Override default stack alignment

       --stack-size-section                                                  -
              Emit a section containing stack size metadata

       --stack-symbol-ordering                                               -
              Order local stack symbols.

       --stackrealign                                                        -
              Force align the stack to the minimum alignment

       --summary-file=<string>                                               -
              The summary file to use for function importing.

       --tail-predication=<value>                                            -
              MVE tail-predication options

       =disabled
              -   Don't tail-predicate loops

       =enabled-no-reductions
              -   Enable tail-predication, but not for reduction loops

       =enabled
              -   Enable tail-predication, including reduction loops

       =force-enabled-no-reductions
              -    Enable  tail-predication,  but not for reduction loops, and
              force this which might be unsafe

       =force-enabled
              -   Enable  tail-predication,  including  reduction  loops,  and
              force this which might be unsafe

       --tailcallopt                                                         -
              Turn fastcc calls into tail calls by (potentially) changing ABI.

       --thread-entry=<string>                                               -
              calls  the  given entry-point on a new thread (jit-kind=orc-lazy
              only)

       --thread-model=<value>                                                -
              Choose threading model

       =posix
              -   POSIX thread model

       =single
              -   Single thread model

       --threads=<int>                                                      -

       --tls-size=<uint>                                                     -
              Bit size of immediate TLS offsets

       --unique-bb-section-names                                             -
              Give unique names to every basic block section

       --unique-section-names                                                -
              Give unique names to every section

       --use-ctors                                                           -
              Use .ctors instead of .init_array.

       --verify-region-info                                                  -
              Verify region info (time consuming)

       --vp-counters-per-site=<number>                                       -
              The  average number of profile counters allocated per value pro-
              filing site.

       --vp-static-alloc                                                     -
              Do static counter allocation for value profiler

       --x86-align-branch=<string>                                           -
       Specify types of branches to align (plus separated list of types):
              jcc      indicates conditional jumps  fused     indicates  fused
              conditional  jumps jmp      indicates direct unconditional jumps
              call     indicates direct and indirect calls ret       indicates
              rets indirect indicates indirect unconditional jumps

       --x86-align-branch-boundary=<uint>                                    -
              Control how the assembler should align branches with NOP. If the
              boundary's  size is not 0, it should be a power of 2 and no less
              than 32. Branches will be aligned to prevent from  being  across
              or  against  the boundary of specified size. The default value 0
              does not align branches.

       --x86-branches-within-32B-boundaries                                  -
       Align  selected instructions to mitigate negative performance impact of
       Intel's micro code update for errata skx102.
              May break assumptions about labels corresponding  to  particular
              instructions, and should be used with caution.

       --x86-pad-max-prefix-size=<uint>                                      -
              Maximum number of prefixes to use for padding

       Generic Options:

       --help                                                                -
              Display available options (--help-hidden for more)

       --help-list                                                           -
              Display list of available options (--help-list-hidden for more)

       --version                                                             -
              Display the version of this program

       Polly Options: Configure the polly loop optimizer

       --polly                                                               -
              Enable the polly optimizer (only at -O3)

       --polly-2nd-level-tiling                                              -
              Enable a 2nd level loop of loop tiling

       --polly-ast-print-accesses                                            -
              Print memory access functions

       --polly-context=<isl parameter set>                                   -
              Provide additional constraints on the context parameters

       --polly-dce-precise-steps=<int>                                       -
              The number of precise steps  between  two  approximating  itera-
              tions.  (A value of -1 schedules another approximation stage be-
              fore the actual dead code elimination.

       --polly-delicm-max-ops=<int>                                          -
              Maximum  number  of isl operations to invest for lifetime analy-
              sis; 0=no limit

       --polly-detect-full-functions                                         -
              Allow the detection of full functions

       --polly-dump-after                                                    -
              Dump module after Polly transformations  into  a  file  suffixed
              with "-after"

       --polly-dump-after-file=<string>                                      -
              Dump module after Polly transformations to the given file

       --polly-dump-before                                                   -
              Dump  module  before  Polly transformations into a file suffixed
              with "-before"

       --polly-dump-before-file=<string>                                     -
              Dump module before Polly transformations to the given file

       --polly-enable-simplify                                               -
              Simplify SCoP after optimizations

       --polly-ignore-func=<string>                                          -
              Ignore  functions  that  match  a regex. Multiple regexes can be
              comma separated. Scop detection will ignore all  functions  that
              match ANY of the regexes provided.

       --polly-isl-arg=<argument>                                            -
              Option passed to ISL

       --polly-on-isl-error-abort                                            -
              Abort if an isl error is encountered

       --polly-only-func=<string>                                            -
              Only run on functions that match a regex. Multiple  regexes  can
              be  comma  separated.  Scop  detection will run on all functions
              that match ANY of the regexes provided.

       --polly-only-region=<identifier>                                      -
              Only run on certain regions (The provided identifier must appear
              in the name of the region's entry block

       --polly-only-scop-detection                                           -
              Only run scop detection, but no other optimizations

       --polly-optimized-scops                                               -
              Polly - Dump polyhedral description of Scops optimized with  the
              isl  scheduling  optimizer and the set of post-scheduling trans-
              formations is applied on the schedule tree

       --polly-parallel                                                      -
              Generate thread parallel code (isl codegen only)

       --polly-parallel-force                                                -
              Force generation of thread parallel code ignoring any cost model

       --polly-pattern-matching-based-opts                                   -
              Perform optimizations based on pattern matching

       --polly-process-unprofitable                                          -
              Process scops that are unlikely to benefit from Polly  optimiza-
              tions.

       --polly-register-tiling                                               -
              Enable register tiling

       --polly-report                                                        -
              Print information about the activities of Polly

       --polly-show                                                          -
              Highlight the code regions that will be optimized in a (CFG  BBs
              and LLVM-IR instructions)

       --polly-show-only                                                     -
              Highlight the code regions that will be optimized in a (CFG only
              BBs)

       --polly-stmt-granularity=<value>                                      -
              Algorithm to use for splitting basic blocks into multiple state-
              ments

       =bb    -   One statement per basic block

       =scalar-indep
              -   Scalar independence heuristic

       =store -   Store-level granularity

       --polly-target=<value>                                                -
              The hardware to target

       =cpu   -   generate CPU code

       --polly-tiling                                                        -
              Enable loop tiling

       --polly-vectorizer=<value>                                            -
              Select the vectorization strategy

       =none  -   No Vectorization

       =polly -   Polly internal vectorizer

       =stripmine
              -   Strip-mine outer loops for the loop-vectorizer to trigger

lli 11                           January 2022                           LLI(1)

Generated by dwww version 1.14 on Fri Jan 24 06:33:51 CET 2025.