Module core::arch::nvptx

source ·
🔬This is a nightly-only experimental API. (stdarch_nvptx #111199)
Available on target_arch="nvptx64" only.
Expand description

Platform-specific intrinsics for the NVPTX platform.

See the module documentation for more details.

Structs§

  • f16x2Experimental
    PTX-specific 32-bit wide floating point (f16 x 2) vector type

Functions§

  • __assert_failExperimental
    Syscall to be used whenever the assert expression produces a false value.
  • _block_dim_xExperimental
    x-th thread-block dimension.
  • _block_dim_yExperimental
    y-th thread-block dimension.
  • _block_dim_zExperimental
    z-th thread-block dimension.
  • _block_idx_xExperimental
    x-th thread-block index.
  • _block_idx_yExperimental
    y-th thread-block index.
  • _block_idx_zExperimental
    z-th thread-block index.
  • _grid_dim_xExperimental
    x-th block-grid dimension.
  • _grid_dim_yExperimental
    y-th block-grid dimension.
  • _grid_dim_zExperimental
    z-th block-grid dimension.
  • _syncthreadsExperimental
    Synchronizes all threads in the block.
  • _thread_idx_xExperimental
    x-th thread index.
  • _thread_idx_yExperimental
    y-th thread index.
  • _thread_idx_zExperimental
    z-th thread index.
  • f16x2_addExperimental
    Add two values, round to nearest even
  • f16x2_fmaExperimental
    Fused multiply-add, round to nearest even
  • f16x2_maxExperimental
    Find the maximum of two values
  • f16x2_max_nanExperimental
    Find the maximum of two values, NaNs pass through.
  • f16x2_minExperimental
    Find the minimum of two values
  • f16x2_min_nanExperimental
    Find the minimum of two values, NaNs pass through.
  • f16x2_mulExperimental
    Multiply two values, round to nearest even
  • f16x2_negExperimental
    Arithmetic negate
  • f16x2_subExperimental
    Subtract two values, round to nearest even
  • freeExperimental
    Free previously dynamically allocated memory.
  • mallocExperimental
    Allocate memory dynamically from a fixed-size heap in global memory.
  • trapExperimental
    Generates the trap instruction TRAP
  • vprintfExperimental
    Print formatted output from a kernel to a host-side output stream.