bpf/standardization/instruction-set.rst

4d496be9SDavid Vernet.. contents::
4d496be9SDavid Vernet.. sectnum::
4d496be9SDavid Vernet
*7d35eb1aSDavid Vernet=======================================
*7d35eb1aSDavid VernetBPF Instruction Set Specification, v1.0
*7d35eb1aSDavid Vernet=======================================
4d496be9SDavid Vernet
*7d35eb1aSDavid VernetThis document specifies version 1.0 of the BPF instruction set.
4d496be9SDavid Vernet
4d496be9SDavid VernetDocumentation conventions
4d496be9SDavid Vernet=========================
4d496be9SDavid Vernet
2369e526SWill HawkinsFor brevity and consistency, this document refers to families
2369e526SWill Hawkinsof types using a shorthand syntax and refers to several expository,
2369e526SWill Hawkinsmnemonic functions when describing the semantics of instructions.
2369e526SWill HawkinsThe range of valid values for those types and the semantics of those
2369e526SWill Hawkinsfunctions are defined in the following subsections.
2369e526SWill Hawkins
2369e526SWill HawkinsTypes
2369e526SWill Hawkins-----
2369e526SWill HawkinsThis document refers to integer types with the notation `SN` to specify
2369e526SWill Hawkinsa type's signedness (`S`) and bit width (`N`), respectively.
2369e526SWill Hawkins
2369e526SWill Hawkins.. table:: Meaning of signedness notation.
2369e526SWill Hawkins
2369e526SWill Hawkins  ==== =========
2369e526SWill Hawkins  `S`  Meaning
2369e526SWill Hawkins  ==== =========
2369e526SWill Hawkins  `u`  unsigned
2369e526SWill Hawkins  `s`  signed
2369e526SWill Hawkins  ==== =========
2369e526SWill Hawkins
2369e526SWill Hawkins.. table:: Meaning of bit-width notation.
2369e526SWill Hawkins
2369e526SWill Hawkins  ===== =========
2369e526SWill Hawkins  `N`   Bit width
2369e526SWill Hawkins  ===== =========
2369e526SWill Hawkins  `8`   8 bits
2369e526SWill Hawkins  `16`  16 bits
2369e526SWill Hawkins  `32`  32 bits
2369e526SWill Hawkins  `64`  64 bits
2369e526SWill Hawkins  `128` 128 bits
2369e526SWill Hawkins  ===== =========
2369e526SWill Hawkins
2369e526SWill HawkinsFor example, `u32` is a type whose valid values are all the 32-bit unsigned
2369e526SWill Hawkinsnumbers and `s16` is a types whose valid values are all the 16-bit signed
2369e526SWill Hawkinsnumbers.
2369e526SWill Hawkins
2369e526SWill HawkinsFunctions
2369e526SWill Hawkins---------
2369e526SWill Hawkins* `htobe16`: Takes an unsigned 16-bit number in host-endian format and
2369e526SWill Hawkins  returns the equivalent number as an unsigned 16-bit number in big-endian
2369e526SWill Hawkins  format.
2369e526SWill Hawkins* `htobe32`: Takes an unsigned 32-bit number in host-endian format and
2369e526SWill Hawkins  returns the equivalent number as an unsigned 32-bit number in big-endian
2369e526SWill Hawkins  format.
2369e526SWill Hawkins* `htobe64`: Takes an unsigned 64-bit number in host-endian format and
2369e526SWill Hawkins  returns the equivalent number as an unsigned 64-bit number in big-endian
2369e526SWill Hawkins  format.
2369e526SWill Hawkins* `htole16`: Takes an unsigned 16-bit number in host-endian format and
2369e526SWill Hawkins  returns the equivalent number as an unsigned 16-bit number in little-endian
2369e526SWill Hawkins  format.
2369e526SWill Hawkins* `htole32`: Takes an unsigned 32-bit number in host-endian format and
2369e526SWill Hawkins  returns the equivalent number as an unsigned 32-bit number in little-endian
2369e526SWill Hawkins  format.
2369e526SWill Hawkins* `htole64`: Takes an unsigned 64-bit number in host-endian format and
2369e526SWill Hawkins  returns the equivalent number as an unsigned 64-bit number in little-endian
2369e526SWill Hawkins  format.
2369e526SWill Hawkins* `bswap16`: Takes an unsigned 16-bit number in either big- or little-endian
2369e526SWill Hawkins  format and returns the equivalent number with the same bit width but
2369e526SWill Hawkins  opposite endianness.
2369e526SWill Hawkins* `bswap32`: Takes an unsigned 32-bit number in either big- or little-endian
2369e526SWill Hawkins  format and returns the equivalent number with the same bit width but
2369e526SWill Hawkins  opposite endianness.
2369e526SWill Hawkins* `bswap64`: Takes an unsigned 64-bit number in either big- or little-endian
2369e526SWill Hawkins  format and returns the equivalent number with the same bit width but
2369e526SWill Hawkins  opposite endianness.
4d496be9SDavid Vernet
e546a119SWill Hawkins
e546a119SWill HawkinsDefinitions
e546a119SWill Hawkins-----------
e546a119SWill Hawkins
e546a119SWill Hawkins.. glossary::
e546a119SWill Hawkins
e546a119SWill Hawkins  Sign Extend
e546a119SWill Hawkins    To `sign extend an` ``X`` `-bit number, A, to a` ``Y`` `-bit number, B  ,` means to
e546a119SWill Hawkins
e546a119SWill Hawkins    #. Copy all ``X`` bits from `A` to the lower ``X`` bits of `B`.
e546a119SWill Hawkins    #. Set the value of the remaining ``Y`` - ``X`` bits of `B` to the value of
e546a119SWill Hawkins       the  most-significant bit of `A`.
e546a119SWill Hawkins
e546a119SWill Hawkins.. admonition:: Example
e546a119SWill Hawkins
e546a119SWill Hawkins  Sign extend an 8-bit number ``A`` to a 16-bit number ``B`` on a big-endian platform:
e546a119SWill Hawkins  ::
e546a119SWill Hawkins
e546a119SWill Hawkins    A:          10000110
e546a119SWill Hawkins    B: 11111111 10000110
e546a119SWill Hawkins
4d496be9SDavid VernetInstruction encoding
4d496be9SDavid Vernet====================
4d496be9SDavid Vernet
*7d35eb1aSDavid VernetBPF has two instruction encodings:
4d496be9SDavid Vernet
4d496be9SDavid Vernet* the basic instruction encoding, which uses 64 bits to encode an instruction
4d496be9SDavid Vernet* the wide instruction encoding, which appends a second 64-bit immediate (i.e.,
4d496be9SDavid Vernet  constant) value after the basic instruction for a total of 128 bits.
4d496be9SDavid Vernet
4d496be9SDavid VernetThe fields conforming an encoded basic instruction are stored in the
4d496be9SDavid Vernetfollowing order::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  opcode:8 src_reg:4 dst_reg:4 offset:16 imm:32 // In little-endian BPF.
4d496be9SDavid Vernet  opcode:8 dst_reg:4 src_reg:4 offset:16 imm:32 // In big-endian BPF.
4d496be9SDavid Vernet
4d496be9SDavid Vernet**imm**
4d496be9SDavid Vernet  signed integer immediate value
4d496be9SDavid Vernet
4d496be9SDavid Vernet**offset**
4d496be9SDavid Vernet  signed integer offset used with pointer arithmetic
4d496be9SDavid Vernet
4d496be9SDavid Vernet**src_reg**
4d496be9SDavid Vernet  the source register number (0-10), except where otherwise specified
4d496be9SDavid Vernet  (`64-bit immediate instructions`_ reuse this field for other purposes)
4d496be9SDavid Vernet
4d496be9SDavid Vernet**dst_reg**
4d496be9SDavid Vernet  destination register number (0-10)
4d496be9SDavid Vernet
4d496be9SDavid Vernet**opcode**
4d496be9SDavid Vernet  operation to perform
4d496be9SDavid Vernet
4d496be9SDavid VernetNote that the contents of multi-byte fields ('imm' and 'offset') are
4d496be9SDavid Vernetstored using big-endian byte ordering in big-endian BPF and
4d496be9SDavid Vernetlittle-endian byte ordering in little-endian BPF.
4d496be9SDavid Vernet
4d496be9SDavid VernetFor example::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  opcode                  offset imm          assembly
4d496be9SDavid Vernet         src_reg dst_reg
4d496be9SDavid Vernet  07     0       1        00 00  44 33 22 11  r1 += 0x11223344 // little
4d496be9SDavid Vernet         dst_reg src_reg
4d496be9SDavid Vernet  07     1       0        00 00  11 22 33 44  r1 += 0x11223344 // big
4d496be9SDavid Vernet
4d496be9SDavid VernetNote that most instructions do not use all of the fields.
4d496be9SDavid VernetUnused fields shall be cleared to zero.
4d496be9SDavid Vernet
4d496be9SDavid VernetAs discussed below in `64-bit immediate instructions`_, a 64-bit immediate
4d496be9SDavid Vernetinstruction uses a 64-bit immediate value that is constructed as follows.
4d496be9SDavid VernetThe 64 bits following the basic instruction contain a pseudo instruction
4d496be9SDavid Vernetusing the same format but with opcode, dst_reg, src_reg, and offset all set to zero,
4d496be9SDavid Vernetand imm containing the high 32 bits of the immediate value.
4d496be9SDavid Vernet
4d496be9SDavid VernetThis is depicted in the following figure::
4d496be9SDavid Vernet
4d496be9SDavid Vernet        basic_instruction
4d496be9SDavid Vernet  .-----------------------------.
4d496be9SDavid Vernet  |                             |
4d496be9SDavid Vernet  code:8 regs:8 offset:16 imm:32 unused:32 imm:32
4d496be9SDavid Vernet                                 |              |
4d496be9SDavid Vernet                                 '--------------'
4d496be9SDavid Vernet                                pseudo instruction
4d496be9SDavid Vernet
4d496be9SDavid VernetThus the 64-bit immediate value is constructed as follows:
4d496be9SDavid Vernet
4d496be9SDavid Vernet  imm64 = (next_imm << 32) | imm
4d496be9SDavid Vernet
4d496be9SDavid Vernetwhere 'next_imm' refers to the imm value of the pseudo instruction
4d496be9SDavid Vernetfollowing the basic instruction.  The unused bytes in the pseudo
4d496be9SDavid Vernetinstruction are reserved and shall be cleared to zero.
4d496be9SDavid Vernet
4d496be9SDavid VernetInstruction classes
4d496be9SDavid Vernet-------------------
4d496be9SDavid Vernet
4d496be9SDavid VernetThe three LSB bits of the 'opcode' field store the instruction class:
4d496be9SDavid Vernet
4d496be9SDavid Vernet=========  =====  ===============================  ===================================
4d496be9SDavid Vernetclass      value  description                      reference
4d496be9SDavid Vernet=========  =====  ===============================  ===================================
4d496be9SDavid VernetBPF_LD     0x00   non-standard load operations     `Load and store instructions`_
4d496be9SDavid VernetBPF_LDX    0x01   load into register operations    `Load and store instructions`_
4d496be9SDavid VernetBPF_ST     0x02   store from immediate operations  `Load and store instructions`_
4d496be9SDavid VernetBPF_STX    0x03   store from register operations   `Load and store instructions`_
4d496be9SDavid VernetBPF_ALU    0x04   32-bit arithmetic operations     `Arithmetic and jump instructions`_
4d496be9SDavid VernetBPF_JMP    0x05   64-bit jump operations           `Arithmetic and jump instructions`_
4d496be9SDavid VernetBPF_JMP32  0x06   32-bit jump operations           `Arithmetic and jump instructions`_
4d496be9SDavid VernetBPF_ALU64  0x07   64-bit arithmetic operations     `Arithmetic and jump instructions`_
4d496be9SDavid Vernet=========  =====  ===============================  ===================================
4d496be9SDavid Vernet
4d496be9SDavid VernetArithmetic and jump instructions
4d496be9SDavid Vernet================================
4d496be9SDavid Vernet
4d496be9SDavid VernetFor arithmetic and jump instructions (``BPF_ALU``, ``BPF_ALU64``, ``BPF_JMP`` and
4d496be9SDavid Vernet``BPF_JMP32``), the 8-bit 'opcode' field is divided into three parts:
4d496be9SDavid Vernet
4d496be9SDavid Vernet==============  ======  =================
4d496be9SDavid Vernet4 bits (MSB)    1 bit   3 bits (LSB)
4d496be9SDavid Vernet==============  ======  =================
4d496be9SDavid Vernetcode            source  instruction class
4d496be9SDavid Vernet==============  ======  =================
4d496be9SDavid Vernet
4d496be9SDavid Vernet**code**
4d496be9SDavid Vernet  the operation code, whose meaning varies by instruction class
4d496be9SDavid Vernet
4d496be9SDavid Vernet**source**
4d496be9SDavid Vernet  the source operand location, which unless otherwise specified is one of:
4d496be9SDavid Vernet
4d496be9SDavid Vernet  ======  =====  ==============================================
4d496be9SDavid Vernet  source  value  description
4d496be9SDavid Vernet  ======  =====  ==============================================
4d496be9SDavid Vernet  BPF_K   0x00   use 32-bit 'imm' value as source operand
4d496be9SDavid Vernet  BPF_X   0x08   use 'src_reg' register value as source operand
4d496be9SDavid Vernet  ======  =====  ==============================================
4d496be9SDavid Vernet
4d496be9SDavid Vernet**instruction class**
4d496be9SDavid Vernet  the instruction class (see `Instruction classes`_)
4d496be9SDavid Vernet
4d496be9SDavid VernetArithmetic instructions
4d496be9SDavid Vernet-----------------------
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_ALU`` uses 32-bit wide operands while ``BPF_ALU64`` uses 64-bit wide operands for
4d496be9SDavid Vernetotherwise identical operations.
4d496be9SDavid VernetThe 'code' field encodes the operation as below, where 'src' and 'dst' refer
4d496be9SDavid Vernetto the values of the source and destination registers, respectively.
4d496be9SDavid Vernet
fb213ecbSYonghong Song=========  =====  =======  ==========================================================
245d4c40SYonghong Songcode       value  offset   description
fb213ecbSYonghong Song=========  =====  =======  ==========================================================
245d4c40SYonghong SongBPF_ADD    0x00   0        dst += src
245d4c40SYonghong SongBPF_SUB    0x10   0        dst -= src
245d4c40SYonghong SongBPF_MUL    0x20   0        dst \*= src
245d4c40SYonghong SongBPF_DIV    0x30   0        dst = (src != 0) ? (dst / src) : 0
245d4c40SYonghong SongBPF_SDIV   0x30   1        dst = (src != 0) ? (dst s/ src) : 0
245d4c40SYonghong SongBPF_OR     0x40   0        dst \|= src
245d4c40SYonghong SongBPF_AND    0x50   0        dst &= src
245d4c40SYonghong SongBPF_LSH    0x60   0        dst <<= (src & mask)
245d4c40SYonghong SongBPF_RSH    0x70   0        dst >>= (src & mask)
245d4c40SYonghong SongBPF_NEG    0x80   0        dst = -dst
245d4c40SYonghong SongBPF_MOD    0x90   0        dst = (src != 0) ? (dst % src) : dst
245d4c40SYonghong SongBPF_SMOD   0x90   1        dst = (src != 0) ? (dst s% src) : dst
245d4c40SYonghong SongBPF_XOR    0xa0   0        dst ^= src
245d4c40SYonghong SongBPF_MOV    0xb0   0        dst = src
245d4c40SYonghong SongBPF_MOVSX  0xb0   8/16/32  dst = (s8,s16,s32)src
e546a119SWill HawkinsBPF_ARSH   0xc0   0        :term:`sign extending<Sign Extend>` dst >>= (src & mask)
245d4c40SYonghong SongBPF_END    0xd0   0        byte swap operations (see `Byte swap instructions`_ below)
fb213ecbSYonghong Song=========  =====  =======  ==========================================================
4d496be9SDavid Vernet
4d496be9SDavid VernetUnderflow and overflow are allowed during arithmetic operations, meaning
*7d35eb1aSDavid Vernetthe 64-bit or 32-bit value will wrap. If BPF program execution would
4d496be9SDavid Vernetresult in division by zero, the destination register is instead set to zero.
4d496be9SDavid VernetIf execution would result in modulo by zero, for ``BPF_ALU64`` the value of
4d496be9SDavid Vernetthe destination register is unchanged whereas for ``BPF_ALU`` the upper
4d496be9SDavid Vernet32 bits of the destination register are zeroed.
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_ADD | BPF_X | BPF_ALU`` means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  dst = (u32) ((u32) dst + (u32) src)
4d496be9SDavid Vernet
4d496be9SDavid Vernetwhere '(u32)' indicates that the upper 32 bits are zeroed.
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_ADD | BPF_X | BPF_ALU64`` means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  dst = dst + src
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_XOR | BPF_K | BPF_ALU`` means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  dst = (u32) dst ^ (u32) imm32
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_XOR | BPF_K | BPF_ALU64`` means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  dst = dst ^ imm32
4d496be9SDavid Vernet
ee932bf9SYonghong SongNote that most instructions have instruction offset of 0. Only three instructions
ee932bf9SYonghong Song(``BPF_SDIV``, ``BPF_SMOD``, ``BPF_MOVSX``) have a non-zero offset.
245d4c40SYonghong Song
e546a119SWill HawkinsThe division and modulo operations support both unsigned and signed flavors.
245d4c40SYonghong Song
ee932bf9SYonghong SongFor unsigned operations (``BPF_DIV`` and ``BPF_MOD``), for ``BPF_ALU``,
ee932bf9SYonghong Song'imm' is interpreted as a 32-bit unsigned value. For ``BPF_ALU64``,
e546a119SWill Hawkins'imm' is first :term:`sign extended<Sign Extend>` from 32 to 64 bits, and then
e546a119SWill Hawkinsinterpreted as a 64-bit unsigned value.
ee932bf9SYonghong Song
ee932bf9SYonghong SongFor signed operations (``BPF_SDIV`` and ``BPF_SMOD``), for ``BPF_ALU``,
ee932bf9SYonghong Song'imm' is interpreted as a 32-bit signed value. For ``BPF_ALU64``, 'imm'
e546a119SWill Hawkinsis first :term:`sign extended<Sign Extend>` from 32 to 64 bits, and then
e546a119SWill Hawkinsinterpreted as a 64-bit signed value.
ee932bf9SYonghong Song
ee932bf9SYonghong SongThe ``BPF_MOVSX`` instruction does a move operation with sign extension.
e546a119SWill Hawkins``BPF_ALU | BPF_MOVSX`` :term:`sign extends<Sign Extend>` 8-bit and 16-bit operands into 32
ee932bf9SYonghong Songbit operands, and zeroes the remaining upper 32 bits.
e546a119SWill Hawkins``BPF_ALU64 | BPF_MOVSX`` :term:`sign extends<Sign Extend>` 8-bit, 16-bit, and 32-bit
ee932bf9SYonghong Songoperands into 64 bit operands.
4d496be9SDavid Vernet
4d496be9SDavid VernetShift operations use a mask of 0x3F (63) for 64-bit operations and 0x1F (31)
4d496be9SDavid Vernetfor 32-bit operations.
4d496be9SDavid Vernet
4d496be9SDavid VernetByte swap instructions
ee932bf9SYonghong Song----------------------
4d496be9SDavid Vernet
245d4c40SYonghong SongThe byte swap instructions use instruction classes of ``BPF_ALU`` and ``BPF_ALU64``
245d4c40SYonghong Songand a 4-bit 'code' field of ``BPF_END``.
4d496be9SDavid Vernet
4d496be9SDavid VernetThe byte swap instructions operate on the destination register
4d496be9SDavid Vernetonly and do not use a separate source register or immediate value.
4d496be9SDavid Vernet
ee932bf9SYonghong SongFor ``BPF_ALU``, the 1-bit source operand field in the opcode is used to
ee932bf9SYonghong Songselect what byte order the operation converts from or to. For
ee932bf9SYonghong Song``BPF_ALU64``, the 1-bit source operand field in the opcode is reserved
ee932bf9SYonghong Songand must be set to 0.
4d496be9SDavid Vernet
245d4c40SYonghong Song=========  =========  =====  =================================================
245d4c40SYonghong Songclass      source     value  description
245d4c40SYonghong Song=========  =========  =====  =================================================
245d4c40SYonghong SongBPF_ALU    BPF_TO_LE  0x00   convert between host byte order and little endian
245d4c40SYonghong SongBPF_ALU    BPF_TO_BE  0x08   convert between host byte order and big endian
ee932bf9SYonghong SongBPF_ALU64  Reserved   0x00   do byte swap unconditionally
245d4c40SYonghong Song=========  =========  =====  =================================================
4d496be9SDavid Vernet
4d496be9SDavid VernetThe 'imm' field encodes the width of the swap operations.  The following widths
4d496be9SDavid Vernetare supported: 16, 32 and 64.
4d496be9SDavid Vernet
4d496be9SDavid VernetExamples:
4d496be9SDavid Vernet
2369e526SWill Hawkins``BPF_ALU | BPF_TO_LE | BPF_END`` with imm = 16/32/64 means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  dst = htole16(dst)
2369e526SWill Hawkins  dst = htole32(dst)
2369e526SWill Hawkins  dst = htole64(dst)
4d496be9SDavid Vernet
2369e526SWill Hawkins``BPF_ALU | BPF_TO_BE | BPF_END`` with imm = 16/32/64 means::
4d496be9SDavid Vernet
2369e526SWill Hawkins  dst = htobe16(dst)
2369e526SWill Hawkins  dst = htobe32(dst)
4d496be9SDavid Vernet  dst = htobe64(dst)
4d496be9SDavid Vernet
245d4c40SYonghong Song``BPF_ALU64 | BPF_TO_LE | BPF_END`` with imm = 16/32/64 means::
245d4c40SYonghong Song
2369e526SWill Hawkins  dst = bswap16(dst)
2369e526SWill Hawkins  dst = bswap32(dst)
2369e526SWill Hawkins  dst = bswap64(dst)
245d4c40SYonghong Song
4d496be9SDavid VernetJump instructions
4d496be9SDavid Vernet-----------------
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_JMP32`` uses 32-bit wide operands while ``BPF_JMP`` uses 64-bit wide operands for
4d496be9SDavid Vernetotherwise identical operations.
4d496be9SDavid VernetThe 'code' field encodes the operation as below:
4d496be9SDavid Vernet
4d496be9SDavid Vernet========  =====  ===  ===========================================  =========================================
4d496be9SDavid Vernetcode      value  src  description                                  notes
4d496be9SDavid Vernet========  =====  ===  ===========================================  =========================================
245d4c40SYonghong SongBPF_JA    0x0    0x0  PC += offset                                 BPF_JMP class
245d4c40SYonghong SongBPF_JA    0x0    0x0  PC += imm                                    BPF_JMP32 class
4d496be9SDavid VernetBPF_JEQ   0x1    any  PC += offset if dst == src
4d496be9SDavid VernetBPF_JGT   0x2    any  PC += offset if dst > src                    unsigned
4d496be9SDavid VernetBPF_JGE   0x3    any  PC += offset if dst >= src                   unsigned
4d496be9SDavid VernetBPF_JSET  0x4    any  PC += offset if dst & src
4d496be9SDavid VernetBPF_JNE   0x5    any  PC += offset if dst != src
4d496be9SDavid VernetBPF_JSGT  0x6    any  PC += offset if dst > src                    signed
4d496be9SDavid VernetBPF_JSGE  0x7    any  PC += offset if dst >= src                   signed
4d496be9SDavid VernetBPF_CALL  0x8    0x0  call helper function by address              see `Helper functions`_
2d71a90fSWill HawkinsBPF_CALL  0x8    0x1  call PC += imm                               see `Program-local functions`_
4d496be9SDavid VernetBPF_CALL  0x8    0x2  call helper function by BTF ID               see `Helper functions`_
4d496be9SDavid VernetBPF_EXIT  0x9    0x0  return                                       BPF_JMP only
4d496be9SDavid VernetBPF_JLT   0xa    any  PC += offset if dst < src                    unsigned
4d496be9SDavid VernetBPF_JLE   0xb    any  PC += offset if dst <= src                   unsigned
4d496be9SDavid VernetBPF_JSLT  0xc    any  PC += offset if dst < src                    signed
4d496be9SDavid VernetBPF_JSLE  0xd    any  PC += offset if dst <= src                   signed
4d496be9SDavid Vernet========  =====  ===  ===========================================  =========================================
4d496be9SDavid Vernet
*7d35eb1aSDavid VernetThe BPF program needs to store the return value into register R0 before doing a
4d496be9SDavid Vernet``BPF_EXIT``.
4d496be9SDavid Vernet
4d496be9SDavid VernetExample:
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_JSGE | BPF_X | BPF_JMP32`` (0x7e) means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  if (s32)dst s>= (s32)src goto +offset
4d496be9SDavid Vernet
4d496be9SDavid Vernetwhere 's>=' indicates a signed '>=' comparison.
4d496be9SDavid Vernet
245d4c40SYonghong Song``BPF_JA | BPF_K | BPF_JMP32`` (0x06) means::
245d4c40SYonghong Song
245d4c40SYonghong Song  gotol +imm
245d4c40SYonghong Song
245d4c40SYonghong Songwhere 'imm' means the branch offset comes from insn 'imm' field.
245d4c40SYonghong Song
ee932bf9SYonghong SongNote that there are two flavors of ``BPF_JA`` instructions. The
ee932bf9SYonghong Song``BPF_JMP`` class permits a 16-bit jump offset specified by the 'offset'
ee932bf9SYonghong Songfield, whereas the ``BPF_JMP32`` class permits a 32-bit jump offset
ee932bf9SYonghong Songspecified by the 'imm' field. A > 16-bit conditional jump may be
ee932bf9SYonghong Songconverted to a < 16-bit conditional jump plus a 32-bit unconditional
ee932bf9SYonghong Songjump.
245d4c40SYonghong Song
4d496be9SDavid VernetHelper functions
4d496be9SDavid Vernet~~~~~~~~~~~~~~~~
4d496be9SDavid Vernet
4d496be9SDavid VernetHelper functions are a concept whereby BPF programs can call into a
4d496be9SDavid Vernetset of function calls exposed by the underlying platform.
4d496be9SDavid Vernet
4d496be9SDavid VernetHistorically, each helper function was identified by an address
4d496be9SDavid Vernetencoded in the imm field.  The available helper functions may differ
4d496be9SDavid Vernetfor each program type, but address values are unique across all program types.
4d496be9SDavid Vernet
4d496be9SDavid VernetPlatforms that support the BPF Type Format (BTF) support identifying
4d496be9SDavid Verneta helper function by a BTF ID encoded in the imm field, where the BTF ID
4d496be9SDavid Vernetidentifies the helper name and type.
4d496be9SDavid Vernet
4d496be9SDavid VernetProgram-local functions
4d496be9SDavid Vernet~~~~~~~~~~~~~~~~~~~~~~~
4d496be9SDavid VernetProgram-local functions are functions exposed by the same BPF program as the
4d496be9SDavid Vernetcaller, and are referenced by offset from the call instruction, similar to
2d71a90fSWill Hawkins``BPF_JA``.  The offset is encoded in the imm field of the call instruction.
2d71a90fSWill HawkinsA ``BPF_EXIT`` within the program-local function will return to the caller.
4d496be9SDavid Vernet
4d496be9SDavid VernetLoad and store instructions
4d496be9SDavid Vernet===========================
4d496be9SDavid Vernet
4d496be9SDavid VernetFor load and store instructions (``BPF_LD``, ``BPF_LDX``, ``BPF_ST``, and ``BPF_STX``), the
4d496be9SDavid Vernet8-bit 'opcode' field is divided as:
4d496be9SDavid Vernet
4d496be9SDavid Vernet============  ======  =================
4d496be9SDavid Vernet3 bits (MSB)  2 bits  3 bits (LSB)
4d496be9SDavid Vernet============  ======  =================
4d496be9SDavid Vernetmode          size    instruction class
4d496be9SDavid Vernet============  ======  =================
4d496be9SDavid Vernet
4d496be9SDavid VernetThe mode modifier is one of:
4d496be9SDavid Vernet
4d496be9SDavid Vernet  =============  =====  ====================================  =============
4d496be9SDavid Vernet  mode modifier  value  description                           reference
4d496be9SDavid Vernet  =============  =====  ====================================  =============
4d496be9SDavid Vernet  BPF_IMM        0x00   64-bit immediate instructions         `64-bit immediate instructions`_
4d496be9SDavid Vernet  BPF_ABS        0x20   legacy BPF packet access (absolute)   `Legacy BPF Packet access instructions`_
4d496be9SDavid Vernet  BPF_IND        0x40   legacy BPF packet access (indirect)   `Legacy BPF Packet access instructions`_
4d496be9SDavid Vernet  BPF_MEM        0x60   regular load and store operations     `Regular load and store operations`_
245d4c40SYonghong Song  BPF_MEMSX      0x80   sign-extension load operations        `Sign-extension load operations`_
4d496be9SDavid Vernet  BPF_ATOMIC     0xc0   atomic operations                     `Atomic operations`_
4d496be9SDavid Vernet  =============  =====  ====================================  =============
4d496be9SDavid Vernet
4d496be9SDavid VernetThe size modifier is one of:
4d496be9SDavid Vernet
4d496be9SDavid Vernet  =============  =====  =====================
4d496be9SDavid Vernet  size modifier  value  description
4d496be9SDavid Vernet  =============  =====  =====================
4d496be9SDavid Vernet  BPF_W          0x00   word        (4 bytes)
4d496be9SDavid Vernet  BPF_H          0x08   half word   (2 bytes)
4d496be9SDavid Vernet  BPF_B          0x10   byte
4d496be9SDavid Vernet  BPF_DW         0x18   double word (8 bytes)
4d496be9SDavid Vernet  =============  =====  =====================
4d496be9SDavid Vernet
4d496be9SDavid VernetRegular load and store operations
4d496be9SDavid Vernet---------------------------------
4d496be9SDavid Vernet
4d496be9SDavid VernetThe ``BPF_MEM`` mode modifier is used to encode regular load and store
4d496be9SDavid Vernetinstructions that transfer data between a register and memory.
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_MEM | <size> | BPF_STX`` means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  *(size *) (dst + offset) = src
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_MEM | <size> | BPF_ST`` means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  *(size *) (dst + offset) = imm32
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_MEM | <size> | BPF_LDX`` means::
4d496be9SDavid Vernet
245d4c40SYonghong Song  dst = *(unsigned size *) (src + offset)
4d496be9SDavid Vernet
245d4c40SYonghong SongWhere size is one of: ``BPF_B``, ``BPF_H``, ``BPF_W``, or ``BPF_DW`` and
ee932bf9SYonghong Song'unsigned size' is one of u8, u16, u32 or u64.
245d4c40SYonghong Song
fb213ecbSYonghong SongSign-extension load operations
fb213ecbSYonghong Song------------------------------
fb213ecbSYonghong Song
e546a119SWill HawkinsThe ``BPF_MEMSX`` mode modifier is used to encode :term:`sign-extension<Sign Extend>` load
245d4c40SYonghong Songinstructions that transfer data between a register and memory.
245d4c40SYonghong Song
245d4c40SYonghong Song``BPF_MEMSX | <size> | BPF_LDX`` means::
245d4c40SYonghong Song
245d4c40SYonghong Song  dst = *(signed size *) (src + offset)
245d4c40SYonghong Song
245d4c40SYonghong SongWhere size is one of: ``BPF_B``, ``BPF_H`` or ``BPF_W``, and
ee932bf9SYonghong Song'signed size' is one of s8, s16 or s32.
4d496be9SDavid Vernet
4d496be9SDavid VernetAtomic operations
4d496be9SDavid Vernet-----------------
4d496be9SDavid Vernet
4d496be9SDavid VernetAtomic operations are operations that operate on memory and can not be
4d496be9SDavid Vernetinterrupted or corrupted by other access to the same memory region
*7d35eb1aSDavid Vernetby other BPF programs or means outside of this specification.
4d496be9SDavid Vernet
*7d35eb1aSDavid VernetAll atomic operations supported by BPF are encoded as store operations
4d496be9SDavid Vernetthat use the ``BPF_ATOMIC`` mode modifier as follows:
4d496be9SDavid Vernet
4d496be9SDavid Vernet* ``BPF_ATOMIC | BPF_W | BPF_STX`` for 32-bit operations
4d496be9SDavid Vernet* ``BPF_ATOMIC | BPF_DW | BPF_STX`` for 64-bit operations
4d496be9SDavid Vernet* 8-bit and 16-bit wide atomic operations are not supported.
4d496be9SDavid Vernet
4d496be9SDavid VernetThe 'imm' field is used to encode the actual atomic operation.
4d496be9SDavid VernetSimple atomic operation use a subset of the values defined to encode
4d496be9SDavid Vernetarithmetic operations in the 'imm' field to encode the atomic operation:
4d496be9SDavid Vernet
4d496be9SDavid Vernet========  =====  ===========
4d496be9SDavid Vernetimm       value  description
4d496be9SDavid Vernet========  =====  ===========
4d496be9SDavid VernetBPF_ADD   0x00   atomic add
4d496be9SDavid VernetBPF_OR    0x40   atomic or
4d496be9SDavid VernetBPF_AND   0x50   atomic and
4d496be9SDavid VernetBPF_XOR   0xa0   atomic xor
4d496be9SDavid Vernet========  =====  ===========
4d496be9SDavid Vernet
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_ATOMIC | BPF_W  | BPF_STX`` with 'imm' = BPF_ADD means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  *(u32 *)(dst + offset) += src
4d496be9SDavid Vernet
4d496be9SDavid Vernet``BPF_ATOMIC | BPF_DW | BPF_STX`` with 'imm' = BPF ADD means::
4d496be9SDavid Vernet
4d496be9SDavid Vernet  *(u64 *)(dst + offset) += src
4d496be9SDavid Vernet
4d496be9SDavid VernetIn addition to the simple atomic operations, there also is a modifier and
4d496be9SDavid Vernettwo complex atomic operations:
4d496be9SDavid Vernet
4d496be9SDavid Vernet===========  ================  ===========================
4d496be9SDavid Vernetimm          value             description
4d496be9SDavid Vernet===========  ================  ===========================
4d496be9SDavid VernetBPF_FETCH    0x01              modifier: return old value
4d496be9SDavid VernetBPF_XCHG     0xe0 | BPF_FETCH  atomic exchange
4d496be9SDavid VernetBPF_CMPXCHG  0xf0 | BPF_FETCH  atomic compare and exchange
4d496be9SDavid Vernet===========  ================  ===========================
4d496be9SDavid Vernet
4d496be9SDavid VernetThe ``BPF_FETCH`` modifier is optional for simple atomic operations, and
4d496be9SDavid Vernetalways set for the complex atomic operations.  If the ``BPF_FETCH`` flag
4d496be9SDavid Vernetis set, then the operation also overwrites ``src`` with the value that
4d496be9SDavid Vernetwas in memory before it was modified.
4d496be9SDavid Vernet
4d496be9SDavid VernetThe ``BPF_XCHG`` operation atomically exchanges ``src`` with the value
4d496be9SDavid Vernetaddressed by ``dst + offset``.
4d496be9SDavid Vernet
4d496be9SDavid VernetThe ``BPF_CMPXCHG`` operation atomically compares the value addressed by
4d496be9SDavid Vernet``dst + offset`` with ``R0``. If they match, the value addressed by
4d496be9SDavid Vernet``dst + offset`` is replaced with ``src``. In either case, the
4d496be9SDavid Vernetvalue that was at ``dst + offset`` before the operation is zero-extended
4d496be9SDavid Vernetand loaded back to ``R0``.
4d496be9SDavid Vernet
4d496be9SDavid Vernet64-bit immediate instructions
4d496be9SDavid Vernet-----------------------------
4d496be9SDavid Vernet
4d496be9SDavid VernetInstructions with the ``BPF_IMM`` 'mode' modifier use the wide instruction
4d496be9SDavid Vernetencoding defined in `Instruction encoding`_, and use the 'src' field of the
4d496be9SDavid Vernetbasic instruction to hold an opcode subtype.
4d496be9SDavid Vernet
4d496be9SDavid VernetThe following table defines a set of ``BPF_IMM | BPF_DW | BPF_LD`` instructions
4d496be9SDavid Vernetwith opcode subtypes in the 'src' field, using new terms such as "map"
4d496be9SDavid Vernetdefined further below:
4d496be9SDavid Vernet
4d496be9SDavid Vernet=========================  ======  ===  =========================================  ===========  ==============
4d496be9SDavid Vernetopcode construction        opcode  src  pseudocode                                 imm type     dst type
4d496be9SDavid Vernet=========================  ======  ===  =========================================  ===========  ==============
4d496be9SDavid VernetBPF_IMM | BPF_DW | BPF_LD  0x18    0x0  dst = imm64                                integer      integer
4d496be9SDavid VernetBPF_IMM | BPF_DW | BPF_LD  0x18    0x1  dst = map_by_fd(imm)                       map fd       map
4d496be9SDavid VernetBPF_IMM | BPF_DW | BPF_LD  0x18    0x2  dst = map_val(map_by_fd(imm)) + next_imm   map fd       data pointer
4d496be9SDavid VernetBPF_IMM | BPF_DW | BPF_LD  0x18    0x3  dst = var_addr(imm)                        variable id  data pointer
4d496be9SDavid VernetBPF_IMM | BPF_DW | BPF_LD  0x18    0x4  dst = code_addr(imm)                       integer      code pointer
4d496be9SDavid VernetBPF_IMM | BPF_DW | BPF_LD  0x18    0x5  dst = map_by_idx(imm)                      map index    map
4d496be9SDavid VernetBPF_IMM | BPF_DW | BPF_LD  0x18    0x6  dst = map_val(map_by_idx(imm)) + next_imm  map index    data pointer
4d496be9SDavid Vernet=========================  ======  ===  =========================================  ===========  ==============
4d496be9SDavid Vernet
4d496be9SDavid Vernetwhere
4d496be9SDavid Vernet
4d496be9SDavid Vernet* map_by_fd(imm) means to convert a 32-bit file descriptor into an address of a map (see `Maps`_)
4d496be9SDavid Vernet* map_by_idx(imm) means to convert a 32-bit index into an address of a map
4d496be9SDavid Vernet* map_val(map) gets the address of the first value in a given map
4d496be9SDavid Vernet* var_addr(imm) gets the address of a platform variable (see `Platform Variables`_) with a given id
4d496be9SDavid Vernet* code_addr(imm) gets the address of the instruction at a specified relative offset in number of (64-bit) instructions
4d496be9SDavid Vernet* the 'imm type' can be used by disassemblers for display
4d496be9SDavid Vernet* the 'dst type' can be used for verification and JIT compilation purposes
4d496be9SDavid Vernet
4d496be9SDavid VernetMaps
4d496be9SDavid Vernet~~~~
4d496be9SDavid Vernet
*7d35eb1aSDavid VernetMaps are shared memory regions accessible by BPF programs on some platforms.
4d496be9SDavid VernetA map can have various semantics as defined in a separate document, and may or
4d496be9SDavid Vernetmay not have a single contiguous memory region, but the 'map_val(map)' is
4d496be9SDavid Vernetcurrently only defined for maps that do have a single contiguous memory region.
4d496be9SDavid Vernet
4d496be9SDavid VernetEach map can have a file descriptor (fd) if supported by the platform, where
4d496be9SDavid Vernet'map_by_fd(imm)' means to get the map with the specified file descriptor. Each
4d496be9SDavid VernetBPF program can also be defined to use a set of maps associated with the
4d496be9SDavid Vernetprogram at load time, and 'map_by_idx(imm)' means to get the map with the given
4d496be9SDavid Vernetindex in the set associated with the BPF program containing the instruction.
4d496be9SDavid Vernet
4d496be9SDavid VernetPlatform Variables
4d496be9SDavid Vernet~~~~~~~~~~~~~~~~~~
4d496be9SDavid Vernet
4d496be9SDavid VernetPlatform variables are memory regions, identified by integer ids, exposed by
4d496be9SDavid Vernetthe runtime and accessible by BPF programs on some platforms.  The
4d496be9SDavid Vernet'var_addr(imm)' operation means to get the address of the memory region
4d496be9SDavid Vernetidentified by the given id.
4d496be9SDavid Vernet
4d496be9SDavid VernetLegacy BPF Packet access instructions
4d496be9SDavid Vernet-------------------------------------
4d496be9SDavid Vernet
*7d35eb1aSDavid VernetBPF previously introduced special instructions for access to packet data that were
4d496be9SDavid Vernetcarried over from classic BPF. However, these instructions are
4d496be9SDavid Vernetdeprecated and should no longer be used.