xref: /openbmc/qemu/docs/devel/control-flow-integrity.rst (revision a111824382dad27db8c358b2b9b26cdf30eaf49f)
1*a1118243SDaniele Buono============================
2*a1118243SDaniele BuonoControl-Flow Integrity (CFI)
3*a1118243SDaniele Buono============================
4*a1118243SDaniele Buono
5*a1118243SDaniele BuonoThis document describes the current control-flow integrity (CFI) mechanism in
6*a1118243SDaniele BuonoQEMU. How it can be enabled, its benefits and deficiencies, and how it affects
7*a1118243SDaniele Buononew and existing code in QEMU
8*a1118243SDaniele Buono
9*a1118243SDaniele BuonoBasics
10*a1118243SDaniele Buono------
11*a1118243SDaniele Buono
12*a1118243SDaniele BuonoCFI is a hardening technique that focusing on guaranteeing that indirect
13*a1118243SDaniele Buonofunction calls have not been altered by an attacker.
14*a1118243SDaniele BuonoThe type used in QEMU is a forward-edge control-flow integrity that ensures
15*a1118243SDaniele Buonofunction calls performed through function pointers, always call a "compatible"
16*a1118243SDaniele Buonofunction. A compatible function is a function with the same signature of the
17*a1118243SDaniele Buonofunction pointer declared in the source code.
18*a1118243SDaniele Buono
19*a1118243SDaniele BuonoThis type of CFI is entirely compiler-based and relies on the compiler knowing
20*a1118243SDaniele Buonothe signature of every function and every function pointer used in the code.
21*a1118243SDaniele BuonoAs of now, the only compiler that provides support for CFI is Clang.
22*a1118243SDaniele Buono
23*a1118243SDaniele BuonoCFI is best used on production binaries, to protect against unknown attack
24*a1118243SDaniele Buonovectors.
25*a1118243SDaniele Buono
26*a1118243SDaniele BuonoIn case of a CFI violation (i.e. call to a non-compatible function) QEMU will
27*a1118243SDaniele Buonoterminate abruptly, to stop the possible attack.
28*a1118243SDaniele Buono
29*a1118243SDaniele BuonoBuilding with CFI
30*a1118243SDaniele Buono-----------------
31*a1118243SDaniele Buono
32*a1118243SDaniele BuonoNOTE: CFI requires the use of link-time optimization. Therefore, when CFI is
33*a1118243SDaniele Buonoselected, LTO will be automatically enabled.
34*a1118243SDaniele Buono
35*a1118243SDaniele BuonoTo build with CFI, the minimum requirement is Clang 6+. If you
36*a1118243SDaniele Buonoare planning to also enable fuzzing, then Clang 11+ is needed (more on this
37*a1118243SDaniele Buonolater).
38*a1118243SDaniele Buono
39*a1118243SDaniele BuonoGiven the use of LTO, a version of AR that supports LLVM IR is required.
40*a1118243SDaniele BuonoThe easies way of doing this is by selecting the AR provided by LLVM::
41*a1118243SDaniele Buono
42*a1118243SDaniele Buono AR=llvm-ar-9 CC=clang-9 CXX=lang++-9 /path/to/configure --enable-cfi
43*a1118243SDaniele Buono
44*a1118243SDaniele BuonoCFI is enabled on every binary produced.
45*a1118243SDaniele Buono
46*a1118243SDaniele BuonoIf desired, an additional flag to increase the verbosity of the output in case
47*a1118243SDaniele Buonoof a CFI violation is offered (``--enable-debug-cfi``).
48*a1118243SDaniele Buono
49*a1118243SDaniele BuonoUsing QEMU built with CFI
50*a1118243SDaniele Buono-------------------------
51*a1118243SDaniele Buono
52*a1118243SDaniele BuonoA binary with CFI will work exactly like a standard binary. In case of a CFI
53*a1118243SDaniele Buonoviolation, the binary will terminate with an illegal instruction signal.
54*a1118243SDaniele Buono
55*a1118243SDaniele BuonoIncompatible code with CFI
56*a1118243SDaniele Buono--------------------------
57*a1118243SDaniele Buono
58*a1118243SDaniele BuonoAs mentioned above, CFI is entirely compiler-based and therefore relies on
59*a1118243SDaniele Buonocompile-time knowledge of the code. This means that, while generally supported
60*a1118243SDaniele Buonofor most code, some specific use pattern can break CFI compatibility, and
61*a1118243SDaniele Buonocreate false-positives. The two main patterns that can cause issues are:
62*a1118243SDaniele Buono
63*a1118243SDaniele Buono* Just-in-time compiled code: since such code is created at runtime, the jump
64*a1118243SDaniele Buono  to the buffer containing JIT code will fail.
65*a1118243SDaniele Buono
66*a1118243SDaniele Buono* Libraries loaded dynamically, e.g. with dlopen/dlsym, since the library was
67*a1118243SDaniele Buono  not known at compile time.
68*a1118243SDaniele Buono
69*a1118243SDaniele BuonoCurrent areas of QEMU that are not entirely compatible with CFI are:
70*a1118243SDaniele Buono
71*a1118243SDaniele Buono1. TCG, since the idea of TCG is to pre-compile groups of instructions at
72*a1118243SDaniele Buono   runtime to speed-up interpretation, quite similarly to a JIT compiler
73*a1118243SDaniele Buono
74*a1118243SDaniele Buono2. TCI, where the interpreter has to interpret the generic *call* operation
75*a1118243SDaniele Buono
76*a1118243SDaniele Buono3. Plugins, since a plugin is implemented as an external library
77*a1118243SDaniele Buono
78*a1118243SDaniele Buono4. Modules, since they are implemented as an external library
79*a1118243SDaniele Buono
80*a1118243SDaniele Buono5. Directly calling signal handlers from the QEMU source code, since the
81*a1118243SDaniele Buono   signal handler may have been provided by an external library or even plugged
82*a1118243SDaniele Buono   at runtime.
83*a1118243SDaniele Buono
84*a1118243SDaniele BuonoDisabling CFI for a specific function
85*a1118243SDaniele Buono-------------------------------------
86*a1118243SDaniele Buono
87*a1118243SDaniele BuonoIf you are working on function that is performing a call using an
88*a1118243SDaniele Buonoincompatible way, as described before, you can selectively disable CFI checks
89*a1118243SDaniele Buonofor such function by using the decorator ``QEMU_DISABLE_CFI`` at function
90*a1118243SDaniele Buonodefinition, and add an explanation on why the function is not compatible
91*a1118243SDaniele Buonowith CFI. An example of the use of ``QEMU_DISABLE_CFI`` is provided here::
92*a1118243SDaniele Buono
93*a1118243SDaniele Buono	/*
94*a1118243SDaniele Buono	 * Disable CFI checks.
95*a1118243SDaniele Buono	 * TCG creates binary blobs at runtime, with the transformed code.
96*a1118243SDaniele Buono	 * A TB is a blob of binary code, created at runtime and called with an
97*a1118243SDaniele Buono	 * indirect function call. Since such function did not exist at compile time,
98*a1118243SDaniele Buono	 * the CFI runtime has no way to verify its signature and would fail.
99*a1118243SDaniele Buono	 * TCG is not considered a security-sensitive part of QEMU so this does not
100*a1118243SDaniele Buono	 * affect the impact of CFI in environment with high security requirements
101*a1118243SDaniele Buono	 */
102*a1118243SDaniele Buono	QEMU_DISABLE_CFI
103*a1118243SDaniele Buono	static inline tcg_target_ulong cpu_tb_exec(CPUState *cpu, TranslationBlock *itb)
104*a1118243SDaniele Buono
105*a1118243SDaniele BuonoNOTE: CFI needs to be disabled at the **caller** function, (i.e. a compatible
106*a1118243SDaniele Buonocfi function that calls a non-compatible one), since the check is performed
107*a1118243SDaniele Buonowhen the function call is performed.
108*a1118243SDaniele Buono
109*a1118243SDaniele BuonoCFI and fuzzing
110*a1118243SDaniele Buono---------------
111*a1118243SDaniele Buono
112*a1118243SDaniele BuonoThere is generally no advantage of using CFI and fuzzing together, because
113*a1118243SDaniele Buonothey target different environments (production for CFI, debug for fuzzing).
114*a1118243SDaniele Buono
115*a1118243SDaniele BuonoCFI could be used in conjunction with fuzzing to identify a broader set of
116*a1118243SDaniele Buonobugs that may not end immediately in a segmentation fault or triggering
117*a1118243SDaniele Buonoan assertion. However, other sanitizers such as address and ub sanitizers
118*a1118243SDaniele Buonocan identify such bugs in a more precise way than CFI.
119*a1118243SDaniele Buono
120*a1118243SDaniele BuonoThere is, however, an interesting use case in using CFI in conjunction with
121*a1118243SDaniele Buonofuzzing, that is to make sure that CFI is not triggering any false positive
122*a1118243SDaniele Buonoin remote-but-possible parts of the code.
123*a1118243SDaniele Buono
124*a1118243SDaniele BuonoCFI can be enabled with fuzzing, but with some caveats:
125*a1118243SDaniele Buono1. Fuzzing relies on the linker performing function wrapping at link-time.
126*a1118243SDaniele BuonoThe standard BFD linker does not support function wrapping when LTO is
127*a1118243SDaniele Buonoalso enabled. The workaround is to use LLVM's lld linker.
128*a1118243SDaniele Buono2. Fuzzing also relies on a custom linker script, which is only supported by
129*a1118243SDaniele Buonolld with version 11+.
130*a1118243SDaniele Buono
131*a1118243SDaniele BuonoIn other words, to compile with fuzzing and CFI, clang 11+ is required, and
132*a1118243SDaniele Buonolld needs to be used as a linker::
133*a1118243SDaniele Buono
134*a1118243SDaniele Buono AR=llvm-ar-11 CC=clang-11 CXX=lang++-11 /path/to/configure --enable-cfi \
135*a1118243SDaniele Buono                           -enable-fuzzing --extra-ldflags="-fuse-ld=lld"
136*a1118243SDaniele Buono
137*a1118243SDaniele Buonoand then, compile the fuzzers as usual.
138