xref: /openbmc/linux/Documentation/core-api/asm-annotations.rst (revision 1ac731c529cd4d6adbce134754b51ff7d822b145)
1f4bf1cd4SJonathan CorbetAssembler Annotations
2f4bf1cd4SJonathan Corbet=====================
3f4bf1cd4SJonathan Corbet
4f4bf1cd4SJonathan CorbetCopyright (c) 2017-2019 Jiri Slaby
5f4bf1cd4SJonathan Corbet
6f4bf1cd4SJonathan CorbetThis document describes the new macros for annotation of data and code in
7f4bf1cd4SJonathan Corbetassembly. In particular, it contains information about ``SYM_FUNC_START``,
8f4bf1cd4SJonathan Corbet``SYM_FUNC_END``, ``SYM_CODE_START``, and similar.
9f4bf1cd4SJonathan Corbet
10f4bf1cd4SJonathan CorbetRationale
11f4bf1cd4SJonathan Corbet---------
12f4bf1cd4SJonathan CorbetSome code like entries, trampolines, or boot code needs to be written in
13f4bf1cd4SJonathan Corbetassembly. The same as in C, such code is grouped into functions and
14f4bf1cd4SJonathan Corbetaccompanied with data. Standard assemblers do not force users into precisely
15f4bf1cd4SJonathan Corbetmarking these pieces as code, data, or even specifying their length.
16f4bf1cd4SJonathan CorbetNevertheless, assemblers provide developers with such annotations to aid
17f4bf1cd4SJonathan Corbetdebuggers throughout assembly. On top of that, developers also want to mark
18f4bf1cd4SJonathan Corbetsome functions as *global* in order to be visible outside of their translation
19f4bf1cd4SJonathan Corbetunits.
20f4bf1cd4SJonathan Corbet
21f4bf1cd4SJonathan CorbetOver time, the Linux kernel has adopted macros from various projects (like
22f4bf1cd4SJonathan Corbet``binutils``) to facilitate such annotations. So for historic reasons,
23f4bf1cd4SJonathan Corbetdevelopers have been using ``ENTRY``, ``END``, ``ENDPROC``, and other
24f4bf1cd4SJonathan Corbetannotations in assembly.  Due to the lack of their documentation, the macros
25f4bf1cd4SJonathan Corbetare used in rather wrong contexts at some locations. Clearly, ``ENTRY`` was
26f4bf1cd4SJonathan Corbetintended to denote the beginning of global symbols (be it data or code).
27f4bf1cd4SJonathan Corbet``END`` used to mark the end of data or end of special functions with
28f4bf1cd4SJonathan Corbet*non-standard* calling convention. In contrast, ``ENDPROC`` should annotate
29f4bf1cd4SJonathan Corbetonly ends of *standard* functions.
30f4bf1cd4SJonathan Corbet
31f4bf1cd4SJonathan CorbetWhen these macros are used correctly, they help assemblers generate a nice
32f4bf1cd4SJonathan Corbetobject with both sizes and types set correctly. For example, the result of
33f4bf1cd4SJonathan Corbet``arch/x86/lib/putuser.S``::
34f4bf1cd4SJonathan Corbet
35f4bf1cd4SJonathan Corbet   Num:    Value          Size Type    Bind   Vis      Ndx Name
36f4bf1cd4SJonathan Corbet    25: 0000000000000000    33 FUNC    GLOBAL DEFAULT    1 __put_user_1
37f4bf1cd4SJonathan Corbet    29: 0000000000000030    37 FUNC    GLOBAL DEFAULT    1 __put_user_2
38f4bf1cd4SJonathan Corbet    32: 0000000000000060    36 FUNC    GLOBAL DEFAULT    1 __put_user_4
39f4bf1cd4SJonathan Corbet    35: 0000000000000090    37 FUNC    GLOBAL DEFAULT    1 __put_user_8
40f4bf1cd4SJonathan Corbet
41f4bf1cd4SJonathan CorbetThis is not only important for debugging purposes. When there are properly
42f4bf1cd4SJonathan Corbetannotated objects like this, tools can be run on them to generate more useful
43f4bf1cd4SJonathan Corbetinformation. In particular, on properly annotated objects, ``objtool`` can be
44f4bf1cd4SJonathan Corbetrun to check and fix the object if needed. Currently, ``objtool`` can report
45f4bf1cd4SJonathan Corbetmissing frame pointer setup/destruction in functions. It can also
46f4bf1cd4SJonathan Corbetautomatically generate annotations for the ORC unwinder
47*ff61f079SJonathan Corbet(Documentation/arch/x86/orc-unwinder.rst)
48f4bf1cd4SJonathan Corbetfor most code. Both of these are especially important to support reliable
49f4bf1cd4SJonathan Corbetstack traces which are in turn necessary for kernel live patching
50f4bf1cd4SJonathan Corbet(Documentation/livepatch/livepatch.rst).
51f4bf1cd4SJonathan Corbet
52f4bf1cd4SJonathan CorbetCaveat and Discussion
53f4bf1cd4SJonathan Corbet---------------------
54f4bf1cd4SJonathan CorbetAs one might realize, there were only three macros previously. That is indeed
55f4bf1cd4SJonathan Corbetinsufficient to cover all the combinations of cases:
56f4bf1cd4SJonathan Corbet
57f4bf1cd4SJonathan Corbet* standard/non-standard function
58f4bf1cd4SJonathan Corbet* code/data
59f4bf1cd4SJonathan Corbet* global/local symbol
60f4bf1cd4SJonathan Corbet
61f4bf1cd4SJonathan CorbetThere was a discussion_ and instead of extending the current ``ENTRY/END*``
62f4bf1cd4SJonathan Corbetmacros, it was decided that brand new macros should be introduced instead::
63f4bf1cd4SJonathan Corbet
64f4bf1cd4SJonathan Corbet    So how about using macro names that actually show the purpose, instead
65f4bf1cd4SJonathan Corbet    of importing all the crappy, historic, essentially randomly chosen
66f4bf1cd4SJonathan Corbet    debug symbol macro names from the binutils and older kernels?
67f4bf1cd4SJonathan Corbet
68f4bf1cd4SJonathan Corbet.. _discussion: https://lore.kernel.org/r/20170217104757.28588-1-jslaby@suse.cz
69f4bf1cd4SJonathan Corbet
70f4bf1cd4SJonathan CorbetMacros Description
71f4bf1cd4SJonathan Corbet------------------
72f4bf1cd4SJonathan Corbet
73f4bf1cd4SJonathan CorbetThe new macros are prefixed with the ``SYM_`` prefix and can be divided into
74f4bf1cd4SJonathan Corbetthree main groups:
75f4bf1cd4SJonathan Corbet
76f4bf1cd4SJonathan Corbet1. ``SYM_FUNC_*`` -- to annotate C-like functions. This means functions with
77f4bf1cd4SJonathan Corbet   standard C calling conventions. For example, on x86, this means that the
78f4bf1cd4SJonathan Corbet   stack contains a return address at the predefined place and a return from
79f4bf1cd4SJonathan Corbet   the function can happen in a standard way. When frame pointers are enabled,
80f4bf1cd4SJonathan Corbet   save/restore of frame pointer shall happen at the start/end of a function,
81f4bf1cd4SJonathan Corbet   respectively, too.
82f4bf1cd4SJonathan Corbet
83f4bf1cd4SJonathan Corbet   Checking tools like ``objtool`` should ensure such marked functions conform
84f4bf1cd4SJonathan Corbet   to these rules. The tools can also easily annotate these functions with
85f4bf1cd4SJonathan Corbet   debugging information (like *ORC data*) automatically.
86f4bf1cd4SJonathan Corbet
87f4bf1cd4SJonathan Corbet2. ``SYM_CODE_*`` -- special functions called with special stack. Be it
88f4bf1cd4SJonathan Corbet   interrupt handlers with special stack content, trampolines, or startup
89f4bf1cd4SJonathan Corbet   functions.
90f4bf1cd4SJonathan Corbet
91f4bf1cd4SJonathan Corbet   Checking tools mostly ignore checking of these functions. But some debug
92f4bf1cd4SJonathan Corbet   information still can be generated automatically. For correct debug data,
93f4bf1cd4SJonathan Corbet   this code needs hints like ``UNWIND_HINT_REGS`` provided by developers.
94f4bf1cd4SJonathan Corbet
95f4bf1cd4SJonathan Corbet3. ``SYM_DATA*`` -- obviously data belonging to ``.data`` sections and not to
96f4bf1cd4SJonathan Corbet   ``.text``. Data do not contain instructions, so they have to be treated
97f4bf1cd4SJonathan Corbet   specially by the tools: they should not treat the bytes as instructions,
98f4bf1cd4SJonathan Corbet   nor assign any debug information to them.
99f4bf1cd4SJonathan Corbet
100f4bf1cd4SJonathan CorbetInstruction Macros
101f4bf1cd4SJonathan Corbet~~~~~~~~~~~~~~~~~~
102f4bf1cd4SJonathan CorbetThis section covers ``SYM_FUNC_*`` and ``SYM_CODE_*`` enumerated above.
103f4bf1cd4SJonathan Corbet
104f4bf1cd4SJonathan Corbet``objtool`` requires that all code must be contained in an ELF symbol. Symbol
105f4bf1cd4SJonathan Corbetnames that have a ``.L`` prefix do not emit symbol table entries. ``.L``
106f4bf1cd4SJonathan Corbetprefixed symbols can be used within a code region, but should be avoided for
107f4bf1cd4SJonathan Corbetdenoting a range of code via ``SYM_*_START/END`` annotations.
108f4bf1cd4SJonathan Corbet
109f4bf1cd4SJonathan Corbet* ``SYM_FUNC_START`` and ``SYM_FUNC_START_LOCAL`` are supposed to be **the
110f4bf1cd4SJonathan Corbet  most frequent markings**. They are used for functions with standard calling
111f4bf1cd4SJonathan Corbet  conventions -- global and local. Like in C, they both align the functions to
112f4bf1cd4SJonathan Corbet  architecture specific ``__ALIGN`` bytes. There are also ``_NOALIGN`` variants
113f4bf1cd4SJonathan Corbet  for special cases where developers do not want this implicit alignment.
114f4bf1cd4SJonathan Corbet
115f4bf1cd4SJonathan Corbet  ``SYM_FUNC_START_WEAK`` and ``SYM_FUNC_START_WEAK_NOALIGN`` markings are
116f4bf1cd4SJonathan Corbet  also offered as an assembler counterpart to the *weak* attribute known from
117f4bf1cd4SJonathan Corbet  C.
118f4bf1cd4SJonathan Corbet
119f4bf1cd4SJonathan Corbet  All of these **shall** be coupled with ``SYM_FUNC_END``. First, it marks
120f4bf1cd4SJonathan Corbet  the sequence of instructions as a function and computes its size to the
121f4bf1cd4SJonathan Corbet  generated object file. Second, it also eases checking and processing such
122f4bf1cd4SJonathan Corbet  object files as the tools can trivially find exact function boundaries.
123f4bf1cd4SJonathan Corbet
124f4bf1cd4SJonathan Corbet  So in most cases, developers should write something like in the following
125f4bf1cd4SJonathan Corbet  example, having some asm instructions in between the macros, of course::
126f4bf1cd4SJonathan Corbet
127f4bf1cd4SJonathan Corbet    SYM_FUNC_START(memset)
128f4bf1cd4SJonathan Corbet        ... asm insns ...
129f4bf1cd4SJonathan Corbet    SYM_FUNC_END(memset)
130f4bf1cd4SJonathan Corbet
131f4bf1cd4SJonathan Corbet  In fact, this kind of annotation corresponds to the now deprecated ``ENTRY``
132f4bf1cd4SJonathan Corbet  and ``ENDPROC`` macros.
133f4bf1cd4SJonathan Corbet
134f4bf1cd4SJonathan Corbet* ``SYM_FUNC_ALIAS``, ``SYM_FUNC_ALIAS_LOCAL``, and ``SYM_FUNC_ALIAS_WEAK`` can
135f4bf1cd4SJonathan Corbet  be used to define multiple names for a function. The typical use is::
136f4bf1cd4SJonathan Corbet
137f4bf1cd4SJonathan Corbet    SYM_FUNC_START(__memset)
138f4bf1cd4SJonathan Corbet        ... asm insns ...
139f4bf1cd4SJonathan Corbet    SYN_FUNC_END(__memset)
140f4bf1cd4SJonathan Corbet    SYM_FUNC_ALIAS(memset, __memset)
141f4bf1cd4SJonathan Corbet
142f4bf1cd4SJonathan Corbet  In this example, one can call ``__memset`` or ``memset`` with the same
143f4bf1cd4SJonathan Corbet  result, except the debug information for the instructions is generated to
144f4bf1cd4SJonathan Corbet  the object file only once -- for the non-``ALIAS`` case.
145f4bf1cd4SJonathan Corbet
146f4bf1cd4SJonathan Corbet* ``SYM_CODE_START`` and ``SYM_CODE_START_LOCAL`` should be used only in
147f4bf1cd4SJonathan Corbet  special cases -- if you know what you are doing. This is used exclusively
148f4bf1cd4SJonathan Corbet  for interrupt handlers and similar where the calling convention is not the C
149f4bf1cd4SJonathan Corbet  one. ``_NOALIGN`` variants exist too. The use is the same as for the ``FUNC``
150f4bf1cd4SJonathan Corbet  category above::
151f4bf1cd4SJonathan Corbet
152f4bf1cd4SJonathan Corbet    SYM_CODE_START_LOCAL(bad_put_user)
153f4bf1cd4SJonathan Corbet        ... asm insns ...
154f4bf1cd4SJonathan Corbet    SYM_CODE_END(bad_put_user)
155f4bf1cd4SJonathan Corbet
156f4bf1cd4SJonathan Corbet  Again, every ``SYM_CODE_START*`` **shall** be coupled by ``SYM_CODE_END``.
157f4bf1cd4SJonathan Corbet
158f4bf1cd4SJonathan Corbet  To some extent, this category corresponds to deprecated ``ENTRY`` and
159f4bf1cd4SJonathan Corbet  ``END``. Except ``END`` had several other meanings too.
160f4bf1cd4SJonathan Corbet
161f4bf1cd4SJonathan Corbet* ``SYM_INNER_LABEL*`` is used to denote a label inside some
162f4bf1cd4SJonathan Corbet  ``SYM_{CODE,FUNC}_START`` and ``SYM_{CODE,FUNC}_END``.  They are very similar
163f4bf1cd4SJonathan Corbet  to C labels, except they can be made global. An example of use::
164f4bf1cd4SJonathan Corbet
165f4bf1cd4SJonathan Corbet    SYM_CODE_START(ftrace_caller)
166f4bf1cd4SJonathan Corbet        /* save_mcount_regs fills in first two parameters */
167f4bf1cd4SJonathan Corbet        ...
168f4bf1cd4SJonathan Corbet
169f4bf1cd4SJonathan Corbet    SYM_INNER_LABEL(ftrace_caller_op_ptr, SYM_L_GLOBAL)
170f4bf1cd4SJonathan Corbet        /* Load the ftrace_ops into the 3rd parameter */
171f4bf1cd4SJonathan Corbet        ...
172f4bf1cd4SJonathan Corbet
173f4bf1cd4SJonathan Corbet    SYM_INNER_LABEL(ftrace_call, SYM_L_GLOBAL)
174f4bf1cd4SJonathan Corbet        call ftrace_stub
175f4bf1cd4SJonathan Corbet        ...
176f4bf1cd4SJonathan Corbet        retq
177f4bf1cd4SJonathan Corbet    SYM_CODE_END(ftrace_caller)
178f4bf1cd4SJonathan Corbet
179f4bf1cd4SJonathan CorbetData Macros
180f4bf1cd4SJonathan Corbet~~~~~~~~~~~
181f4bf1cd4SJonathan CorbetSimilar to instructions, there is a couple of macros to describe data in the
182f4bf1cd4SJonathan Corbetassembly.
183f4bf1cd4SJonathan Corbet
184f4bf1cd4SJonathan Corbet* ``SYM_DATA_START`` and ``SYM_DATA_START_LOCAL`` mark the start of some data
185f4bf1cd4SJonathan Corbet  and shall be used in conjunction with either ``SYM_DATA_END``, or
186f4bf1cd4SJonathan Corbet  ``SYM_DATA_END_LABEL``. The latter adds also a label to the end, so that
187f4bf1cd4SJonathan Corbet  people can use ``lstack`` and (local) ``lstack_end`` in the following
188f4bf1cd4SJonathan Corbet  example::
189f4bf1cd4SJonathan Corbet
190f4bf1cd4SJonathan Corbet    SYM_DATA_START_LOCAL(lstack)
191f4bf1cd4SJonathan Corbet        .skip 4096
192f4bf1cd4SJonathan Corbet    SYM_DATA_END_LABEL(lstack, SYM_L_LOCAL, lstack_end)
193f4bf1cd4SJonathan Corbet
194f4bf1cd4SJonathan Corbet* ``SYM_DATA`` and ``SYM_DATA_LOCAL`` are variants for simple, mostly one-line
195f4bf1cd4SJonathan Corbet  data::
196f4bf1cd4SJonathan Corbet
197f4bf1cd4SJonathan Corbet    SYM_DATA(HEAP,     .long rm_heap)
198f4bf1cd4SJonathan Corbet    SYM_DATA(heap_end, .long rm_stack)
199f4bf1cd4SJonathan Corbet
200f4bf1cd4SJonathan Corbet  In the end, they expand to ``SYM_DATA_START`` with ``SYM_DATA_END``
201f4bf1cd4SJonathan Corbet  internally.
202f4bf1cd4SJonathan Corbet
203f4bf1cd4SJonathan CorbetSupport Macros
204f4bf1cd4SJonathan Corbet~~~~~~~~~~~~~~
205f4bf1cd4SJonathan CorbetAll the above reduce themselves to some invocation of ``SYM_START``,
206f4bf1cd4SJonathan Corbet``SYM_END``, or ``SYM_ENTRY`` at last. Normally, developers should avoid using
207f4bf1cd4SJonathan Corbetthese.
208f4bf1cd4SJonathan Corbet
209f4bf1cd4SJonathan CorbetFurther, in the above examples, one could see ``SYM_L_LOCAL``. There are also
210f4bf1cd4SJonathan Corbet``SYM_L_GLOBAL`` and ``SYM_L_WEAK``. All are intended to denote linkage of a
211f4bf1cd4SJonathan Corbetsymbol marked by them. They are used either in ``_LABEL`` variants of the
212f4bf1cd4SJonathan Corbetearlier macros, or in ``SYM_START``.
213f4bf1cd4SJonathan Corbet
214f4bf1cd4SJonathan Corbet
215f4bf1cd4SJonathan CorbetOverriding Macros
216f4bf1cd4SJonathan Corbet~~~~~~~~~~~~~~~~~
217f4bf1cd4SJonathan CorbetArchitecture can also override any of the macros in their own
218f4bf1cd4SJonathan Corbet``asm/linkage.h``, including macros specifying the type of a symbol
219f4bf1cd4SJonathan Corbet(``SYM_T_FUNC``, ``SYM_T_OBJECT``, and ``SYM_T_NONE``).  As every macro
220f4bf1cd4SJonathan Corbetdescribed in this file is surrounded by ``#ifdef`` + ``#endif``, it is enough
221f4bf1cd4SJonathan Corbetto define the macros differently in the aforementioned architecture-dependent
222f4bf1cd4SJonathan Corbetheader.
223