1.. Copyright 2010 Nicolas Palix <npalix@diku.dk> 2.. Copyright 2010 Julia Lawall <julia@diku.dk> 3.. Copyright 2010 Gilles Muller <Gilles.Muller@lip6.fr> 4 5.. highlight:: none 6 7.. _devtools_coccinelle: 8 9Coccinelle 10========== 11 12Coccinelle is a tool for pattern matching and text transformation that has 13many uses in kernel development, including the application of complex, 14tree-wide patches and detection of problematic programming patterns. 15 16Getting Coccinelle 17------------------ 18 19The semantic patches included in the kernel use features and options 20which are provided by Coccinelle version 1.0.0-rc11 and above. 21Using earlier versions will fail as the option names used by 22the Coccinelle files and coccicheck have been updated. 23 24Coccinelle is available through the package manager 25of many distributions, e.g. : 26 27 - Debian 28 - Fedora 29 - Ubuntu 30 - OpenSUSE 31 - Arch Linux 32 - NetBSD 33 - FreeBSD 34 35Some distribution packages are obsolete and it is recommended 36to use the latest version released from the Coccinelle homepage at 37http://coccinelle.lip6.fr/ 38 39Or from Github at: 40 41https://github.com/coccinelle/coccinelle 42 43Once you have it, run the following commands:: 44 45 ./autogen 46 ./configure 47 make 48 49as a regular user, and install it with:: 50 51 sudo make install 52 53More detailed installation instructions to build from source can be 54found at: 55 56https://github.com/coccinelle/coccinelle/blob/master/install.txt 57 58Supplemental documentation 59-------------------------- 60 61For supplemental documentation refer to the wiki: 62 63https://bottest.wiki.kernel.org/coccicheck 64 65The wiki documentation always refers to the linux-next version of the script. 66 67For Semantic Patch Language(SmPL) grammar documentation refer to: 68 69http://coccinelle.lip6.fr/documentation.php 70 71Using Coccinelle on the Linux kernel 72------------------------------------ 73 74A Coccinelle-specific target is defined in the top level 75Makefile. This target is named ``coccicheck`` and calls the ``coccicheck`` 76front-end in the ``scripts`` directory. 77 78Four basic modes are defined: ``patch``, ``report``, ``context``, and 79``org``. The mode to use is specified by setting the MODE variable with 80``MODE=<mode>``. 81 82- ``patch`` proposes a fix, when possible. 83 84- ``report`` generates a list in the following format: 85 file:line:column-column: message 86 87- ``context`` highlights lines of interest and their context in a 88 diff-like style. Lines of interest are indicated with ``-``. 89 90- ``org`` generates a report in the Org mode format of Emacs. 91 92Note that not all semantic patches implement all modes. For easy use 93of Coccinelle, the default mode is "report". 94 95Two other modes provide some common combinations of these modes. 96 97- ``chain`` tries the previous modes in the order above until one succeeds. 98 99- ``rep+ctxt`` runs successively the report mode and the context mode. 100 It should be used with the C option (described later) 101 which checks the code on a file basis. 102 103Examples 104~~~~~~~~ 105 106To make a report for every semantic patch, run the following command:: 107 108 make coccicheck MODE=report 109 110To produce patches, run:: 111 112 make coccicheck MODE=patch 113 114 115The coccicheck target applies every semantic patch available in the 116sub-directories of ``scripts/coccinelle`` to the entire Linux kernel. 117 118For each semantic patch, a commit message is proposed. It gives a 119description of the problem being checked by the semantic patch, and 120includes a reference to Coccinelle. 121 122As with any static code analyzer, Coccinelle produces false 123positives. Thus, reports must be carefully checked, and patches 124reviewed. 125 126To enable verbose messages set the V= variable, for example:: 127 128 make coccicheck MODE=report V=1 129 130Coccinelle parallelization 131-------------------------- 132 133By default, coccicheck tries to run as parallel as possible. To change 134the parallelism, set the J= variable. For example, to run across 4 CPUs:: 135 136 make coccicheck MODE=report J=4 137 138As of Coccinelle 1.0.2 Coccinelle uses Ocaml parmap for parallelization; 139if support for this is detected you will benefit from parmap parallelization. 140 141When parmap is enabled coccicheck will enable dynamic load balancing by using 142``--chunksize 1`` argument. This ensures we keep feeding threads with work 143one by one, so that we avoid the situation where most work gets done by only 144a few threads. With dynamic load balancing, if a thread finishes early we keep 145feeding it more work. 146 147When parmap is enabled, if an error occurs in Coccinelle, this error 148value is propagated back, and the return value of the ``make coccicheck`` 149command captures this return value. 150 151Using Coccinelle with a single semantic patch 152--------------------------------------------- 153 154The optional make variable COCCI can be used to check a single 155semantic patch. In that case, the variable must be initialized with 156the name of the semantic patch to apply. 157 158For instance:: 159 160 make coccicheck COCCI=<my_SP.cocci> MODE=patch 161 162or:: 163 164 make coccicheck COCCI=<my_SP.cocci> MODE=report 165 166 167Controlling Which Files are Processed by Coccinelle 168--------------------------------------------------- 169 170By default the entire kernel source tree is checked. 171 172To apply Coccinelle to a specific directory, ``M=`` can be used. 173For example, to check drivers/net/wireless/ one may write:: 174 175 make coccicheck M=drivers/net/wireless/ 176 177To apply Coccinelle on a file basis, instead of a directory basis, the 178C variable is used by the makefile to select which files to work with. 179This variable can be used to run scripts for the entire kernel, a 180specific directory, or for a single file. 181 182For example, to check drivers/bluetooth/bfusb.c, the value 1 is 183passed to the C variable to check files that make considers 184need to be compiled.:: 185 186 make C=1 CHECK=scripts/coccicheck drivers/bluetooth/bfusb.o 187 188The value 2 is passed to the C variable to check files regardless of 189whether they need to be compiled or not.:: 190 191 make C=2 CHECK=scripts/coccicheck drivers/bluetooth/bfusb.o 192 193In these modes, which work on a file basis, there is no information 194about semantic patches displayed, and no commit message proposed. 195 196This runs every semantic patch in scripts/coccinelle by default. The 197COCCI variable may additionally be used to only apply a single 198semantic patch as shown in the previous section. 199 200The "report" mode is the default. You can select another one with the 201MODE variable explained above. 202 203Debugging Coccinelle SmPL patches 204--------------------------------- 205 206Using coccicheck is best as it provides in the spatch command line 207include options matching the options used when we compile the kernel. 208You can learn what these options are by using V=1; you could then 209manually run Coccinelle with debug options added. 210 211Alternatively you can debug running Coccinelle against SmPL patches 212by asking for stderr to be redirected to stderr. By default stderr 213is redirected to /dev/null; if you'd like to capture stderr you 214can specify the ``DEBUG_FILE="file.txt"`` option to coccicheck. For 215instance:: 216 217 rm -f cocci.err 218 make coccicheck COCCI=scripts/coccinelle/free/kfree.cocci MODE=report DEBUG_FILE=cocci.err 219 cat cocci.err 220 221You can use SPFLAGS to add debugging flags; for instance you may want to 222add both --profile --show-trying to SPFLAGS when debugging. For example 223you may want to use:: 224 225 rm -f err.log 226 export COCCI=scripts/coccinelle/misc/irqf_oneshot.cocci 227 make coccicheck DEBUG_FILE="err.log" MODE=report SPFLAGS="--profile --show-trying" M=./drivers/mfd/arizona-irq.c 228 229err.log will now have the profiling information, while stdout will 230provide some progress information as Coccinelle moves forward with 231work. 232 233DEBUG_FILE support is only supported when using coccinelle >= 1.0.2. 234 235.cocciconfig support 236-------------------- 237 238Coccinelle supports reading .cocciconfig for default Coccinelle options that 239should be used every time spatch is spawned. The order of precedence for 240variables for .cocciconfig is as follows: 241 242- Your current user's home directory is processed first 243- Your directory from which spatch is called is processed next 244- The directory provided with the --dir option is processed last, if used 245 246Since coccicheck runs through make, it naturally runs from the kernel 247proper dir; as such the second rule above would be implied for picking up a 248.cocciconfig when using ``make coccicheck``. 249 250``make coccicheck`` also supports using M= targets. If you do not supply 251any M= target, it is assumed you want to target the entire kernel. 252The kernel coccicheck script has:: 253 254 if [ "$KBUILD_EXTMOD" = "" ] ; then 255 OPTIONS="--dir $srctree $COCCIINCLUDE" 256 else 257 OPTIONS="--dir $KBUILD_EXTMOD $COCCIINCLUDE" 258 fi 259 260KBUILD_EXTMOD is set when an explicit target with M= is used. For both cases 261the spatch --dir argument is used, as such third rule applies when whether M= 262is used or not, and when M= is used the target directory can have its own 263.cocciconfig file. When M= is not passed as an argument to coccicheck the 264target directory is the same as the directory from where spatch was called. 265 266If not using the kernel's coccicheck target, keep the above precedence 267order logic of .cocciconfig reading. If using the kernel's coccicheck target, 268override any of the kernel's .coccicheck's settings using SPFLAGS. 269 270We help Coccinelle when used against Linux with a set of sensible default 271options for Linux with our own Linux .cocciconfig. This hints to coccinelle 272that git can be used for ``git grep`` queries over coccigrep. A timeout of 200 273seconds should suffice for now. 274 275The options picked up by coccinelle when reading a .cocciconfig do not appear 276as arguments to spatch processes running on your system. To confirm what 277options will be used by Coccinelle run:: 278 279 spatch --print-options-only 280 281You can override with your own preferred index option by using SPFLAGS. Take 282note that when there are conflicting options Coccinelle takes precedence for 283the last options passed. Using .cocciconfig is possible to use idutils, however 284given the order of precedence followed by Coccinelle, since the kernel now 285carries its own .cocciconfig, you will need to use SPFLAGS to use idutils if 286desired. See below section "Additional flags" for more details on how to use 287idutils. 288 289Additional flags 290---------------- 291 292Additional flags can be passed to spatch through the SPFLAGS 293variable. This works as Coccinelle respects the last flags 294given to it when options are in conflict. :: 295 296 make SPFLAGS=--use-glimpse coccicheck 297 298Coccinelle supports idutils as well but requires coccinelle >= 1.0.6. 299When no ID file is specified coccinelle assumes your ID database file 300is in the file .id-utils.index on the top level of the kernel. Coccinelle 301carries a script scripts/idutils_index.sh which creates the database with:: 302 303 mkid -i C --output .id-utils.index 304 305If you have another database filename you can also just symlink with this 306name. :: 307 308 make SPFLAGS=--use-idutils coccicheck 309 310Alternatively you can specify the database filename explicitly, for 311instance:: 312 313 make SPFLAGS="--use-idutils /full-path/to/ID" coccicheck 314 315See ``spatch --help`` to learn more about spatch options. 316 317Note that the ``--use-glimpse`` and ``--use-idutils`` options 318require external tools for indexing the code. None of them is 319thus active by default. However, by indexing the code with 320one of these tools, and according to the cocci file used, 321spatch could proceed the entire code base more quickly. 322 323SmPL patch specific options 324--------------------------- 325 326SmPL patches can have their own requirements for options passed 327to Coccinelle. SmPL patch-specific options can be provided by 328providing them at the top of the SmPL patch, for instance:: 329 330 // Options: --no-includes --include-headers 331 332SmPL patch Coccinelle requirements 333---------------------------------- 334 335As Coccinelle features get added some more advanced SmPL patches 336may require newer versions of Coccinelle. If an SmPL patch requires 337a minimum version of Coccinelle, this can be specified as follows, 338as an example if requiring at least Coccinelle >= 1.0.5:: 339 340 // Requires: 1.0.5 341 342Proposing new semantic patches 343------------------------------ 344 345New semantic patches can be proposed and submitted by kernel 346developers. For sake of clarity, they should be organized in the 347sub-directories of ``scripts/coccinelle/``. 348 349 350Detailed description of the ``report`` mode 351------------------------------------------- 352 353``report`` generates a list in the following format:: 354 355 file:line:column-column: message 356 357Example 358~~~~~~~ 359 360Running:: 361 362 make coccicheck MODE=report COCCI=scripts/coccinelle/api/err_cast.cocci 363 364will execute the following part of the SmPL script:: 365 366 <smpl> 367 @r depends on !context && !patch && (org || report)@ 368 expression x; 369 position p; 370 @@ 371 372 ERR_PTR@p(PTR_ERR(x)) 373 374 @script:python depends on report@ 375 p << r.p; 376 x << r.x; 377 @@ 378 379 msg="ERR_CAST can be used with %s" % (x) 380 coccilib.report.print_report(p[0], msg) 381 </smpl> 382 383This SmPL excerpt generates entries on the standard output, as 384illustrated below:: 385 386 /home/user/linux/crypto/ctr.c:188:9-16: ERR_CAST can be used with alg 387 /home/user/linux/crypto/authenc.c:619:9-16: ERR_CAST can be used with auth 388 /home/user/linux/crypto/xts.c:227:9-16: ERR_CAST can be used with alg 389 390 391Detailed description of the ``patch`` mode 392------------------------------------------ 393 394When the ``patch`` mode is available, it proposes a fix for each problem 395identified. 396 397Example 398~~~~~~~ 399 400Running:: 401 402 make coccicheck MODE=patch COCCI=scripts/coccinelle/api/err_cast.cocci 403 404will execute the following part of the SmPL script:: 405 406 <smpl> 407 @ depends on !context && patch && !org && !report @ 408 expression x; 409 @@ 410 411 - ERR_PTR(PTR_ERR(x)) 412 + ERR_CAST(x) 413 </smpl> 414 415This SmPL excerpt generates patch hunks on the standard output, as 416illustrated below:: 417 418 diff -u -p a/crypto/ctr.c b/crypto/ctr.c 419 --- a/crypto/ctr.c 2010-05-26 10:49:38.000000000 +0200 420 +++ b/crypto/ctr.c 2010-06-03 23:44:49.000000000 +0200 421 @@ -185,7 +185,7 @@ static struct crypto_instance *crypto_ct 422 alg = crypto_attr_alg(tb[1], CRYPTO_ALG_TYPE_CIPHER, 423 CRYPTO_ALG_TYPE_MASK); 424 if (IS_ERR(alg)) 425 - return ERR_PTR(PTR_ERR(alg)); 426 + return ERR_CAST(alg); 427 428 /* Block size must be >= 4 bytes. */ 429 err = -EINVAL; 430 431Detailed description of the ``context`` mode 432-------------------------------------------- 433 434``context`` highlights lines of interest and their context 435in a diff-like style. 436 437 **NOTE**: The diff-like output generated is NOT an applicable patch. The 438 intent of the ``context`` mode is to highlight the important lines 439 (annotated with minus, ``-``) and gives some surrounding context 440 lines around. This output can be used with the diff mode of 441 Emacs to review the code. 442 443Example 444~~~~~~~ 445 446Running:: 447 448 make coccicheck MODE=context COCCI=scripts/coccinelle/api/err_cast.cocci 449 450will execute the following part of the SmPL script:: 451 452 <smpl> 453 @ depends on context && !patch && !org && !report@ 454 expression x; 455 @@ 456 457 * ERR_PTR(PTR_ERR(x)) 458 </smpl> 459 460This SmPL excerpt generates diff hunks on the standard output, as 461illustrated below:: 462 463 diff -u -p /home/user/linux/crypto/ctr.c /tmp/nothing 464 --- /home/user/linux/crypto/ctr.c 2010-05-26 10:49:38.000000000 +0200 465 +++ /tmp/nothing 466 @@ -185,7 +185,6 @@ static struct crypto_instance *crypto_ct 467 alg = crypto_attr_alg(tb[1], CRYPTO_ALG_TYPE_CIPHER, 468 CRYPTO_ALG_TYPE_MASK); 469 if (IS_ERR(alg)) 470 - return ERR_PTR(PTR_ERR(alg)); 471 472 /* Block size must be >= 4 bytes. */ 473 err = -EINVAL; 474 475Detailed description of the ``org`` mode 476---------------------------------------- 477 478``org`` generates a report in the Org mode format of Emacs. 479 480Example 481~~~~~~~ 482 483Running:: 484 485 make coccicheck MODE=org COCCI=scripts/coccinelle/api/err_cast.cocci 486 487will execute the following part of the SmPL script:: 488 489 <smpl> 490 @r depends on !context && !patch && (org || report)@ 491 expression x; 492 position p; 493 @@ 494 495 ERR_PTR@p(PTR_ERR(x)) 496 497 @script:python depends on org@ 498 p << r.p; 499 x << r.x; 500 @@ 501 502 msg="ERR_CAST can be used with %s" % (x) 503 msg_safe=msg.replace("[","@(").replace("]",")") 504 coccilib.org.print_todo(p[0], msg_safe) 505 </smpl> 506 507This SmPL excerpt generates Org entries on the standard output, as 508illustrated below:: 509 510 * TODO [[view:/home/user/linux/crypto/ctr.c::face=ovl-face1::linb=188::colb=9::cole=16][ERR_CAST can be used with alg]] 511 * TODO [[view:/home/user/linux/crypto/authenc.c::face=ovl-face1::linb=619::colb=9::cole=16][ERR_CAST can be used with auth]] 512 * TODO [[view:/home/user/linux/crypto/xts.c::face=ovl-face1::linb=227::colb=9::cole=16][ERR_CAST can be used with alg]] 513