1.. SPDX-License-Identifier: GPL-2.0 2 3=============== 4Detailed Usages 5=============== 6 7DAMON provides below interfaces for different users. 8 9- *DAMON user space tool.* 10 `This <https://github.com/awslabs/damo>`_ is for privileged people such as 11 system administrators who want a just-working human-friendly interface. 12 Using this, users can use the DAMON’s major features in a human-friendly way. 13 It may not be highly tuned for special cases, though. It supports both 14 virtual and physical address spaces monitoring. For more detail, please 15 refer to its `usage document 16 <https://github.com/awslabs/damo/blob/next/USAGE.md>`_. 17- *sysfs interface.* 18 :ref:`This <sysfs_interface>` is for privileged user space programmers who 19 want more optimized use of DAMON. Using this, users can use DAMON’s major 20 features by reading from and writing to special sysfs files. Therefore, 21 you can write and use your personalized DAMON sysfs wrapper programs that 22 reads/writes the sysfs files instead of you. The `DAMON user space tool 23 <https://github.com/awslabs/damo>`_ is one example of such programs. It 24 supports both virtual and physical address spaces monitoring. Note that this 25 interface provides only simple :ref:`statistics <damos_stats>` for the 26 monitoring results. For detailed monitoring results, DAMON provides a 27 :ref:`tracepoint <tracepoint>`. 28- *debugfs interface.* 29 :ref:`This <debugfs_interface>` is almost identical to :ref:`sysfs interface 30 <sysfs_interface>`. This will be removed after next LTS kernel is released, 31 so users should move to the :ref:`sysfs interface <sysfs_interface>`. 32- *Kernel Space Programming Interface.* 33 :doc:`This </mm/damon/api>` is for kernel space programmers. Using this, 34 users can utilize every feature of DAMON most flexibly and efficiently by 35 writing kernel space DAMON application programs for you. You can even extend 36 DAMON for various address spaces. For detail, please refer to the interface 37 :doc:`document </mm/damon/api>`. 38 39.. _sysfs_interface: 40 41sysfs Interface 42=============== 43 44DAMON sysfs interface is built when ``CONFIG_DAMON_SYSFS`` is defined. It 45creates multiple directories and files under its sysfs directory, 46``<sysfs>/kernel/mm/damon/``. You can control DAMON by writing to and reading 47from the files under the directory. 48 49For a short example, users can monitor the virtual address space of a given 50workload as below. :: 51 52 # cd /sys/kernel/mm/damon/admin/ 53 # echo 1 > kdamonds/nr_kdamonds && echo 1 > kdamonds/0/contexts/nr_contexts 54 # echo vaddr > kdamonds/0/contexts/0/operations 55 # echo 1 > kdamonds/0/contexts/0/targets/nr_targets 56 # echo $(pidof <workload>) > kdamonds/0/contexts/0/targets/0/pid_target 57 # echo on > kdamonds/0/state 58 59Files Hierarchy 60--------------- 61 62The files hierarchy of DAMON sysfs interface is shown below. In the below 63figure, parents-children relations are represented with indentations, each 64directory is having ``/`` suffix, and files in each directory are separated by 65comma (","). :: 66 67 /sys/kernel/mm/damon/admin 68 │ kdamonds/nr_kdamonds 69 │ │ 0/state,pid 70 │ │ │ contexts/nr_contexts 71 │ │ │ │ 0/avail_operations,operations 72 │ │ │ │ │ monitoring_attrs/ 73 │ │ │ │ │ │ intervals/sample_us,aggr_us,update_us 74 │ │ │ │ │ │ nr_regions/min,max 75 │ │ │ │ │ targets/nr_targets 76 │ │ │ │ │ │ 0/pid_target 77 │ │ │ │ │ │ │ regions/nr_regions 78 │ │ │ │ │ │ │ │ 0/start,end 79 │ │ │ │ │ │ │ │ ... 80 │ │ │ │ │ │ ... 81 │ │ │ │ │ schemes/nr_schemes 82 │ │ │ │ │ │ 0/action 83 │ │ │ │ │ │ │ access_pattern/ 84 │ │ │ │ │ │ │ │ sz/min,max 85 │ │ │ │ │ │ │ │ nr_accesses/min,max 86 │ │ │ │ │ │ │ │ age/min,max 87 │ │ │ │ │ │ │ quotas/ms,bytes,reset_interval_ms 88 │ │ │ │ │ │ │ │ weights/sz_permil,nr_accesses_permil,age_permil 89 │ │ │ │ │ │ │ watermarks/metric,interval_us,high,mid,low 90 │ │ │ │ │ │ │ filters/nr_filters 91 │ │ │ │ │ │ │ │ 0/type,matching,memcg_id 92 │ │ │ │ │ │ │ stats/nr_tried,sz_tried,nr_applied,sz_applied,qt_exceeds 93 │ │ │ │ │ │ │ tried_regions/ 94 │ │ │ │ │ │ │ │ 0/start,end,nr_accesses,age 95 │ │ │ │ │ │ │ │ ... 96 │ │ │ │ │ │ ... 97 │ │ │ │ ... 98 │ │ ... 99 100Root 101---- 102 103The root of the DAMON sysfs interface is ``<sysfs>/kernel/mm/damon/``, and it 104has one directory named ``admin``. The directory contains the files for 105privileged user space programs' control of DAMON. User space tools or deamons 106having the root permission could use this directory. 107 108kdamonds/ 109--------- 110 111The monitoring-related information including request specifications and results 112are called DAMON context. DAMON executes each context with a kernel thread 113called kdamond, and multiple kdamonds could run in parallel. 114 115Under the ``admin`` directory, one directory, ``kdamonds``, which has files for 116controlling the kdamonds exist. In the beginning, this directory has only one 117file, ``nr_kdamonds``. Writing a number (``N``) to the file creates the number 118of child directories named ``0`` to ``N-1``. Each directory represents each 119kdamond. 120 121kdamonds/<N>/ 122------------- 123 124In each kdamond directory, two files (``state`` and ``pid``) and one directory 125(``contexts``) exist. 126 127Reading ``state`` returns ``on`` if the kdamond is currently running, or 128``off`` if it is not running. Writing ``on`` or ``off`` makes the kdamond be 129in the state. Writing ``commit`` to the ``state`` file makes kdamond reads the 130user inputs in the sysfs files except ``state`` file again. Writing 131``update_schemes_stats`` to ``state`` file updates the contents of stats files 132for each DAMON-based operation scheme of the kdamond. For details of the 133stats, please refer to :ref:`stats section <sysfs_schemes_stats>`. Writing 134``update_schemes_tried_regions`` to ``state`` file updates the DAMON-based 135operation scheme action tried regions directory for each DAMON-based operation 136scheme of the kdamond. Writing ``clear_schemes_tried_regions`` to ``state`` 137file clears the DAMON-based operating scheme action tried regions directory for 138each DAMON-based operation scheme of the kdamond. For details of the 139DAMON-based operation scheme action tried regions directory, please refer to 140:ref:tried_regions section <sysfs_schemes_tried_regions>`. 141 142If the state is ``on``, reading ``pid`` shows the pid of the kdamond thread. 143 144``contexts`` directory contains files for controlling the monitoring contexts 145that this kdamond will execute. 146 147kdamonds/<N>/contexts/ 148---------------------- 149 150In the beginning, this directory has only one file, ``nr_contexts``. Writing a 151number (``N``) to the file creates the number of child directories named as 152``0`` to ``N-1``. Each directory represents each monitoring context. At the 153moment, only one context per kdamond is supported, so only ``0`` or ``1`` can 154be written to the file. 155 156.. _sysfs_contexts: 157 158contexts/<N>/ 159------------- 160 161In each context directory, two files (``avail_operations`` and ``operations``) 162and three directories (``monitoring_attrs``, ``targets``, and ``schemes``) 163exist. 164 165DAMON supports multiple types of monitoring operations, including those for 166virtual address space and the physical address space. You can get the list of 167available monitoring operations set on the currently running kernel by reading 168``avail_operations`` file. Based on the kernel configuration, the file will 169list some or all of below keywords. 170 171 - vaddr: Monitor virtual address spaces of specific processes 172 - fvaddr: Monitor fixed virtual address ranges 173 - paddr: Monitor the physical address space of the system 174 175Please refer to :ref:`regions sysfs directory <sysfs_regions>` for detailed 176differences between the operations sets in terms of the monitoring target 177regions. 178 179You can set and get what type of monitoring operations DAMON will use for the 180context by writing one of the keywords listed in ``avail_operations`` file and 181reading from the ``operations`` file. 182 183.. _sysfs_monitoring_attrs: 184 185contexts/<N>/monitoring_attrs/ 186------------------------------ 187 188Files for specifying attributes of the monitoring including required quality 189and efficiency of the monitoring are in ``monitoring_attrs`` directory. 190Specifically, two directories, ``intervals`` and ``nr_regions`` exist in this 191directory. 192 193Under ``intervals`` directory, three files for DAMON's sampling interval 194(``sample_us``), aggregation interval (``aggr_us``), and update interval 195(``update_us``) exist. You can set and get the values in micro-seconds by 196writing to and reading from the files. 197 198Under ``nr_regions`` directory, two files for the lower-bound and upper-bound 199of DAMON's monitoring regions (``min`` and ``max``, respectively), which 200controls the monitoring overhead, exist. You can set and get the values by 201writing to and rading from the files. 202 203For more details about the intervals and monitoring regions range, please refer 204to the Design document (:doc:`/mm/damon/design`). 205 206contexts/<N>/targets/ 207--------------------- 208 209In the beginning, this directory has only one file, ``nr_targets``. Writing a 210number (``N``) to the file creates the number of child directories named ``0`` 211to ``N-1``. Each directory represents each monitoring target. 212 213targets/<N>/ 214------------ 215 216In each target directory, one file (``pid_target``) and one directory 217(``regions``) exist. 218 219If you wrote ``vaddr`` to the ``contexts/<N>/operations``, each target should 220be a process. You can specify the process to DAMON by writing the pid of the 221process to the ``pid_target`` file. 222 223.. _sysfs_regions: 224 225targets/<N>/regions 226------------------- 227 228When ``vaddr`` monitoring operations set is being used (``vaddr`` is written to 229the ``contexts/<N>/operations`` file), DAMON automatically sets and updates the 230monitoring target regions so that entire memory mappings of target processes 231can be covered. However, users could want to set the initial monitoring region 232to specific address ranges. 233 234In contrast, DAMON do not automatically sets and updates the monitoring target 235regions when ``fvaddr`` or ``paddr`` monitoring operations sets are being used 236(``fvaddr`` or ``paddr`` have written to the ``contexts/<N>/operations``). 237Therefore, users should set the monitoring target regions by themselves in the 238cases. 239 240For such cases, users can explicitly set the initial monitoring target regions 241as they want, by writing proper values to the files under this directory. 242 243In the beginning, this directory has only one file, ``nr_regions``. Writing a 244number (``N``) to the file creates the number of child directories named ``0`` 245to ``N-1``. Each directory represents each initial monitoring target region. 246 247regions/<N>/ 248------------ 249 250In each region directory, you will find two files (``start`` and ``end``). You 251can set and get the start and end addresses of the initial monitoring target 252region by writing to and reading from the files, respectively. 253 254Each region should not overlap with others. ``end`` of directory ``N`` should 255be equal or smaller than ``start`` of directory ``N+1``. 256 257contexts/<N>/schemes/ 258--------------------- 259 260For usual DAMON-based data access aware memory management optimizations, users 261would normally want the system to apply a memory management action to a memory 262region of a specific access pattern. DAMON receives such formalized operation 263schemes from the user and applies those to the target memory regions. Users 264can get and set the schemes by reading from and writing to files under this 265directory. 266 267In the beginning, this directory has only one file, ``nr_schemes``. Writing a 268number (``N``) to the file creates the number of child directories named ``0`` 269to ``N-1``. Each directory represents each DAMON-based operation scheme. 270 271schemes/<N>/ 272------------ 273 274In each scheme directory, five directories (``access_pattern``, ``quotas``, 275``watermarks``, ``filters``, ``stats``, and ``tried_regions``) and one file 276(``action``) exist. 277 278The ``action`` file is for setting and getting what action you want to apply to 279memory regions having specific access pattern of the interest. The keywords 280that can be written to and read from the file and their meaning are as below. 281 282 - ``willneed``: Call ``madvise()`` for the region with ``MADV_WILLNEED`` 283 - ``cold``: Call ``madvise()`` for the region with ``MADV_COLD`` 284 - ``pageout``: Call ``madvise()`` for the region with ``MADV_PAGEOUT`` 285 - ``hugepage``: Call ``madvise()`` for the region with ``MADV_HUGEPAGE`` 286 - ``nohugepage``: Call ``madvise()`` for the region with ``MADV_NOHUGEPAGE`` 287 - ``lru_prio``: Prioritize the region on its LRU lists. 288 - ``lru_deprio``: Deprioritize the region on its LRU lists. 289 - ``stat``: Do nothing but count the statistics 290 291schemes/<N>/access_pattern/ 292--------------------------- 293 294The target access pattern of each DAMON-based operation scheme is constructed 295with three ranges including the size of the region in bytes, number of 296monitored accesses per aggregate interval, and number of aggregated intervals 297for the age of the region. 298 299Under the ``access_pattern`` directory, three directories (``sz``, 300``nr_accesses``, and ``age``) each having two files (``min`` and ``max``) 301exist. You can set and get the access pattern for the given scheme by writing 302to and reading from the ``min`` and ``max`` files under ``sz``, 303``nr_accesses``, and ``age`` directories, respectively. 304 305schemes/<N>/quotas/ 306------------------- 307 308Optimal ``target access pattern`` for each ``action`` is workload dependent, so 309not easy to find. Worse yet, setting a scheme of some action too aggressive 310can cause severe overhead. To avoid such overhead, users can limit time and 311size quota for each scheme. In detail, users can ask DAMON to try to use only 312up to specific time (``time quota``) for applying the action, and to apply the 313action to only up to specific amount (``size quota``) of memory regions having 314the target access pattern within a given time interval (``reset interval``). 315 316When the quota limit is expected to be exceeded, DAMON prioritizes found memory 317regions of the ``target access pattern`` based on their size, access frequency, 318and age. For personalized prioritization, users can set the weights for the 319three properties. 320 321Under ``quotas`` directory, three files (``ms``, ``bytes``, 322``reset_interval_ms``) and one directory (``weights``) having three files 323(``sz_permil``, ``nr_accesses_permil``, and ``age_permil``) in it exist. 324 325You can set the ``time quota`` in milliseconds, ``size quota`` in bytes, and 326``reset interval`` in milliseconds by writing the values to the three files, 327respectively. You can also set the prioritization weights for size, access 328frequency, and age in per-thousand unit by writing the values to the three 329files under the ``weights`` directory. 330 331schemes/<N>/watermarks/ 332----------------------- 333 334To allow easy activation and deactivation of each scheme based on system 335status, DAMON provides a feature called watermarks. The feature receives five 336values called ``metric``, ``interval``, ``high``, ``mid``, and ``low``. The 337``metric`` is the system metric such as free memory ratio that can be measured. 338If the metric value of the system is higher than the value in ``high`` or lower 339than ``low`` at the memoent, the scheme is deactivated. If the value is lower 340than ``mid``, the scheme is activated. 341 342Under the watermarks directory, five files (``metric``, ``interval_us``, 343``high``, ``mid``, and ``low``) for setting each value exist. You can set and 344get the five values by writing to the files, respectively. 345 346Keywords and meanings of those that can be written to the ``metric`` file are 347as below. 348 349 - none: Ignore the watermarks 350 - free_mem_rate: System's free memory rate (per thousand) 351 352The ``interval`` should written in microseconds unit. 353 354schemes/<N>/filters/ 355-------------------- 356 357Users could know something more than the kernel for specific types of memory. 358In the case, users could do their own management for the memory and hence 359doesn't want DAMOS bothers that. Users could limit DAMOS by setting the access 360pattern of the scheme and/or the monitoring regions for the purpose, but that 361can be inefficient in some cases. In such cases, users could set non-access 362pattern driven filters using files in this directory. 363 364In the beginning, this directory has only one file, ``nr_filters``. Writing a 365number (``N``) to the file creates the number of child directories named ``0`` 366to ``N-1``. Each directory represents each filter. The filters are evaluated 367in the numeric order. 368 369Each filter directory contains three files, namely ``type``, ``matcing``, and 370``memcg_path``. You can write one of two special keywords, ``anon`` for 371anonymous pages, or ``memcg`` for specific memory cgroup filtering. In case of 372the memory cgroup filtering, you can specify the memory cgroup of the interest 373by writing the path of the memory cgroup from the cgroups mount point to 374``memcg_path`` file. You can write ``Y`` or ``N`` to ``matching`` file to 375filter out pages that does or does not match to the type, respectively. Then, 376the scheme's action will not be applied to the pages that specified to be 377filtered out. 378 379For example, below restricts a DAMOS action to be applied to only non-anonymous 380pages of all memory cgroups except ``/having_care_already``.:: 381 382 # echo 2 > nr_filters 383 # # filter out anonymous pages 384 echo anon > 0/type 385 echo Y > 0/matching 386 # # further filter out all cgroups except one at '/having_care_already' 387 echo memcg > 1/type 388 echo /having_care_already > 1/memcg_path 389 echo N > 1/matching 390 391Note that filters could be ignored depend on the running DAMON operations set 392`implementation <sysfs_contexts>`. 393 394.. _sysfs_schemes_stats: 395 396schemes/<N>/stats/ 397------------------ 398 399DAMON counts the total number and bytes of regions that each scheme is tried to 400be applied, the two numbers for the regions that each scheme is successfully 401applied, and the total number of the quota limit exceeds. This statistics can 402be used for online analysis or tuning of the schemes. 403 404The statistics can be retrieved by reading the files under ``stats`` directory 405(``nr_tried``, ``sz_tried``, ``nr_applied``, ``sz_applied``, and 406``qt_exceeds``), respectively. The files are not updated in real time, so you 407should ask DAMON sysfs interface to updte the content of the files for the 408stats by writing a special keyword, ``update_schemes_stats`` to the relevant 409``kdamonds/<N>/state`` file. 410 411.. _sysfs_schemes_tried_regions: 412 413schemes/<N>/tried_regions/ 414-------------------------- 415 416When a special keyword, ``update_schemes_tried_regions``, is written to the 417relevant ``kdamonds/<N>/state`` file, DAMON creates directories named integer 418starting from ``0`` under this directory. Each directory contains files 419exposing detailed information about each of the memory region that the 420corresponding scheme's ``action`` has tried to be applied under this directory, 421during next :ref:`aggregation interval <sysfs_monitoring_attrs>`. The 422information includes address range, ``nr_accesses``, , and ``age`` of the 423region. 424 425The directories will be removed when another special keyword, 426``clear_schemes_tried_regions``, is written to the relevant 427``kdamonds/<N>/state`` file. 428 429tried_regions/<N>/ 430------------------ 431 432In each region directory, you will find four files (``start``, ``end``, 433``nr_accesses``, and ``age``). Reading the files will show the start and end 434addresses, ``nr_accesses``, and ``age`` of the region that corresponding 435DAMON-based operation scheme ``action`` has tried to be applied. 436 437Example 438~~~~~~~ 439 440Below commands applies a scheme saying "If a memory region of size in [4KiB, 4418KiB] is showing accesses per aggregate interval in [0, 5] for aggregate 442interval in [10, 20], page out the region. For the paging out, use only up to 44310ms per second, and also don't page out more than 1GiB per second. Under the 444limitation, page out memory regions having longer age first. Also, check the 445free memory rate of the system every 5 seconds, start the monitoring and paging 446out when the free memory rate becomes lower than 50%, but stop it if the free 447memory rate becomes larger than 60%, or lower than 30%". :: 448 449 # cd <sysfs>/kernel/mm/damon/admin 450 # # populate directories 451 # echo 1 > kdamonds/nr_kdamonds; echo 1 > kdamonds/0/contexts/nr_contexts; 452 # echo 1 > kdamonds/0/contexts/0/schemes/nr_schemes 453 # cd kdamonds/0/contexts/0/schemes/0 454 # # set the basic access pattern and the action 455 # echo 4096 > access_pattern/sz/min 456 # echo 8192 > access_pattern/sz/max 457 # echo 0 > access_pattern/nr_accesses/min 458 # echo 5 > access_pattern/nr_accesses/max 459 # echo 10 > access_pattern/age/min 460 # echo 20 > access_pattern/age/max 461 # echo pageout > action 462 # # set quotas 463 # echo 10 > quotas/ms 464 # echo $((1024*1024*1024)) > quotas/bytes 465 # echo 1000 > quotas/reset_interval_ms 466 # # set watermark 467 # echo free_mem_rate > watermarks/metric 468 # echo 5000000 > watermarks/interval_us 469 # echo 600 > watermarks/high 470 # echo 500 > watermarks/mid 471 # echo 300 > watermarks/low 472 473Please note that it's highly recommended to use user space tools like `damo 474<https://github.com/awslabs/damo>`_ rather than manually reading and writing 475the files as above. Above is only for an example. 476 477.. _debugfs_interface: 478 479debugfs Interface 480================= 481 482.. note:: 483 484 DAMON debugfs interface will be removed after next LTS kernel is released, so 485 users should move to the :ref:`sysfs interface <sysfs_interface>`. 486 487DAMON exports eight files, ``attrs``, ``target_ids``, ``init_regions``, 488``schemes``, ``monitor_on``, ``kdamond_pid``, ``mk_contexts`` and 489``rm_contexts`` under its debugfs directory, ``<debugfs>/damon/``. 490 491 492Attributes 493---------- 494 495Users can get and set the ``sampling interval``, ``aggregation interval``, 496``update interval``, and min/max number of monitoring target regions by 497reading from and writing to the ``attrs`` file. To know about the monitoring 498attributes in detail, please refer to the :doc:`/mm/damon/design`. For 499example, below commands set those values to 5 ms, 100 ms, 1,000 ms, 10 and 5001000, and then check it again:: 501 502 # cd <debugfs>/damon 503 # echo 5000 100000 1000000 10 1000 > attrs 504 # cat attrs 505 5000 100000 1000000 10 1000 506 507 508Target IDs 509---------- 510 511Some types of address spaces supports multiple monitoring target. For example, 512the virtual memory address spaces monitoring can have multiple processes as the 513monitoring targets. Users can set the targets by writing relevant id values of 514the targets to, and get the ids of the current targets by reading from the 515``target_ids`` file. In case of the virtual address spaces monitoring, the 516values should be pids of the monitoring target processes. For example, below 517commands set processes having pids 42 and 4242 as the monitoring targets and 518check it again:: 519 520 # cd <debugfs>/damon 521 # echo 42 4242 > target_ids 522 # cat target_ids 523 42 4242 524 525Users can also monitor the physical memory address space of the system by 526writing a special keyword, "``paddr\n``" to the file. Because physical address 527space monitoring doesn't support multiple targets, reading the file will show a 528fake value, ``42``, as below:: 529 530 # cd <debugfs>/damon 531 # echo paddr > target_ids 532 # cat target_ids 533 42 534 535Note that setting the target ids doesn't start the monitoring. 536 537 538Initial Monitoring Target Regions 539--------------------------------- 540 541In case of the virtual address space monitoring, DAMON automatically sets and 542updates the monitoring target regions so that entire memory mappings of target 543processes can be covered. However, users can want to limit the monitoring 544region to specific address ranges, such as the heap, the stack, or specific 545file-mapped area. Or, some users can know the initial access pattern of their 546workloads and therefore want to set optimal initial regions for the 'adaptive 547regions adjustment'. 548 549In contrast, DAMON do not automatically sets and updates the monitoring target 550regions in case of physical memory monitoring. Therefore, users should set the 551monitoring target regions by themselves. 552 553In such cases, users can explicitly set the initial monitoring target regions 554as they want, by writing proper values to the ``init_regions`` file. The input 555should be a sequence of three integers separated by white spaces that represent 556one region in below form.:: 557 558 <target idx> <start address> <end address> 559 560The ``target idx`` should be the index of the target in ``target_ids`` file, 561starting from ``0``, and the regions should be passed in address order. For 562example, below commands will set a couple of address ranges, ``1-100`` and 563``100-200`` as the initial monitoring target region of pid 42, which is the 564first one (index ``0``) in ``target_ids``, and another couple of address 565ranges, ``20-40`` and ``50-100`` as that of pid 4242, which is the second one 566(index ``1``) in ``target_ids``.:: 567 568 # cd <debugfs>/damon 569 # cat target_ids 570 42 4242 571 # echo "0 1 100 \ 572 0 100 200 \ 573 1 20 40 \ 574 1 50 100" > init_regions 575 576Note that this sets the initial monitoring target regions only. In case of 577virtual memory monitoring, DAMON will automatically updates the boundary of the 578regions after one ``update interval``. Therefore, users should set the 579``update interval`` large enough in this case, if they don't want the 580update. 581 582 583Schemes 584------- 585 586For usual DAMON-based data access aware memory management optimizations, users 587would simply want the system to apply a memory management action to a memory 588region of a specific access pattern. DAMON receives such formalized operation 589schemes from the user and applies those to the target processes. 590 591Users can get and set the schemes by reading from and writing to ``schemes`` 592debugfs file. Reading the file also shows the statistics of each scheme. To 593the file, each of the schemes should be represented in each line in below 594form:: 595 596 <target access pattern> <action> <quota> <watermarks> 597 598You can disable schemes by simply writing an empty string to the file. 599 600Target Access Pattern 601~~~~~~~~~~~~~~~~~~~~~ 602 603The ``<target access pattern>`` is constructed with three ranges in below 604form:: 605 606 min-size max-size min-acc max-acc min-age max-age 607 608Specifically, bytes for the size of regions (``min-size`` and ``max-size``), 609number of monitored accesses per aggregate interval for access frequency 610(``min-acc`` and ``max-acc``), number of aggregate intervals for the age of 611regions (``min-age`` and ``max-age``) are specified. Note that the ranges are 612closed interval. 613 614Action 615~~~~~~ 616 617The ``<action>`` is a predefined integer for memory management actions, which 618DAMON will apply to the regions having the target access pattern. The 619supported numbers and their meanings are as below. 620 621 - 0: Call ``madvise()`` for the region with ``MADV_WILLNEED`` 622 - 1: Call ``madvise()`` for the region with ``MADV_COLD`` 623 - 2: Call ``madvise()`` for the region with ``MADV_PAGEOUT`` 624 - 3: Call ``madvise()`` for the region with ``MADV_HUGEPAGE`` 625 - 4: Call ``madvise()`` for the region with ``MADV_NOHUGEPAGE`` 626 - 5: Do nothing but count the statistics 627 628Quota 629~~~~~ 630 631Optimal ``target access pattern`` for each ``action`` is workload dependent, so 632not easy to find. Worse yet, setting a scheme of some action too aggressive 633can cause severe overhead. To avoid such overhead, users can limit time and 634size quota for the scheme via the ``<quota>`` in below form:: 635 636 <ms> <sz> <reset interval> <priority weights> 637 638This makes DAMON to try to use only up to ``<ms>`` milliseconds for applying 639the action to memory regions of the ``target access pattern`` within the 640``<reset interval>`` milliseconds, and to apply the action to only up to 641``<sz>`` bytes of memory regions within the ``<reset interval>``. Setting both 642``<ms>`` and ``<sz>`` zero disables the quota limits. 643 644When the quota limit is expected to be exceeded, DAMON prioritizes found memory 645regions of the ``target access pattern`` based on their size, access frequency, 646and age. For personalized prioritization, users can set the weights for the 647three properties in ``<priority weights>`` in below form:: 648 649 <size weight> <access frequency weight> <age weight> 650 651Watermarks 652~~~~~~~~~~ 653 654Some schemes would need to run based on current value of the system's specific 655metrics like free memory ratio. For such cases, users can specify watermarks 656for the condition.:: 657 658 <metric> <check interval> <high mark> <middle mark> <low mark> 659 660``<metric>`` is a predefined integer for the metric to be checked. The 661supported numbers and their meanings are as below. 662 663 - 0: Ignore the watermarks 664 - 1: System's free memory rate (per thousand) 665 666The value of the metric is checked every ``<check interval>`` microseconds. 667 668If the value is higher than ``<high mark>`` or lower than ``<low mark>``, the 669scheme is deactivated. If the value is lower than ``<mid mark>``, the scheme 670is activated. 671 672.. _damos_stats: 673 674Statistics 675~~~~~~~~~~ 676 677It also counts the total number and bytes of regions that each scheme is tried 678to be applied, the two numbers for the regions that each scheme is successfully 679applied, and the total number of the quota limit exceeds. This statistics can 680be used for online analysis or tuning of the schemes. 681 682The statistics can be shown by reading the ``schemes`` file. Reading the file 683will show each scheme you entered in each line, and the five numbers for the 684statistics will be added at the end of each line. 685 686Example 687~~~~~~~ 688 689Below commands applies a scheme saying "If a memory region of size in [4KiB, 6908KiB] is showing accesses per aggregate interval in [0, 5] for aggregate 691interval in [10, 20], page out the region. For the paging out, use only up to 69210ms per second, and also don't page out more than 1GiB per second. Under the 693limitation, page out memory regions having longer age first. Also, check the 694free memory rate of the system every 5 seconds, start the monitoring and paging 695out when the free memory rate becomes lower than 50%, but stop it if the free 696memory rate becomes larger than 60%, or lower than 30%".:: 697 698 # cd <debugfs>/damon 699 # scheme="4096 8192 0 5 10 20 2" # target access pattern and action 700 # scheme+=" 10 $((1024*1024*1024)) 1000" # quotas 701 # scheme+=" 0 0 100" # prioritization weights 702 # scheme+=" 1 5000000 600 500 300" # watermarks 703 # echo "$scheme" > schemes 704 705 706Turning On/Off 707-------------- 708 709Setting the files as described above doesn't incur effect unless you explicitly 710start the monitoring. You can start, stop, and check the current status of the 711monitoring by writing to and reading from the ``monitor_on`` file. Writing 712``on`` to the file starts the monitoring of the targets with the attributes. 713Writing ``off`` to the file stops those. DAMON also stops if every target 714process is terminated. Below example commands turn on, off, and check the 715status of DAMON:: 716 717 # cd <debugfs>/damon 718 # echo on > monitor_on 719 # echo off > monitor_on 720 # cat monitor_on 721 off 722 723Please note that you cannot write to the above-mentioned debugfs files while 724the monitoring is turned on. If you write to the files while DAMON is running, 725an error code such as ``-EBUSY`` will be returned. 726 727 728Monitoring Thread PID 729--------------------- 730 731DAMON does requested monitoring with a kernel thread called ``kdamond``. You 732can get the pid of the thread by reading the ``kdamond_pid`` file. When the 733monitoring is turned off, reading the file returns ``none``. :: 734 735 # cd <debugfs>/damon 736 # cat monitor_on 737 off 738 # cat kdamond_pid 739 none 740 # echo on > monitor_on 741 # cat kdamond_pid 742 18594 743 744 745Using Multiple Monitoring Threads 746--------------------------------- 747 748One ``kdamond`` thread is created for each monitoring context. You can create 749and remove monitoring contexts for multiple ``kdamond`` required use case using 750the ``mk_contexts`` and ``rm_contexts`` files. 751 752Writing the name of the new context to the ``mk_contexts`` file creates a 753directory of the name on the DAMON debugfs directory. The directory will have 754DAMON debugfs files for the context. :: 755 756 # cd <debugfs>/damon 757 # ls foo 758 # ls: cannot access 'foo': No such file or directory 759 # echo foo > mk_contexts 760 # ls foo 761 # attrs init_regions kdamond_pid schemes target_ids 762 763If the context is not needed anymore, you can remove it and the corresponding 764directory by putting the name of the context to the ``rm_contexts`` file. :: 765 766 # echo foo > rm_contexts 767 # ls foo 768 # ls: cannot access 'foo': No such file or directory 769 770Note that ``mk_contexts``, ``rm_contexts``, and ``monitor_on`` files are in the 771root directory only. 772 773 774.. _tracepoint: 775 776Tracepoint for Monitoring Results 777================================= 778 779DAMON provides the monitoring results via a tracepoint, 780``damon:damon_aggregated``. While the monitoring is turned on, you could 781record the tracepoint events and show results using tracepoint supporting tools 782like ``perf``. For example:: 783 784 # echo on > monitor_on 785 # perf record -e damon:damon_aggregated & 786 # sleep 5 787 # kill 9 $(pidof perf) 788 # echo off > monitor_on 789 # perf script 790