912 lines
37 KiB
ReStructuredText
912 lines
37 KiB
ReStructuredText
.. SPDX-License-Identifier: GPL-2.0
|
||
|
||
===============
|
||
Detailed Usages
|
||
===============
|
||
|
||
DAMON provides below interfaces for different users.
|
||
|
||
- *DAMON user space tool.*
|
||
`This <https://github.com/awslabs/damo>`_ is for privileged people such as
|
||
system administrators who want a just-working human-friendly interface.
|
||
Using this, users can use the DAMON’s major features in a human-friendly way.
|
||
It may not be highly tuned for special cases, though. For more detail,
|
||
please refer to its `usage document
|
||
<https://github.com/awslabs/damo/blob/next/USAGE.md>`_.
|
||
- *sysfs interface.*
|
||
:ref:`This <sysfs_interface>` is for privileged user space programmers who
|
||
want more optimized use of DAMON. Using this, users can use DAMON’s major
|
||
features by reading from and writing to special sysfs files. Therefore,
|
||
you can write and use your personalized DAMON sysfs wrapper programs that
|
||
reads/writes the sysfs files instead of you. The `DAMON user space tool
|
||
<https://github.com/awslabs/damo>`_ is one example of such programs.
|
||
- *Kernel Space Programming Interface.*
|
||
:doc:`This </mm/damon/api>` is for kernel space programmers. Using this,
|
||
users can utilize every feature of DAMON most flexibly and efficiently by
|
||
writing kernel space DAMON application programs for you. You can even extend
|
||
DAMON for various address spaces. For detail, please refer to the interface
|
||
:doc:`document </mm/damon/api>`.
|
||
- *debugfs interface. (DEPRECATED!)*
|
||
:ref:`This <debugfs_interface>` is almost identical to :ref:`sysfs interface
|
||
<sysfs_interface>`. This is deprecated, so users should move to the
|
||
:ref:`sysfs interface <sysfs_interface>`. If you depend on this and cannot
|
||
move, please report your usecase to damon@lists.linux.dev and
|
||
linux-mm@kvack.org.
|
||
|
||
.. _sysfs_interface:
|
||
|
||
sysfs Interface
|
||
===============
|
||
|
||
DAMON sysfs interface is built when ``CONFIG_DAMON_SYSFS`` is defined. It
|
||
creates multiple directories and files under its sysfs directory,
|
||
``<sysfs>/kernel/mm/damon/``. You can control DAMON by writing to and reading
|
||
from the files under the directory.
|
||
|
||
For a short example, users can monitor the virtual address space of a given
|
||
workload as below. ::
|
||
|
||
# cd /sys/kernel/mm/damon/admin/
|
||
# echo 1 > kdamonds/nr_kdamonds && echo 1 > kdamonds/0/contexts/nr_contexts
|
||
# echo vaddr > kdamonds/0/contexts/0/operations
|
||
# echo 1 > kdamonds/0/contexts/0/targets/nr_targets
|
||
# echo $(pidof <workload>) > kdamonds/0/contexts/0/targets/0/pid_target
|
||
# echo on > kdamonds/0/state
|
||
|
||
Files Hierarchy
|
||
---------------
|
||
|
||
The files hierarchy of DAMON sysfs interface is shown below. In the below
|
||
figure, parents-children relations are represented with indentations, each
|
||
directory is having ``/`` suffix, and files in each directory are separated by
|
||
comma (",").
|
||
|
||
.. parsed-literal::
|
||
|
||
:ref:`/sys/kernel/mm/damon <sysfs_root>`/admin
|
||
│ :ref:`kdamonds <sysfs_kdamonds>`/nr_kdamonds
|
||
│ │ :ref:`0 <sysfs_kdamond>`/state,pid
|
||
│ │ │ :ref:`contexts <sysfs_contexts>`/nr_contexts
|
||
│ │ │ │ :ref:`0 <sysfs_context>`/avail_operations,operations
|
||
│ │ │ │ │ :ref:`monitoring_attrs <sysfs_monitoring_attrs>`/
|
||
│ │ │ │ │ │ intervals/sample_us,aggr_us,update_us
|
||
│ │ │ │ │ │ nr_regions/min,max
|
||
│ │ │ │ │ :ref:`targets <sysfs_targets>`/nr_targets
|
||
│ │ │ │ │ │ :ref:`0 <sysfs_target>`/pid_target
|
||
│ │ │ │ │ │ │ :ref:`regions <sysfs_regions>`/nr_regions
|
||
│ │ │ │ │ │ │ │ :ref:`0 <sysfs_region>`/start,end
|
||
│ │ │ │ │ │ │ │ ...
|
||
│ │ │ │ │ │ ...
|
||
│ │ │ │ │ :ref:`schemes <sysfs_schemes>`/nr_schemes
|
||
│ │ │ │ │ │ :ref:`0 <sysfs_scheme>`/action,apply_interval_us
|
||
│ │ │ │ │ │ │ :ref:`access_pattern <sysfs_access_pattern>`/
|
||
│ │ │ │ │ │ │ │ sz/min,max
|
||
│ │ │ │ │ │ │ │ nr_accesses/min,max
|
||
│ │ │ │ │ │ │ │ age/min,max
|
||
│ │ │ │ │ │ │ :ref:`quotas <sysfs_quotas>`/ms,bytes,reset_interval_ms
|
||
│ │ │ │ │ │ │ │ weights/sz_permil,nr_accesses_permil,age_permil
|
||
│ │ │ │ │ │ │ │ :ref:`goals <sysfs_schemes_quota_goals>`/nr_goals
|
||
│ │ │ │ │ │ │ │ │ 0/target_value,current_value
|
||
│ │ │ │ │ │ │ :ref:`watermarks <sysfs_watermarks>`/metric,interval_us,high,mid,low
|
||
│ │ │ │ │ │ │ :ref:`filters <sysfs_filters>`/nr_filters
|
||
│ │ │ │ │ │ │ │ 0/type,matching,memcg_id
|
||
│ │ │ │ │ │ │ :ref:`stats <sysfs_schemes_stats>`/nr_tried,sz_tried,nr_applied,sz_applied,qt_exceeds
|
||
│ │ │ │ │ │ │ :ref:`tried_regions <sysfs_schemes_tried_regions>`/total_bytes
|
||
│ │ │ │ │ │ │ │ 0/start,end,nr_accesses,age
|
||
│ │ │ │ │ │ │ │ ...
|
||
│ │ │ │ │ │ ...
|
||
│ │ │ │ ...
|
||
│ │ ...
|
||
|
||
.. _sysfs_root:
|
||
|
||
Root
|
||
----
|
||
|
||
The root of the DAMON sysfs interface is ``<sysfs>/kernel/mm/damon/``, and it
|
||
has one directory named ``admin``. The directory contains the files for
|
||
privileged user space programs' control of DAMON. User space tools or daemons
|
||
having the root permission could use this directory.
|
||
|
||
.. _sysfs_kdamonds:
|
||
|
||
kdamonds/
|
||
---------
|
||
|
||
Under the ``admin`` directory, one directory, ``kdamonds``, which has files for
|
||
controlling the kdamonds (refer to
|
||
:ref:`design <damon_design_execution_model_and_data_structures>` for more
|
||
details) exists. In the beginning, this directory has only one file,
|
||
``nr_kdamonds``. Writing a number (``N``) to the file creates the number of
|
||
child directories named ``0`` to ``N-1``. Each directory represents each
|
||
kdamond.
|
||
|
||
.. _sysfs_kdamond:
|
||
|
||
kdamonds/<N>/
|
||
-------------
|
||
|
||
In each kdamond directory, two files (``state`` and ``pid``) and one directory
|
||
(``contexts``) exist.
|
||
|
||
Reading ``state`` returns ``on`` if the kdamond is currently running, or
|
||
``off`` if it is not running.
|
||
|
||
Users can write below commands for the kdamond to the ``state`` file.
|
||
|
||
- ``on``: Start running.
|
||
- ``off``: Stop running.
|
||
- ``commit``: Read the user inputs in the sysfs files except ``state`` file
|
||
again.
|
||
- ``commit_schemes_quota_goals``: Read the DAMON-based operation schemes'
|
||
:ref:`quota goals <sysfs_schemes_quota_goals>`.
|
||
- ``update_schemes_stats``: Update the contents of stats files for each
|
||
DAMON-based operation scheme of the kdamond. For details of the stats,
|
||
please refer to :ref:`stats section <sysfs_schemes_stats>`.
|
||
- ``update_schemes_tried_regions``: Update the DAMON-based operation scheme
|
||
action tried regions directory for each DAMON-based operation scheme of the
|
||
kdamond. For details of the DAMON-based operation scheme action tried
|
||
regions directory, please refer to
|
||
:ref:`tried_regions section <sysfs_schemes_tried_regions>`.
|
||
- ``update_schemes_tried_bytes``: Update only ``.../tried_regions/total_bytes``
|
||
files.
|
||
- ``clear_schemes_tried_regions``: Clear the DAMON-based operating scheme
|
||
action tried regions directory for each DAMON-based operation scheme of the
|
||
kdamond.
|
||
|
||
If the state is ``on``, reading ``pid`` shows the pid of the kdamond thread.
|
||
|
||
``contexts`` directory contains files for controlling the monitoring contexts
|
||
that this kdamond will execute.
|
||
|
||
.. _sysfs_contexts:
|
||
|
||
kdamonds/<N>/contexts/
|
||
----------------------
|
||
|
||
In the beginning, this directory has only one file, ``nr_contexts``. Writing a
|
||
number (``N``) to the file creates the number of child directories named as
|
||
``0`` to ``N-1``. Each directory represents each monitoring context (refer to
|
||
:ref:`design <damon_design_execution_model_and_data_structures>` for more
|
||
details). At the moment, only one context per kdamond is supported, so only
|
||
``0`` or ``1`` can be written to the file.
|
||
|
||
.. _sysfs_context:
|
||
|
||
contexts/<N>/
|
||
-------------
|
||
|
||
In each context directory, two files (``avail_operations`` and ``operations``)
|
||
and three directories (``monitoring_attrs``, ``targets``, and ``schemes``)
|
||
exist.
|
||
|
||
DAMON supports multiple types of monitoring operations, including those for
|
||
virtual address space and the physical address space. You can get the list of
|
||
available monitoring operations set on the currently running kernel by reading
|
||
``avail_operations`` file. Based on the kernel configuration, the file will
|
||
list some or all of below keywords.
|
||
|
||
- vaddr: Monitor virtual address spaces of specific processes
|
||
- fvaddr: Monitor fixed virtual address ranges
|
||
- paddr: Monitor the physical address space of the system
|
||
|
||
Please refer to :ref:`regions sysfs directory <sysfs_regions>` for detailed
|
||
differences between the operations sets in terms of the monitoring target
|
||
regions.
|
||
|
||
You can set and get what type of monitoring operations DAMON will use for the
|
||
context by writing one of the keywords listed in ``avail_operations`` file and
|
||
reading from the ``operations`` file.
|
||
|
||
.. _sysfs_monitoring_attrs:
|
||
|
||
contexts/<N>/monitoring_attrs/
|
||
------------------------------
|
||
|
||
Files for specifying attributes of the monitoring including required quality
|
||
and efficiency of the monitoring are in ``monitoring_attrs`` directory.
|
||
Specifically, two directories, ``intervals`` and ``nr_regions`` exist in this
|
||
directory.
|
||
|
||
Under ``intervals`` directory, three files for DAMON's sampling interval
|
||
(``sample_us``), aggregation interval (``aggr_us``), and update interval
|
||
(``update_us``) exist. You can set and get the values in micro-seconds by
|
||
writing to and reading from the files.
|
||
|
||
Under ``nr_regions`` directory, two files for the lower-bound and upper-bound
|
||
of DAMON's monitoring regions (``min`` and ``max``, respectively), which
|
||
controls the monitoring overhead, exist. You can set and get the values by
|
||
writing to and rading from the files.
|
||
|
||
For more details about the intervals and monitoring regions range, please refer
|
||
to the Design document (:doc:`/mm/damon/design`).
|
||
|
||
.. _sysfs_targets:
|
||
|
||
contexts/<N>/targets/
|
||
---------------------
|
||
|
||
In the beginning, this directory has only one file, ``nr_targets``. Writing a
|
||
number (``N``) to the file creates the number of child directories named ``0``
|
||
to ``N-1``. Each directory represents each monitoring target.
|
||
|
||
.. _sysfs_target:
|
||
|
||
targets/<N>/
|
||
------------
|
||
|
||
In each target directory, one file (``pid_target``) and one directory
|
||
(``regions``) exist.
|
||
|
||
If you wrote ``vaddr`` to the ``contexts/<N>/operations``, each target should
|
||
be a process. You can specify the process to DAMON by writing the pid of the
|
||
process to the ``pid_target`` file.
|
||
|
||
.. _sysfs_regions:
|
||
|
||
targets/<N>/regions
|
||
-------------------
|
||
|
||
When ``vaddr`` monitoring operations set is being used (``vaddr`` is written to
|
||
the ``contexts/<N>/operations`` file), DAMON automatically sets and updates the
|
||
monitoring target regions so that entire memory mappings of target processes
|
||
can be covered. However, users could want to set the initial monitoring region
|
||
to specific address ranges.
|
||
|
||
In contrast, DAMON do not automatically sets and updates the monitoring target
|
||
regions when ``fvaddr`` or ``paddr`` monitoring operations sets are being used
|
||
(``fvaddr`` or ``paddr`` have written to the ``contexts/<N>/operations``).
|
||
Therefore, users should set the monitoring target regions by themselves in the
|
||
cases.
|
||
|
||
For such cases, users can explicitly set the initial monitoring target regions
|
||
as they want, by writing proper values to the files under this directory.
|
||
|
||
In the beginning, this directory has only one file, ``nr_regions``. Writing a
|
||
number (``N``) to the file creates the number of child directories named ``0``
|
||
to ``N-1``. Each directory represents each initial monitoring target region.
|
||
|
||
.. _sysfs_region:
|
||
|
||
regions/<N>/
|
||
------------
|
||
|
||
In each region directory, you will find two files (``start`` and ``end``). You
|
||
can set and get the start and end addresses of the initial monitoring target
|
||
region by writing to and reading from the files, respectively.
|
||
|
||
Each region should not overlap with others. ``end`` of directory ``N`` should
|
||
be equal or smaller than ``start`` of directory ``N+1``.
|
||
|
||
.. _sysfs_schemes:
|
||
|
||
contexts/<N>/schemes/
|
||
---------------------
|
||
|
||
The directory for DAMON-based Operation Schemes (:ref:`DAMOS
|
||
<damon_design_damos>`). Users can get and set the schemes by reading from and
|
||
writing to files under this directory.
|
||
|
||
In the beginning, this directory has only one file, ``nr_schemes``. Writing a
|
||
number (``N``) to the file creates the number of child directories named ``0``
|
||
to ``N-1``. Each directory represents each DAMON-based operation scheme.
|
||
|
||
.. _sysfs_scheme:
|
||
|
||
schemes/<N>/
|
||
------------
|
||
|
||
In each scheme directory, five directories (``access_pattern``, ``quotas``,
|
||
``watermarks``, ``filters``, ``stats``, and ``tried_regions``) and two files
|
||
(``action`` and ``apply_interval``) exist.
|
||
|
||
The ``action`` file is for setting and getting the scheme's :ref:`action
|
||
<damon_design_damos_action>`. The keywords that can be written to and read
|
||
from the file and their meaning are as below.
|
||
|
||
Note that support of each action depends on the running DAMON operations set
|
||
:ref:`implementation <sysfs_context>`.
|
||
|
||
- ``willneed``: Call ``madvise()`` for the region with ``MADV_WILLNEED``.
|
||
Supported by ``vaddr`` and ``fvaddr`` operations set.
|
||
- ``cold``: Call ``madvise()`` for the region with ``MADV_COLD``.
|
||
Supported by ``vaddr`` and ``fvaddr`` operations set.
|
||
- ``pageout``: Call ``madvise()`` for the region with ``MADV_PAGEOUT``.
|
||
Supported by ``vaddr``, ``fvaddr`` and ``paddr`` operations set.
|
||
- ``hugepage``: Call ``madvise()`` for the region with ``MADV_HUGEPAGE``.
|
||
Supported by ``vaddr`` and ``fvaddr`` operations set.
|
||
- ``nohugepage``: Call ``madvise()`` for the region with ``MADV_NOHUGEPAGE``.
|
||
Supported by ``vaddr`` and ``fvaddr`` operations set.
|
||
- ``lru_prio``: Prioritize the region on its LRU lists.
|
||
Supported by ``paddr`` operations set.
|
||
- ``lru_deprio``: Deprioritize the region on its LRU lists.
|
||
Supported by ``paddr`` operations set.
|
||
- ``stat``: Do nothing but count the statistics.
|
||
Supported by all operations sets.
|
||
|
||
The ``apply_interval_us`` file is for setting and getting the scheme's
|
||
:ref:`apply_interval <damon_design_damos>` in microseconds.
|
||
|
||
.. _sysfs_access_pattern:
|
||
|
||
schemes/<N>/access_pattern/
|
||
---------------------------
|
||
|
||
The directory for the target access :ref:`pattern
|
||
<damon_design_damos_access_pattern>` of the given DAMON-based operation scheme.
|
||
|
||
Under the ``access_pattern`` directory, three directories (``sz``,
|
||
``nr_accesses``, and ``age``) each having two files (``min`` and ``max``)
|
||
exist. You can set and get the access pattern for the given scheme by writing
|
||
to and reading from the ``min`` and ``max`` files under ``sz``,
|
||
``nr_accesses``, and ``age`` directories, respectively. Note that the ``min``
|
||
and the ``max`` form a closed interval.
|
||
|
||
.. _sysfs_quotas:
|
||
|
||
schemes/<N>/quotas/
|
||
-------------------
|
||
|
||
The directory for the :ref:`quotas <damon_design_damos_quotas>` of the given
|
||
DAMON-based operation scheme.
|
||
|
||
Under ``quotas`` directory, three files (``ms``, ``bytes``,
|
||
``reset_interval_ms``) and two directores (``weights`` and ``goals``) exist.
|
||
|
||
You can set the ``time quota`` in milliseconds, ``size quota`` in bytes, and
|
||
``reset interval`` in milliseconds by writing the values to the three files,
|
||
respectively. Then, DAMON tries to use only up to ``time quota`` milliseconds
|
||
for applying the ``action`` to memory regions of the ``access_pattern``, and to
|
||
apply the action to only up to ``bytes`` bytes of memory regions within the
|
||
``reset_interval_ms``. Setting both ``ms`` and ``bytes`` zero disables the
|
||
quota limits.
|
||
|
||
Under ``weights`` directory, three files (``sz_permil``,
|
||
``nr_accesses_permil``, and ``age_permil``) exist.
|
||
You can set the :ref:`prioritization weights
|
||
<damon_design_damos_quotas_prioritization>` for size, access frequency, and age
|
||
in per-thousand unit by writing the values to the three files under the
|
||
``weights`` directory.
|
||
|
||
.. _sysfs_schemes_quota_goals:
|
||
|
||
schemes/<N>/quotas/goals/
|
||
-------------------------
|
||
|
||
The directory for the :ref:`automatic quota tuning goals
|
||
<damon_design_damos_quotas_auto_tuning>` of the given DAMON-based operation
|
||
scheme.
|
||
|
||
In the beginning, this directory has only one file, ``nr_goals``. Writing a
|
||
number (``N``) to the file creates the number of child directories named ``0``
|
||
to ``N-1``. Each directory represents each goal and current achievement.
|
||
Among the multiple feedback, the best one is used.
|
||
|
||
Each goal directory contains two files, namely ``target_value`` and
|
||
``current_value``. Users can set and get any number to those files to set the
|
||
feedback. User space main workload's latency or throughput, system metrics
|
||
like free memory ratio or memory pressure stall time (PSI) could be example
|
||
metrics for the values. Note that users should write
|
||
``commit_schemes_quota_goals`` to the ``state`` file of the :ref:`kdamond
|
||
directory <sysfs_kdamond>` to pass the feedback to DAMON.
|
||
|
||
.. _sysfs_watermarks:
|
||
|
||
schemes/<N>/watermarks/
|
||
-----------------------
|
||
|
||
The directory for the :ref:`watermarks <damon_design_damos_watermarks>` of the
|
||
given DAMON-based operation scheme.
|
||
|
||
Under the watermarks directory, five files (``metric``, ``interval_us``,
|
||
``high``, ``mid``, and ``low``) for setting the metric, the time interval
|
||
between check of the metric, and the three watermarks exist. You can set and
|
||
get the five values by writing to the files, respectively.
|
||
|
||
Keywords and meanings of those that can be written to the ``metric`` file are
|
||
as below.
|
||
|
||
- none: Ignore the watermarks
|
||
- free_mem_rate: System's free memory rate (per thousand)
|
||
|
||
The ``interval`` should written in microseconds unit.
|
||
|
||
.. _sysfs_filters:
|
||
|
||
schemes/<N>/filters/
|
||
--------------------
|
||
|
||
The directory for the :ref:`filters <damon_design_damos_filters>` of the given
|
||
DAMON-based operation scheme.
|
||
|
||
In the beginning, this directory has only one file, ``nr_filters``. Writing a
|
||
number (``N``) to the file creates the number of child directories named ``0``
|
||
to ``N-1``. Each directory represents each filter. The filters are evaluated
|
||
in the numeric order.
|
||
|
||
Each filter directory contains six files, namely ``type``, ``matcing``,
|
||
``memcg_path``, ``addr_start``, ``addr_end``, and ``target_idx``. To ``type``
|
||
file, you can write one of four special keywords: ``anon`` for anonymous pages,
|
||
``memcg`` for specific memory cgroup, ``addr`` for specific address range (an
|
||
open-ended interval), or ``target`` for specific DAMON monitoring target
|
||
filtering. In case of the memory cgroup filtering, you can specify the memory
|
||
cgroup of the interest by writing the path of the memory cgroup from the
|
||
cgroups mount point to ``memcg_path`` file. In case of the address range
|
||
filtering, you can specify the start and end address of the range to
|
||
``addr_start`` and ``addr_end`` files, respectively. For the DAMON monitoring
|
||
target filtering, you can specify the index of the target between the list of
|
||
the DAMON context's monitoring targets list to ``target_idx`` file. You can
|
||
write ``Y`` or ``N`` to ``matching`` file to filter out pages that does or does
|
||
not match to the type, respectively. Then, the scheme's action will not be
|
||
applied to the pages that specified to be filtered out.
|
||
|
||
For example, below restricts a DAMOS action to be applied to only non-anonymous
|
||
pages of all memory cgroups except ``/having_care_already``.::
|
||
|
||
# echo 2 > nr_filters
|
||
# # filter out anonymous pages
|
||
echo anon > 0/type
|
||
echo Y > 0/matching
|
||
# # further filter out all cgroups except one at '/having_care_already'
|
||
echo memcg > 1/type
|
||
echo /having_care_already > 1/memcg_path
|
||
echo N > 1/matching
|
||
|
||
Note that ``anon`` and ``memcg`` filters are currently supported only when
|
||
``paddr`` :ref:`implementation <sysfs_context>` is being used.
|
||
|
||
Also, memory regions that are filtered out by ``addr`` or ``target`` filters
|
||
are not counted as the scheme has tried to those, while regions that filtered
|
||
out by other type filters are counted as the scheme has tried to. The
|
||
difference is applied to :ref:`stats <damos_stats>` and
|
||
:ref:`tried regions <sysfs_schemes_tried_regions>`.
|
||
|
||
.. _sysfs_schemes_stats:
|
||
|
||
schemes/<N>/stats/
|
||
------------------
|
||
|
||
DAMON counts the total number and bytes of regions that each scheme is tried to
|
||
be applied, the two numbers for the regions that each scheme is successfully
|
||
applied, and the total number of the quota limit exceeds. This statistics can
|
||
be used for online analysis or tuning of the schemes.
|
||
|
||
The statistics can be retrieved by reading the files under ``stats`` directory
|
||
(``nr_tried``, ``sz_tried``, ``nr_applied``, ``sz_applied``, and
|
||
``qt_exceeds``), respectively. The files are not updated in real time, so you
|
||
should ask DAMON sysfs interface to update the content of the files for the
|
||
stats by writing a special keyword, ``update_schemes_stats`` to the relevant
|
||
``kdamonds/<N>/state`` file.
|
||
|
||
.. _sysfs_schemes_tried_regions:
|
||
|
||
schemes/<N>/tried_regions/
|
||
--------------------------
|
||
|
||
This directory initially has one file, ``total_bytes``.
|
||
|
||
When a special keyword, ``update_schemes_tried_regions``, is written to the
|
||
relevant ``kdamonds/<N>/state`` file, DAMON updates the ``total_bytes`` file so
|
||
that reading it returns the total size of the scheme tried regions, and creates
|
||
directories named integer starting from ``0`` under this directory. Each
|
||
directory contains files exposing detailed information about each of the memory
|
||
region that the corresponding scheme's ``action`` has tried to be applied under
|
||
this directory, during next :ref:`apply interval <damon_design_damos>` of the
|
||
corresponding scheme. The information includes address range, ``nr_accesses``,
|
||
and ``age`` of the region.
|
||
|
||
Writing ``update_schemes_tried_bytes`` to the relevant ``kdamonds/<N>/state``
|
||
file will only update the ``total_bytes`` file, and will not create the
|
||
subdirectories.
|
||
|
||
The directories will be removed when another special keyword,
|
||
``clear_schemes_tried_regions``, is written to the relevant
|
||
``kdamonds/<N>/state`` file.
|
||
|
||
The expected usage of this directory is investigations of schemes' behaviors,
|
||
and query-like efficient data access monitoring results retrievals. For the
|
||
latter use case, in particular, users can set the ``action`` as ``stat`` and
|
||
set the ``access pattern`` as their interested pattern that they want to query.
|
||
|
||
.. _sysfs_schemes_tried_region:
|
||
|
||
tried_regions/<N>/
|
||
------------------
|
||
|
||
In each region directory, you will find four files (``start``, ``end``,
|
||
``nr_accesses``, and ``age``). Reading the files will show the start and end
|
||
addresses, ``nr_accesses``, and ``age`` of the region that corresponding
|
||
DAMON-based operation scheme ``action`` has tried to be applied.
|
||
|
||
Example
|
||
~~~~~~~
|
||
|
||
Below commands applies a scheme saying "If a memory region of size in [4KiB,
|
||
8KiB] is showing accesses per aggregate interval in [0, 5] for aggregate
|
||
interval in [10, 20], page out the region. For the paging out, use only up to
|
||
10ms per second, and also don't page out more than 1GiB per second. Under the
|
||
limitation, page out memory regions having longer age first. Also, check the
|
||
free memory rate of the system every 5 seconds, start the monitoring and paging
|
||
out when the free memory rate becomes lower than 50%, but stop it if the free
|
||
memory rate becomes larger than 60%, or lower than 30%". ::
|
||
|
||
# cd <sysfs>/kernel/mm/damon/admin
|
||
# # populate directories
|
||
# echo 1 > kdamonds/nr_kdamonds; echo 1 > kdamonds/0/contexts/nr_contexts;
|
||
# echo 1 > kdamonds/0/contexts/0/schemes/nr_schemes
|
||
# cd kdamonds/0/contexts/0/schemes/0
|
||
# # set the basic access pattern and the action
|
||
# echo 4096 > access_pattern/sz/min
|
||
# echo 8192 > access_pattern/sz/max
|
||
# echo 0 > access_pattern/nr_accesses/min
|
||
# echo 5 > access_pattern/nr_accesses/max
|
||
# echo 10 > access_pattern/age/min
|
||
# echo 20 > access_pattern/age/max
|
||
# echo pageout > action
|
||
# # set quotas
|
||
# echo 10 > quotas/ms
|
||
# echo $((1024*1024*1024)) > quotas/bytes
|
||
# echo 1000 > quotas/reset_interval_ms
|
||
# # set watermark
|
||
# echo free_mem_rate > watermarks/metric
|
||
# echo 5000000 > watermarks/interval_us
|
||
# echo 600 > watermarks/high
|
||
# echo 500 > watermarks/mid
|
||
# echo 300 > watermarks/low
|
||
|
||
Please note that it's highly recommended to use user space tools like `damo
|
||
<https://github.com/awslabs/damo>`_ rather than manually reading and writing
|
||
the files as above. Above is only for an example.
|
||
|
||
.. _tracepoint:
|
||
|
||
Tracepoints for Monitoring Results
|
||
==================================
|
||
|
||
Users can get the monitoring results via the :ref:`tried_regions
|
||
<sysfs_schemes_tried_regions>`. The interface is useful for getting a
|
||
snapshot, but it could be inefficient for fully recording all the monitoring
|
||
results. For the purpose, two trace points, namely ``damon:damon_aggregated``
|
||
and ``damon:damos_before_apply``, are provided. ``damon:damon_aggregated``
|
||
provides the whole monitoring results, while ``damon:damos_before_apply``
|
||
provides the monitoring results for regions that each DAMON-based Operation
|
||
Scheme (:ref:`DAMOS <damon_design_damos>`) is gonna be applied. Hence,
|
||
``damon:damos_before_apply`` is more useful for recording internal behavior of
|
||
DAMOS, or DAMOS target access
|
||
:ref:`pattern <damon_design_damos_access_pattern>` based query-like efficient
|
||
monitoring results recording.
|
||
|
||
While the monitoring is turned on, you could record the tracepoint events and
|
||
show results using tracepoint supporting tools like ``perf``. For example::
|
||
|
||
# echo on > monitor_on
|
||
# perf record -e damon:damon_aggregated &
|
||
# sleep 5
|
||
# kill 9 $(pidof perf)
|
||
# echo off > monitor_on
|
||
# perf script
|
||
kdamond.0 46568 [027] 79357.842179: damon:damon_aggregated: target_id=0 nr_regions=11 122509119488-135708762112: 0 864
|
||
[...]
|
||
|
||
Each line of the perf script output represents each monitoring region. The
|
||
first five fields are as usual other tracepoint outputs. The sixth field
|
||
(``target_id=X``) shows the ide of the monitoring target of the region. The
|
||
seventh field (``nr_regions=X``) shows the total number of monitoring regions
|
||
for the target. The eighth field (``X-Y:``) shows the start (``X``) and end
|
||
(``Y``) addresses of the region in bytes. The ninth field (``X``) shows the
|
||
``nr_accesses`` of the region (refer to
|
||
:ref:`design <damon_design_region_based_sampling>` for more details of the
|
||
counter). Finally the tenth field (``X``) shows the ``age`` of the region
|
||
(refer to :ref:`design <damon_design_age_tracking>` for more details of the
|
||
counter).
|
||
|
||
If the event was ``damon:damos_beofre_apply``, the ``perf script`` output would
|
||
be somewhat like below::
|
||
|
||
kdamond.0 47293 [000] 80801.060214: damon:damos_before_apply: ctx_idx=0 scheme_idx=0 target_idx=0 nr_regions=11 121932607488-135128711168: 0 136
|
||
[...]
|
||
|
||
Each line of the output represents each monitoring region that each DAMON-based
|
||
Operation Scheme was about to be applied at the traced time. The first five
|
||
fields are as usual. It shows the index of the DAMON context (``ctx_idx=X``)
|
||
of the scheme in the list of the contexts of the context's kdamond, the index
|
||
of the scheme (``scheme_idx=X``) in the list of the schemes of the context, in
|
||
addition to the output of ``damon_aggregated`` tracepoint.
|
||
|
||
|
||
.. _debugfs_interface:
|
||
|
||
debugfs Interface (DEPRECATED!)
|
||
===============================
|
||
|
||
.. note::
|
||
|
||
THIS IS DEPRECATED!
|
||
|
||
DAMON debugfs interface is deprecated, so users should move to the
|
||
:ref:`sysfs interface <sysfs_interface>`. If you depend on this and cannot
|
||
move, please report your usecase to damon@lists.linux.dev and
|
||
linux-mm@kvack.org.
|
||
|
||
DAMON exports eight files, ``attrs``, ``target_ids``, ``init_regions``,
|
||
``schemes``, ``monitor_on``, ``kdamond_pid``, ``mk_contexts`` and
|
||
``rm_contexts`` under its debugfs directory, ``<debugfs>/damon/``.
|
||
|
||
|
||
Attributes
|
||
----------
|
||
|
||
Users can get and set the ``sampling interval``, ``aggregation interval``,
|
||
``update interval``, and min/max number of monitoring target regions by
|
||
reading from and writing to the ``attrs`` file. To know about the monitoring
|
||
attributes in detail, please refer to the :doc:`/mm/damon/design`. For
|
||
example, below commands set those values to 5 ms, 100 ms, 1,000 ms, 10 and
|
||
1000, and then check it again::
|
||
|
||
# cd <debugfs>/damon
|
||
# echo 5000 100000 1000000 10 1000 > attrs
|
||
# cat attrs
|
||
5000 100000 1000000 10 1000
|
||
|
||
|
||
Target IDs
|
||
----------
|
||
|
||
Some types of address spaces supports multiple monitoring target. For example,
|
||
the virtual memory address spaces monitoring can have multiple processes as the
|
||
monitoring targets. Users can set the targets by writing relevant id values of
|
||
the targets to, and get the ids of the current targets by reading from the
|
||
``target_ids`` file. In case of the virtual address spaces monitoring, the
|
||
values should be pids of the monitoring target processes. For example, below
|
||
commands set processes having pids 42 and 4242 as the monitoring targets and
|
||
check it again::
|
||
|
||
# cd <debugfs>/damon
|
||
# echo 42 4242 > target_ids
|
||
# cat target_ids
|
||
42 4242
|
||
|
||
Users can also monitor the physical memory address space of the system by
|
||
writing a special keyword, "``paddr\n``" to the file. Because physical address
|
||
space monitoring doesn't support multiple targets, reading the file will show a
|
||
fake value, ``42``, as below::
|
||
|
||
# cd <debugfs>/damon
|
||
# echo paddr > target_ids
|
||
# cat target_ids
|
||
42
|
||
|
||
Note that setting the target ids doesn't start the monitoring.
|
||
|
||
|
||
Initial Monitoring Target Regions
|
||
---------------------------------
|
||
|
||
In case of the virtual address space monitoring, DAMON automatically sets and
|
||
updates the monitoring target regions so that entire memory mappings of target
|
||
processes can be covered. However, users can want to limit the monitoring
|
||
region to specific address ranges, such as the heap, the stack, or specific
|
||
file-mapped area. Or, some users can know the initial access pattern of their
|
||
workloads and therefore want to set optimal initial regions for the 'adaptive
|
||
regions adjustment'.
|
||
|
||
In contrast, DAMON do not automatically sets and updates the monitoring target
|
||
regions in case of physical memory monitoring. Therefore, users should set the
|
||
monitoring target regions by themselves.
|
||
|
||
In such cases, users can explicitly set the initial monitoring target regions
|
||
as they want, by writing proper values to the ``init_regions`` file. The input
|
||
should be a sequence of three integers separated by white spaces that represent
|
||
one region in below form.::
|
||
|
||
<target idx> <start address> <end address>
|
||
|
||
The ``target idx`` should be the index of the target in ``target_ids`` file,
|
||
starting from ``0``, and the regions should be passed in address order. For
|
||
example, below commands will set a couple of address ranges, ``1-100`` and
|
||
``100-200`` as the initial monitoring target region of pid 42, which is the
|
||
first one (index ``0``) in ``target_ids``, and another couple of address
|
||
ranges, ``20-40`` and ``50-100`` as that of pid 4242, which is the second one
|
||
(index ``1``) in ``target_ids``.::
|
||
|
||
# cd <debugfs>/damon
|
||
# cat target_ids
|
||
42 4242
|
||
# echo "0 1 100 \
|
||
0 100 200 \
|
||
1 20 40 \
|
||
1 50 100" > init_regions
|
||
|
||
Note that this sets the initial monitoring target regions only. In case of
|
||
virtual memory monitoring, DAMON will automatically updates the boundary of the
|
||
regions after one ``update interval``. Therefore, users should set the
|
||
``update interval`` large enough in this case, if they don't want the
|
||
update.
|
||
|
||
|
||
Schemes
|
||
-------
|
||
|
||
Users can get and set the DAMON-based operation :ref:`schemes
|
||
<damon_design_damos>` by reading from and writing to ``schemes`` debugfs file.
|
||
Reading the file also shows the statistics of each scheme. To the file, each
|
||
of the schemes should be represented in each line in below form::
|
||
|
||
<target access pattern> <action> <quota> <watermarks>
|
||
|
||
You can disable schemes by simply writing an empty string to the file.
|
||
|
||
Target Access Pattern
|
||
~~~~~~~~~~~~~~~~~~~~~
|
||
|
||
The target access :ref:`pattern <damon_design_damos_access_pattern>` of the
|
||
scheme. The ``<target access pattern>`` is constructed with three ranges in
|
||
below form::
|
||
|
||
min-size max-size min-acc max-acc min-age max-age
|
||
|
||
Specifically, bytes for the size of regions (``min-size`` and ``max-size``),
|
||
number of monitored accesses per aggregate interval for access frequency
|
||
(``min-acc`` and ``max-acc``), number of aggregate intervals for the age of
|
||
regions (``min-age`` and ``max-age``) are specified. Note that the ranges are
|
||
closed interval.
|
||
|
||
Action
|
||
~~~~~~
|
||
|
||
The ``<action>`` is a predefined integer for memory management :ref:`actions
|
||
<damon_design_damos_action>`. The supported numbers and their meanings are as
|
||
below.
|
||
|
||
- 0: Call ``madvise()`` for the region with ``MADV_WILLNEED``. Ignored if
|
||
``target`` is ``paddr``.
|
||
- 1: Call ``madvise()`` for the region with ``MADV_COLD``. Ignored if
|
||
``target`` is ``paddr``.
|
||
- 2: Call ``madvise()`` for the region with ``MADV_PAGEOUT``.
|
||
- 3: Call ``madvise()`` for the region with ``MADV_HUGEPAGE``. Ignored if
|
||
``target`` is ``paddr``.
|
||
- 4: Call ``madvise()`` for the region with ``MADV_NOHUGEPAGE``. Ignored if
|
||
``target`` is ``paddr``.
|
||
- 5: Do nothing but count the statistics
|
||
|
||
Quota
|
||
~~~~~
|
||
|
||
Users can set the :ref:`quotas <damon_design_damos_quotas>` of the given scheme
|
||
via the ``<quota>`` in below form::
|
||
|
||
<ms> <sz> <reset interval> <priority weights>
|
||
|
||
This makes DAMON to try to use only up to ``<ms>`` milliseconds for applying
|
||
the action to memory regions of the ``target access pattern`` within the
|
||
``<reset interval>`` milliseconds, and to apply the action to only up to
|
||
``<sz>`` bytes of memory regions within the ``<reset interval>``. Setting both
|
||
``<ms>`` and ``<sz>`` zero disables the quota limits.
|
||
|
||
For the :ref:`prioritization <damon_design_damos_quotas_prioritization>`, users
|
||
can set the weights for the three properties in ``<priority weights>`` in below
|
||
form::
|
||
|
||
<size weight> <access frequency weight> <age weight>
|
||
|
||
Watermarks
|
||
~~~~~~~~~~
|
||
|
||
Users can specify :ref:`watermarks <damon_design_damos_watermarks>` of the
|
||
given scheme via ``<watermarks>`` in below form::
|
||
|
||
<metric> <check interval> <high mark> <middle mark> <low mark>
|
||
|
||
``<metric>`` is a predefined integer for the metric to be checked. The
|
||
supported numbers and their meanings are as below.
|
||
|
||
- 0: Ignore the watermarks
|
||
- 1: System's free memory rate (per thousand)
|
||
|
||
The value of the metric is checked every ``<check interval>`` microseconds.
|
||
|
||
If the value is higher than ``<high mark>`` or lower than ``<low mark>``, the
|
||
scheme is deactivated. If the value is lower than ``<mid mark>``, the scheme
|
||
is activated.
|
||
|
||
.. _damos_stats:
|
||
|
||
Statistics
|
||
~~~~~~~~~~
|
||
|
||
It also counts the total number and bytes of regions that each scheme is tried
|
||
to be applied, the two numbers for the regions that each scheme is successfully
|
||
applied, and the total number of the quota limit exceeds. This statistics can
|
||
be used for online analysis or tuning of the schemes.
|
||
|
||
The statistics can be shown by reading the ``schemes`` file. Reading the file
|
||
will show each scheme you entered in each line, and the five numbers for the
|
||
statistics will be added at the end of each line.
|
||
|
||
Example
|
||
~~~~~~~
|
||
|
||
Below commands applies a scheme saying "If a memory region of size in [4KiB,
|
||
8KiB] is showing accesses per aggregate interval in [0, 5] for aggregate
|
||
interval in [10, 20], page out the region. For the paging out, use only up to
|
||
10ms per second, and also don't page out more than 1GiB per second. Under the
|
||
limitation, page out memory regions having longer age first. Also, check the
|
||
free memory rate of the system every 5 seconds, start the monitoring and paging
|
||
out when the free memory rate becomes lower than 50%, but stop it if the free
|
||
memory rate becomes larger than 60%, or lower than 30%".::
|
||
|
||
# cd <debugfs>/damon
|
||
# scheme="4096 8192 0 5 10 20 2" # target access pattern and action
|
||
# scheme+=" 10 $((1024*1024*1024)) 1000" # quotas
|
||
# scheme+=" 0 0 100" # prioritization weights
|
||
# scheme+=" 1 5000000 600 500 300" # watermarks
|
||
# echo "$scheme" > schemes
|
||
|
||
|
||
Turning On/Off
|
||
--------------
|
||
|
||
Setting the files as described above doesn't incur effect unless you explicitly
|
||
start the monitoring. You can start, stop, and check the current status of the
|
||
monitoring by writing to and reading from the ``monitor_on`` file. Writing
|
||
``on`` to the file starts the monitoring of the targets with the attributes.
|
||
Writing ``off`` to the file stops those. DAMON also stops if every target
|
||
process is terminated. Below example commands turn on, off, and check the
|
||
status of DAMON::
|
||
|
||
# cd <debugfs>/damon
|
||
# echo on > monitor_on
|
||
# echo off > monitor_on
|
||
# cat monitor_on
|
||
off
|
||
|
||
Please note that you cannot write to the above-mentioned debugfs files while
|
||
the monitoring is turned on. If you write to the files while DAMON is running,
|
||
an error code such as ``-EBUSY`` will be returned.
|
||
|
||
|
||
Monitoring Thread PID
|
||
---------------------
|
||
|
||
DAMON does requested monitoring with a kernel thread called ``kdamond``. You
|
||
can get the pid of the thread by reading the ``kdamond_pid`` file. When the
|
||
monitoring is turned off, reading the file returns ``none``. ::
|
||
|
||
# cd <debugfs>/damon
|
||
# cat monitor_on
|
||
off
|
||
# cat kdamond_pid
|
||
none
|
||
# echo on > monitor_on
|
||
# cat kdamond_pid
|
||
18594
|
||
|
||
|
||
Using Multiple Monitoring Threads
|
||
---------------------------------
|
||
|
||
One ``kdamond`` thread is created for each monitoring context. You can create
|
||
and remove monitoring contexts for multiple ``kdamond`` required use case using
|
||
the ``mk_contexts`` and ``rm_contexts`` files.
|
||
|
||
Writing the name of the new context to the ``mk_contexts`` file creates a
|
||
directory of the name on the DAMON debugfs directory. The directory will have
|
||
DAMON debugfs files for the context. ::
|
||
|
||
# cd <debugfs>/damon
|
||
# ls foo
|
||
# ls: cannot access 'foo': No such file or directory
|
||
# echo foo > mk_contexts
|
||
# ls foo
|
||
# attrs init_regions kdamond_pid schemes target_ids
|
||
|
||
If the context is not needed anymore, you can remove it and the corresponding
|
||
directory by putting the name of the context to the ``rm_contexts`` file. ::
|
||
|
||
# echo foo > rm_contexts
|
||
# ls foo
|
||
# ls: cannot access 'foo': No such file or directory
|
||
|
||
Note that ``mk_contexts``, ``rm_contexts``, and ``monitor_on`` files are in the
|
||
root directory only.
|