135 lines
6.3 KiB
ReStructuredText
135 lines
6.3 KiB
ReStructuredText
|
====================
|
||
|
rtla-timerlat-top
|
||
|
====================
|
||
|
-------------------------------------------
|
||
|
Measures the operating system timer latency
|
||
|
-------------------------------------------
|
||
|
|
||
|
:Manual section: 1
|
||
|
|
||
|
SYNOPSIS
|
||
|
========
|
||
|
**rtla timerlat top** [*OPTIONS*] ...
|
||
|
|
||
|
DESCRIPTION
|
||
|
===========
|
||
|
|
||
|
.. include:: common_timerlat_description.rst
|
||
|
|
||
|
The **rtla timerlat top** displays a summary of the periodic output
|
||
|
from the *timerlat* tracer. It also provides information for each
|
||
|
operating system noise via the **osnoise:** tracepoints that can be
|
||
|
seem with the option **-T**.
|
||
|
|
||
|
OPTIONS
|
||
|
=======
|
||
|
|
||
|
.. include:: common_timerlat_options.rst
|
||
|
|
||
|
.. include:: common_top_options.rst
|
||
|
|
||
|
.. include:: common_options.rst
|
||
|
|
||
|
.. include:: common_timerlat_aa.rst
|
||
|
|
||
|
**--aa-only** *us*
|
||
|
|
||
|
Set stop tracing conditions and run without collecting and displaying statistics.
|
||
|
Print the auto-analysis if the system hits the stop tracing condition. This option
|
||
|
is useful to reduce rtla timerlat CPU, enabling the debug without the overhead of
|
||
|
collecting the statistics.
|
||
|
|
||
|
EXAMPLE
|
||
|
=======
|
||
|
|
||
|
In the example below, the timerlat tracer is dispatched in cpus *1-23* in the
|
||
|
automatic trace mode, instructing the tracer to stop if a *40 us* latency or
|
||
|
higher is found::
|
||
|
|
||
|
# timerlat -a 40 -c 1-23 -q
|
||
|
Timer Latency
|
||
|
0 00:00:12 | IRQ Timer Latency (us) | Thread Timer Latency (us)
|
||
|
CPU COUNT | cur min avg max | cur min avg max
|
||
|
1 #12322 | 0 0 1 15 | 10 3 9 31
|
||
|
2 #12322 | 3 0 1 12 | 10 3 9 23
|
||
|
3 #12322 | 1 0 1 21 | 8 2 8 34
|
||
|
4 #12322 | 1 0 1 17 | 10 2 11 33
|
||
|
5 #12322 | 0 0 1 12 | 8 3 8 25
|
||
|
6 #12322 | 1 0 1 14 | 16 3 11 35
|
||
|
7 #12322 | 0 0 1 14 | 9 2 8 29
|
||
|
8 #12322 | 1 0 1 22 | 9 3 9 34
|
||
|
9 #12322 | 0 0 1 14 | 8 2 8 24
|
||
|
10 #12322 | 1 0 0 12 | 9 3 8 24
|
||
|
11 #12322 | 0 0 0 15 | 6 2 7 29
|
||
|
12 #12321 | 1 0 0 13 | 5 3 8 23
|
||
|
13 #12319 | 0 0 1 14 | 9 3 9 26
|
||
|
14 #12321 | 1 0 0 13 | 6 2 8 24
|
||
|
15 #12321 | 1 0 1 15 | 12 3 11 27
|
||
|
16 #12318 | 0 0 1 13 | 7 3 10 24
|
||
|
17 #12319 | 0 0 1 13 | 11 3 9 25
|
||
|
18 #12318 | 0 0 0 12 | 8 2 8 20
|
||
|
19 #12319 | 0 0 1 18 | 10 2 9 28
|
||
|
20 #12317 | 0 0 0 20 | 9 3 8 34
|
||
|
21 #12318 | 0 0 0 13 | 8 3 8 28
|
||
|
22 #12319 | 0 0 1 11 | 8 3 10 22
|
||
|
23 #12320 | 28 0 1 28 | 41 3 11 41
|
||
|
rtla timerlat hit stop tracing
|
||
|
## CPU 23 hit stop tracing, analyzing it ##
|
||
|
IRQ handler delay: 27.49 us (65.52 %)
|
||
|
IRQ latency: 28.13 us
|
||
|
Timerlat IRQ duration: 9.59 us (22.85 %)
|
||
|
Blocking thread: 3.79 us (9.03 %)
|
||
|
objtool:49256 3.79 us
|
||
|
Blocking thread stacktrace
|
||
|
-> timerlat_irq
|
||
|
-> __hrtimer_run_queues
|
||
|
-> hrtimer_interrupt
|
||
|
-> __sysvec_apic_timer_interrupt
|
||
|
-> sysvec_apic_timer_interrupt
|
||
|
-> asm_sysvec_apic_timer_interrupt
|
||
|
-> _raw_spin_unlock_irqrestore
|
||
|
-> cgroup_rstat_flush_locked
|
||
|
-> cgroup_rstat_flush_irqsafe
|
||
|
-> mem_cgroup_flush_stats
|
||
|
-> mem_cgroup_wb_stats
|
||
|
-> balance_dirty_pages
|
||
|
-> balance_dirty_pages_ratelimited_flags
|
||
|
-> btrfs_buffered_write
|
||
|
-> btrfs_do_write_iter
|
||
|
-> vfs_write
|
||
|
-> __x64_sys_pwrite64
|
||
|
-> do_syscall_64
|
||
|
-> entry_SYSCALL_64_after_hwframe
|
||
|
------------------------------------------------------------------------
|
||
|
Thread latency: 41.96 us (100%)
|
||
|
|
||
|
The system has exit from idle latency!
|
||
|
Max timerlat IRQ latency from idle: 17.48 us in cpu 4
|
||
|
Saving trace to timerlat_trace.txt
|
||
|
|
||
|
In this case, the major factor was the delay suffered by the *IRQ handler*
|
||
|
that handles **timerlat** wakeup: *65.52%*. This can be caused by the
|
||
|
current thread masking interrupts, which can be seen in the blocking
|
||
|
thread stacktrace: the current thread (*objtool:49256*) disabled interrupts
|
||
|
via *raw spin lock* operations inside mem cgroup, while doing write
|
||
|
syscall in a btrfs file system.
|
||
|
|
||
|
The raw trace is saved in the **timerlat_trace.txt** file for further analysis.
|
||
|
|
||
|
Note that **rtla timerlat** was dispatched without changing *timerlat* tracer
|
||
|
threads' priority. That is generally not needed because these threads have
|
||
|
priority *FIFO:95* by default, which is a common priority used by real-time
|
||
|
kernel developers to analyze scheduling delays.
|
||
|
|
||
|
SEE ALSO
|
||
|
--------
|
||
|
**rtla-timerlat**\(1), **rtla-timerlat-hist**\(1)
|
||
|
|
||
|
*timerlat* tracer documentation: <https://www.kernel.org/doc/html/latest/trace/timerlat-tracer.html>
|
||
|
|
||
|
AUTHOR
|
||
|
------
|
||
|
Written by Daniel Bristot de Oliveira <bristot@kernel.org>
|
||
|
|
||
|
.. include:: common_appendix.rst
|