NAME
trace-cmd-profile - profile tasks running live
SYNOPSIS
trace-cmd profile [OPTIONS] [command]
DESCRIPTION
The trace-cmd(1) profile will start tracing just like trace-cmd-record(1), with the --profile option, except that it does not write to a file, but instead, it will read the events as they happen and will update the accounting of the events. When the trace is finished, it will report the results just like trace-cmd-report(1) would do with its --profile option. In other words, the profile command does the work of trace-cmd record --profile, and trace-cmd report --profile without having to record the data to disk, in between.
The advantage of using the profile command is that the profiling can be done over a long period of time where recording all events would take up too much disk space.
This will enable several events as well as the function graph tracer with a depth of one (if the kernel supports it). This is to show where tasks enter and exit the kernel and how long they were in the kernel.
To disable calling function graph, use the -p option to enable another tracer. To not enable any tracer, use -p nop.
All timings are currently in nanoseconds.
OPTIONS
These are the same as trace-cmd-record(1) with the --profile option.
-p tracer
Set a tracer plugin to run instead of function graph tracing set to depth of 1. To not run any tracer, use -p nop.
-S
Only enable the tracer or events speficied on the command line. With this option, the function_graph tracer is not enabled, nor are any events (like sched_switch), unless they are specifically specified on the command line (i.e. -p function -e sched_switch -e sched_wakeup)
-G
Set interrupt (soft and hard) events as global (associated to CPU instead of tasks).
-o file
Write the output of the profile to file. This supersedes --stderr
-H event-hooks
Add custom event matching to connect any two events together. Format is: [<start_system>:]<start_event>,<start_match>[,<start_pid>]/ [<end_system>:]<end_event>,<end_match>[,<flags>]
The
start_system:start_event (start_system is optional), is the
event that
starts the timing.
start_match is
the field in the start event that is to match with the
end_match in the end event.
start_pid is
optional, as matches are attached to the tasks that run
the events, if another field should be used to find that
task, then
it is specified with start_pid.
end_system:end_event
is the event that ends the timing (end_system is
optional).
end_match is
the field in end_match that wil match the start event field
start_match.
flags are optional and can be the following (case insensitive):
p : The two
events are pinned to the same CPU (start and end happen
on the same CPU always).
s : The event
should have a stack traced with it (enable stack tracing
for the start event).
g : The event
is global (not associated to a task). start_pid is
not applicable with this flag.
--stderr
Redirect the output to stderr. The output of the command being executed is not changed. This allows watching the command execute and saving the output of the profile to another file.
--verbose[=level]
Set the log level. Supported log levels are "none", "critical", "error", "warning", "info", "debug", "all" or their identifiers "0", "1", "2", "3", "4", "5", "6". Setting the log level to specific value enables all logs from that and all previous levels. The level will default to "info" if one is not specified.
Example: enable all critical, error and warning logs
trace-cmd profile --verbose=warning
EXAMPLES
---
# trace-cmd profile -F sleep 1
[..]
task: sleep-1121
Event: sched_switch:R (2) Total: 234559 Avg: 117279 Max:
129886 Min:104673
|
+ ftrace_raw_event_sched_switch (0xffffffff8109f310)
100% (2) time:234559 max:129886 min:104673 avg:117279
__schedule (0xffffffff816c1e81)
preempt_schedule (0xffffffff816c236e)
___preempt_schedule (0xffffffff81351a59)
|
+ unmap_single_vma (0xffffffff81198c05)
| 55% (1) time:129886 max:129886 min:0 avg:129886
| stop_one_cpu (0xffffffff8110909a)
| sched_exec (0xffffffff810a119b)
| do_execveat_common.isra.31 (0xffffffff811de528)
| do_execve (0xffffffff811dea8c)
| SyS_execve (0xffffffff811ded1e)
| return_to_handler (0xffffffff816c8458)
| stub_execve (0xffffffff816c6929)
|
+ unmap_single_vma (0xffffffff81198c05)
45% (1) time:104673 max:104673 min:0 avg:104673
unmap_vmas (0xffffffff81199174)
exit_mmap (0xffffffff811a1f5b)
mmput (0xffffffff8107699a)
flush_old_exec (0xffffffff811ddb75)
load_elf_binary (0xffffffff812287df)
search_binary_handler (0xffffffff811dd3e0)
do_execveat_common.isra.31 (0xffffffff811de8bd)
do_execve (0xffffffff811dea8c)
SyS_execve (0xffffffff811ded1e)
return_to_handler (0xffffffff816c8458)
stub_execve (0xffffffff816c6929)
Event:
sched_switch:S (1) Total: 1000513242 Avg: 1000513242 Max:
1000513242 Min:1000513242
|
+ ftrace_raw_event_sched_switch (0xffffffff8109f310)
100% (1) time:1000513242 max:1000513242 min:0 avg:1000513242
__schedule (0xffffffff816c1e81)
schedule (0xffffffff816c23b9)
do_nanosleep (0xffffffff816c4f1c)
hrtimer_nanosleep (0xffffffff810dcd86)
SyS_nanosleep (0xffffffff810dcea6)
return_to_handler (0xffffffff816c8458)
tracesys_phase2 (0xffffffff816c65b0)
Event:
sched_wakeup:1121 (1) Total: 43405 Avg: 43405 Max: 43405
Min:43405
|
+ ftrace_raw_event_sched_wakeup_template
(0xffffffff8109d960)
100% (1) time:43405 max:43405 min:0 avg:43405
ttwu_do_wakeup (0xffffffff810a01a2)
ttwu_do_activate.constprop.122 (0xffffffff810a0236)
try_to_wake_up (0xffffffff810a3ec3)
wake_up_process (0xffffffff810a4057)
hrtimer_wakeup (0xffffffff810db772)
__run_hrtimer (0xffffffff810dbd91)
hrtimer_interrupt (0xffffffff810dc6b7)
local_apic_timer_interrupt (0xffffffff810363e7)
smp_trace_apic_timer_interrupt (0xffffffff816c8c6a)
trace_apic_timer_interrupt (0xffffffff816c725a)
finish_task_switch (0xffffffff8109c3a4)
__schedule (0xffffffff816c1e01)
schedule (0xffffffff816c23b9)
ring_buffer_wait (0xffffffff811323a3)
wait_on_pipe (0xffffffff81133d93)
tracing_buffers_splice_read (0xffffffff811350b0)
do_splice_to (0xffffffff8120476f)
SyS_splice (0xffffffff81206c1f)
tracesys_phase2 (0xffffffff816c65b0)
Event: func:
sys_nanosleep() (1) Total: 1000598016 Avg: 1000598016 Max:
1000598016 Min:1000598016
Event: func: sys_munmap() (1) Total: 14300 Avg: 14300 Max:
14300 Min:14300
Event: func: sys_arch_prctl() (1) Total: 571 Avg: 571 Max:
571 Min:571
Event: func: sys_mprotect() (4) Total: 14382 Avg: 3595 Max:
7196 Min:2190
Event: func: SyS_read() (1) Total: 2640 Avg: 2640 Max: 2640
Min:2640
Event: func: sys_close() (5) Total: 4001 Avg: 800 Max: 1252
Min:414
Event: func: sys_newfstat() (3) Total: 11684 Avg: 3894 Max:
10206 Min:636
Event: func: SyS_open() (3) Total: 23615 Avg: 7871 Max:
10535 Min:4743
Event: func: sys_access() (1) Total: 5924 Avg: 5924 Max:
5924 Min:5924
Event: func: SyS_mmap() (8) Total: 39153 Avg: 4894 Max:
12354 Min:1518
Event: func: smp_trace_apic_timer_interrupt() (1) Total:
10298 Avg: 10298 Max: 10298 Min:10298
Event: func: SyS_brk() (4) Total: 2407 Avg: 601 Max: 1564
Min:206
Event: func: do_notify_resume() (2) Total: 4095 Avg: 2047
Max: 2521 Min:1574
Event: func: sys_execve() (5) Total: 1625251 Avg: 325050
Max: 1605698 Min:3570
|
+ ftrace_raw_event_sched_wakeup_template
(0xffffffff8109d960)
100% (1) time:1605698 max:1605698 min:0 avg:1605698
ttwu_do_wakeup (0xffffffff810a01a2)
ttwu_do_activate.constprop.122 (0xffffffff810a0236)
try_to_wake_up (0xffffffff810a3ec3)
wake_up_process (0xffffffff810a4057)
cpu_stop_queue_work (0xffffffff81108df8)
stop_one_cpu (0xffffffff8110909a)
sched_exec (0xffffffff810a119b)
do_execveat_common.isra.31 (0xffffffff811de528)
do_execve (0xffffffff811dea8c)
SyS_execve (0xffffffff811ded1e)
return_to_handler (0xffffffff816c8458)
stub_execve (0xffffffff816c6929)
stub_execve (0xffffffff816c6929)
Event: func:
syscall_trace_enter_phase2() (38) Total: 21544 Avg: 566 Max:
1066 Min:329
Event: func: syscall_trace_enter_phase1() (38) Total: 9202
Avg: 242 Max: 376 Min:150
Event: func: __do_page_fault() (53) Total: 257672 Avg: 4861
Max: 27745 Min:458
|
+ ftrace_raw_event_sched_wakeup_template
(0xffffffff8109d960)
100% (1) time:27745 max:27745 min:0 avg:27745
ttwu_do_wakeup (0xffffffff810a01a2)
ttwu_do_activate.constprop.122 (0xffffffff810a0236)
try_to_wake_up (0xffffffff810a3ec3)
default_wake_function (0xffffffff810a4002)
autoremove_wake_function (0xffffffff810b50fd)
__wake_up_common (0xffffffff810b4958)
__wake_up (0xffffffff810b4cb8)
rb_wake_up_waiters (0xffffffff8112f126)
irq_work_run_list (0xffffffff81157d0f)
irq_work_run (0xffffffff81157d5e)
smp_trace_irq_work_interrupt (0xffffffff810082fc)
trace_irq_work_interrupt (0xffffffff816c7aaa)
return_to_handler (0xffffffff816c8458)
trace_do_page_fault (0xffffffff810478b2)
trace_page_fault (0xffffffff816c7dd2)
Event: func:
syscall_trace_leave() (38) Total: 26145 Avg: 688 Max: 1264
Min:381
Event: func: __sb_end_write() (1) Total: 373 Avg: 373 Max:
373 Min:373
Event: func: fsnotify() (1) Total: 598 Avg: 598 Max: 598
Min:598
Event: func: __fsnotify_parent() (1) Total: 286 Avg: 286
Max: 286 Min:286
Event: func: mutex_unlock() (2) Total: 39636 Avg: 19818 Max:
39413 Min:223
Event: func: smp_trace_irq_work_interrupt() (6) Total:
236459 Avg: 39409 Max: 100671 Min:634
|
+ ftrace_raw_event_sched_wakeup_template
(0xffffffff8109d960)
100% (4) time:234348 max:100671 min:38745 avg:58587
ttwu_do_wakeup (0xffffffff810a01a2)
ttwu_do_activate.constprop.122 (0xffffffff810a0236)
try_to_wake_up (0xffffffff810a3ec3)
default_wake_function (0xffffffff810a4002)
autoremove_wake_function (0xffffffff810b50fd)
__wake_up_common (0xffffffff810b4958)
__wake_up (0xffffffff810b4cb8)
rb_wake_up_waiters (0xffffffff8112f126)
irq_work_run_list (0xffffffff81157d0f)
irq_work_run (0xffffffff81157d5e)
smp_trace_irq_work_interrupt (0xffffffff810082fc)
return_to_handler (0xffffffff816c8458)
trace_irq_work_interrupt (0xffffffff816c7aaa)
|
+ ftrace_return_to_handler (0xffffffff81140840)
| 84% (3) time:197396 max:100671 min:38745 avg:65798
| return_to_handler (0xffffffff816c846d)
| trace_page_fault (0xffffffff816c7dd2)
|
+ ftrace_return_to_handler (0xffffffff81140840)
16% (1) time:36952 max:36952 min:0 avg:36952
ftrace_graph_caller (0xffffffff816c8428)
mutex_unlock (0xffffffff816c3f75)
rb_simple_write (0xffffffff81133142)
vfs_write (0xffffffff811d7727)
SyS_write (0xffffffff811d7acf)
tracesys_phase2 (0xffffffff816c65b0)
Event:
sys_enter:35 (1) Total: 1000599765 Avg: 1000599765 Max:
1000599765 Min:1000599765
Event: sys_enter:11 (1) Total: 55025 Avg: 55025 Max: 55025
Min:55025
Event: sys_enter:158 (1) Total: 1584 Avg: 1584 Max: 1584
Min:1584
Event: sys_enter:10 (4) Total: 18359 Avg: 4589 Max: 8764
Min:2933
Event: sys_enter:0 (1) Total: 4223 Avg: 4223 Max: 4223
Min:4223
Event: sys_enter:3 (5) Total: 9948 Avg: 1989 Max: 2606
Min:1203
Event: sys_enter:5 (3) Total: 15530 Avg: 5176 Max: 11840
Min:1405
Event: sys_enter:2 (3) Total: 28002 Avg: 9334 Max: 12035
Min:5656
Event: sys_enter:21 (1) Total: 7814 Avg: 7814 Max: 7814
Min:7814
Event: sys_enter:9 (8) Total: 49583 Avg: 6197 Max: 14137
Min:2362
Event: sys_enter:12 (4) Total: 108493 Avg: 27123 Max: 104079
Min:922
Event: sys_enter:59 (5) Total: 1631608 Avg: 326321 Max:
1607529 Min:4563
Event: page_fault_user:0x398d86b630 (1)
Event: page_fault_user:0x398d844de0 (1)
Event: page_fault_user:0x398d8d9020 (1)
Event: page_fault_user:0x1d37008 (1)
Event: page_fault_user:0x7f0b89e91074 (1)
Event: page_fault_user:0x7f0b89d98ed0 (1)
Event: page_fault_user:0x7f0b89ec8950 (1)
Event: page_fault_user:0x7f0b89d83644 (1)
Event: page_fault_user:0x7f0b89d622a8 (1)
Event: page_fault_user:0x7f0b89d5a560 (1)
Event: page_fault_user:0x7f0b89d34010 (1)
Event: page_fault_user:0x1d36008 (1)
Event: page_fault_user:0x398d900510 (1)
Event: page_fault_user:0x398dbb3ae8 (1)
Event: page_fault_user:0x398d87f490 (1)
Event: page_fault_user:0x398d8eb660 (1)
Event: page_fault_user:0x398d8bd730 (1)
Event: page_fault_user:0x398d9625d9 (1)
Event: page_fault_user:0x398d931810 (1)
Event: page_fault_user:0x398dbb7114 (1)
Event: page_fault_user:0x398d837610 (1)
Event: page_fault_user:0x398d89e860 (1)
Event: page_fault_user:0x398d8f23b0 (1)
Event: page_fault_user:0x398dbb4510 (1)
Event: page_fault_user:0x398dbad6f0 (1)
Event: page_fault_user:0x398dbb1018 (1)
Event: page_fault_user:0x398d977b37 (1)
Event: page_fault_user:0x398d92eb60 (1)
Event: page_fault_user:0x398d8abff0 (1)
Event: page_fault_user:0x398dbb0d30 (1)
Event: page_fault_user:0x398dbb6c24 (1)
Event: page_fault_user:0x398d821c50 (1)
Event: page_fault_user:0x398dbb6c20 (1)
Event: page_fault_user:0x398d886350 (1)
Event: page_fault_user:0x7f0b90125000 (1)
Event: page_fault_user:0x7f0b90124740 (1)
Event: page_fault_user:0x7f0b90126000 (1)
Event: page_fault_user:0x398d816230 (1)
Event: page_fault_user:0x398d8002b8 (1)
Event: page_fault_user:0x398dbb0b40 (1)
Event: page_fault_user:0x398dbb2880 (1)
Event: page_fault_user:0x7f0b90141cc6 (1)
Event: page_fault_user:0x7f0b9013b85c (1)
Event: page_fault_user:0x7f0b90127000 (1)
Event: page_fault_user:0x606e70 (1)
Event: page_fault_user:0x7f0b90144010 (1)
Event: page_fault_user:0x7fffcb31b038 (1)
Event: page_fault_user:0x606da8 (1)
Event: page_fault_user:0x400040 (1)
Event: page_fault_user:0x398d222218 (1)
Event: page_fault_user:0x398d015120 (1)
Event: page_fault_user:0x398d220ce8 (1)
Event: page_fault_user:0x398d220b80 (1)
Event: page_fault_user:0x7fffcb2fcff8 (1)
Event: page_fault_user:0x398d001590 (1)
Event: page_fault_user:0x398d838490 (1)
Event: softirq_raise:RCU (3) Total: 252931 Avg: 84310 Max:
243288 Min:4639
Event: softirq_raise:SCHED (2) Total: 241249 Avg: 120624
Max: 239076 Min:2173
|
+ ftrace_raw_event_sched_wakeup_template
(0xffffffff8109d960)
100% (1) time:239076 max:239076 min:0 avg:239076
ttwu_do_wakeup (0xffffffff810a01a2)
ttwu_do_activate.constprop.122 (0xffffffff810a0236)
try_to_wake_up (0xffffffff810a3ec3)
default_wake_function (0xffffffff810a4002)
autoremove_wake_function (0xffffffff810b50fd)
__wake_up_common (0xffffffff810b4958)
__wake_up (0xffffffff810b4cb8)
rb_wake_up_waiters (0xffffffff8112f126)
irq_work_run_list (0xffffffff81157d0f)
irq_work_run (0xffffffff81157d5e)
smp_trace_irq_work_interrupt (0xffffffff810082fc)
trace_irq_work_interrupt (0xffffffff816c7aaa)
irq_exit (0xffffffff8107dd66)
smp_trace_apic_timer_interrupt (0xffffffff816c8c7a)
trace_apic_timer_interrupt (0xffffffff816c725a)
prepare_ftrace_return (0xffffffff8103d4fd)
ftrace_graph_caller (0xffffffff816c8428)
mem_cgroup_begin_page_stat (0xffffffff811cfd25)
page_remove_rmap (0xffffffff811a4fc5)
stub_execve (0xffffffff816c6929)
unmap_single_vma (0xffffffff81198b1c)
unmap_vmas (0xffffffff81199174)
exit_mmap (0xffffffff811a1f5b)
mmput (0xffffffff8107699a)
flush_old_exec (0xffffffff811ddb75)
load_elf_binary (0xffffffff812287df)
search_binary_handler (0xffffffff811dd3e0)
do_execveat_common.isra.31 (0xffffffff811de8bd)
do_execve (0xffffffff811dea8c)
SyS_execve (0xffffffff811ded1e)
return_to_handler (0xffffffff816c8458)
Event:
softirq_raise:HI (3) Total: 72472 Avg: 24157 Max: 64186
Min:3430
Event: softirq_entry:RCU (2) Total: 3191 Avg: 1595 Max: 1788
Min:1403
|
+ ftrace_raw_event_sched_wakeup_template
(0xffffffff8109d960)
100% (1) time:1788 max:1788 min:0 avg:1788
ttwu_do_wakeup (0xffffffff810a01a2)
ttwu_do_activate.constprop.122 (0xffffffff810a0236)
try_to_wake_up (0xffffffff810a3ec3)
default_wake_function (0xffffffff810a4002)
autoremove_wake_function (0xffffffff810b50fd)
__wake_up_common (0xffffffff810b4958)
__wake_up (0xffffffff810b4cb8)
rb_wake_up_waiters (0xffffffff8112f126)
irq_work_run_list (0xffffffff81157d0f)
irq_work_run (0xffffffff81157d5e)
smp_trace_irq_work_interrupt (0xffffffff810082fc)
trace_irq_work_interrupt (0xffffffff816c7aaa)
irq_work_queue (0xffffffff81157e95)
ring_buffer_unlock_commit (0xffffffff8113039f)
__buffer_unlock_commit (0xffffffff811367d5)
trace_buffer_unlock_commit (0xffffffff811376a2)
ftrace_event_buffer_commit (0xffffffff81146d5f)
ftrace_raw_event_sched_process_exec (0xffffffff8109c511)
do_execveat_common.isra.31 (0xffffffff811de9a3)
do_execve (0xffffffff811dea8c)
SyS_execve (0xffffffff811ded1e)
return_to_handler (0xffffffff816c8458)
stub_execve (0xffffffff816c6929)
Event:
softirq_entry:SCHED (2) Total: 2289 Avg: 1144 Max: 1350
Min:939
Event: softirq_entry:HI (3) Total: 180146 Avg: 60048 Max:
178969 Min:499
|
+ ftrace_raw_event_sched_wakeup_template
(0xffffffff8109d960)
100% (1) time:178969 max:178969 min:0 avg:178969
ttwu_do_wakeup (0xffffffff810a01a2)
ttwu_do_activate.constprop.122 (0xffffffff810a0236)
try_to_wake_up (0xffffffff810a3ec3)
wake_up_process (0xffffffff810a4057)
wake_up_worker (0xffffffff8108de74)
insert_work (0xffffffff8108fca6)
__queue_work (0xffffffff8108fe12)
delayed_work_timer_fn (0xffffffff81090088)
call_timer_fn (0xffffffff810d8f89)
run_timer_softirq (0xffffffff810da8a1)
__do_softirq (0xffffffff8107d8fa)
irq_exit (0xffffffff8107dd66)
smp_trace_apic_timer_interrupt (0xffffffff816c8c7a)
trace_apic_timer_interrupt (0xffffffff816c725a)
prepare_ftrace_return (0xffffffff8103d4fd)
ftrace_graph_caller (0xffffffff816c8428)
mem_cgroup_begin_page_stat (0xffffffff811cfd25)
page_remove_rmap (0xffffffff811a4fc5)
stub_execve (0xffffffff816c6929)
unmap_single_vma (0xffffffff81198b1c)
unmap_vmas (0xffffffff81199174)
exit_mmap (0xffffffff811a1f5b)
mmput (0xffffffff8107699a)
flush_old_exec (0xffffffff811ddb75)
load_elf_binary (0xffffffff812287df)
search_binary_handler (0xffffffff811dd3e0)
do_execveat_common.isra.31 (0xffffffff811de8bd)
do_execve (0xffffffff811dea8c)
SyS_execve (0xffffffff811ded1e)
return_to_handler (0xffffffff816c8458)
---
The above uses -F to follow the sleep task. It filters only on events that pertain to sleep. Note, in order to follow forks, you need to also include the -c flag.
Other tasks will appear in the profile as well if events reference more than one task (like sched_switch and sched_wakeup do. The "prev_pid" and "next_pid" of sched_switch, and the "common_pid" and "pid" of sched_wakeup).
Stack traces are attached to events that are related to them.
Taking a look at the above output:
Event: sched_switch:R (2) Total: 234559 Avg: 117279 Max: 129886 Min:104673
This shows that task was preempted (it’s in the running R state). It was preempted twice (2) for a total of 234,559 nanoseconds, with a average preempt time of 117,279 ns, and maximum of 128,886 ns and minimum of 104,673 ns.
The tree shows where it was preempted:
|
+ ftrace_raw_event_sched_switch (0xffffffff8109f310)
100% (2) time:234559 max:129886 min:104673 avg:117279
__schedule (0xffffffff816c1e81)
preempt_schedule (0xffffffff816c236e)
___preempt_schedule (0xffffffff81351a59)
|
+ unmap_single_vma (0xffffffff81198c05)
| 55% (1) time:129886 max:129886 min:0 avg:129886
| stop_one_cpu (0xffffffff8110909a)
| sched_exec (0xffffffff810a119b)
| do_execveat_common.isra.31 (0xffffffff811de528)
| do_execve (0xffffffff811dea8c)
| SyS_execve (0xffffffff811ded1e)
| return_to_handler (0xffffffff816c8458)
| stub_execve (0xffffffff816c6929)
|
+ unmap_single_vma (0xffffffff81198c05)
45% (1) time:104673 max:104673 min:0 avg:104673
unmap_vmas (0xffffffff81199174)
exit_mmap (0xffffffff811a1f5b)
mmput (0xffffffff8107699a)
flush_old_exec (0xffffffff811ddb75)
load_elf_binary (0xffffffff812287df)
search_binary_handler (0xffffffff811dd3e0)
do_execveat_common.isra.31 (0xffffffff811de8bd)
do_execve (0xffffffff811dea8c)
SyS_execve (0xffffffff811ded1e)
return_to_handler (0xffffffff816c8458)
stub_execve (0xffffffff816c6929)
Event: sched_switch:S (1) Total: 1000513242 Avg: 1000513242 Max: 1000513242 Min:10005132
This shows that the task was scheduled out in the INTERRUPTIBLE state once for a total of 1,000,513,242 ns (~1s), which makes sense as the task was a "sleep 1".
After the schedule events, the function events are shown. By default the profiler will use the function graph tracer if the depth setting is supported by the kernel. It will set the depth to one which will only trace the first function that enters the kernel. It will also record the amount of time it was in the kernel.
Event: func:
sys_nanosleep() (1) Total: 1000598016 Avg: 1000598016 Max:
1000598016 Min:1000598016
Event: func: sys_munmap() (1) Total: 14300 Avg: 14300 Max:
14300 Min:14300
Event: func: sys_arch_prctl() (1) Total: 571 Avg: 571 Max:
571 Min:571
Event: func: sys_mprotect() (4) Total: 14382 Avg: 3595 Max:
7196 Min:2190
Event: func: SyS_read() (1) Total: 2640 Avg: 2640 Max: 2640
Min:2640
Event: func: sys_close() (5) Total: 4001 Avg: 800 Max: 1252
Min:414
Event: func: sys_newfstat() (3) Total: 11684 Avg: 3894 Max:
10206 Min:636
Event: func: SyS_open() (3) Total: 23615 Avg: 7871 Max:
10535 Min:4743
Event: func: sys_access() (1) Total: 5924 Avg: 5924 Max:
5924 Min:5924
Event: func: SyS_mmap() (8) Total: 39153 Avg: 4894 Max:
12354 Min:1518
Event: func: smp_trace_apic_timer_interrupt() (1) Total:
10298 Avg: 10298 Max: 10298 Min:10298
Event: func: SyS_brk() (4) Total: 2407 Avg: 601 Max: 1564
Min:206
Event: func: do_notify_resume() (2) Total: 4095 Avg: 2047
Max: 2521 Min:1574
Event: func: sys_execve() (5) Total: 1625251 Avg: 325050
Max: 1605698 Min:3570
Count of times the event was hit is always in parenthesis (5).
The function graph trace may produce too much overhead as it is still triggering (just not tracing) on all functions. To limit functions just to system calls (not interrupts), add the following option:
-l 'sys_*' -l 'SyS_*'
To disable function graph tracing totally, use:
-p nop
To use function tracing instead (note, this will not record timings, but just the count of times a function is hit):
-p function
Following the functions are the events that are recorded.
Event:
sys_enter:35 (1) Total: 1000599765 Avg: 1000599765 Max:
1000599765 Min:1000599765
Event: sys_enter:11 (1) Total: 55025 Avg: 55025 Max: 55025
Min:55025
Event: sys_enter:158 (1) Total: 1584 Avg: 1584 Max: 1584
Min:1584
Event: sys_enter:10 (4) Total: 18359 Avg: 4589 Max: 8764
Min:2933
Event: sys_enter:0 (1) Total: 4223 Avg: 4223 Max: 4223
Min:4223
Event: sys_enter:3 (5) Total: 9948 Avg: 1989 Max: 2606
Min:1203
Event: sys_enter:5 (3) Total: 15530 Avg: 5176 Max: 11840
Min:1405
Event: sys_enter:2 (3) Total: 28002 Avg: 9334 Max: 12035
Min:5656
Event: sys_enter:21 (1) Total: 7814 Avg: 7814 Max: 7814
Min:7814
Event: sys_enter:9 (8) Total: 49583 Avg: 6197 Max: 14137
Min:2362
Event: sys_enter:12 (4) Total: 108493 Avg: 27123 Max: 104079
Min:922
Event: sys_enter:59 (5) Total: 1631608 Avg: 326321 Max:
1607529 Min:4563
These are the raw system call events, with the raw system call ID after the "sys_enter:" For example, "59" is execve(2). Why did it execute 5 times? Looking at a strace of this run, we can see:
execve("/usr/lib64/ccache/sleep",
["sleep", "1"], [/* 27 vars */]
<unfinished ...>
<... execve resumed> ) = -1 ENOENT (No such file or
directory)
execve("/usr/local/sbin/sleep",
["sleep", "1"], [/* 27 vars */]
<unfinished ...>
<... execve resumed> ) = -1 ENOENT (No such file or
directory)
execve("/usr/local/bin/sleep", ["sleep",
"1"], [/* 27 vars */] <unfinished ...>
<... execve resumed> ) = -1 ENOENT (No such file or
directory)
execve("/usr/sbin/sleep", ["sleep",
"1"], [/* 27 vars */] <unfinished ...>
<... execve resumed> ) = -1 ENOENT (No such file or
directory)
execve("/usr/bin/sleep", ["sleep",
"1"], [/* 27 vars */] <unfinished ...>
<... execve resumed> ) = 0
It attempted to execve the "sleep" command for each path in $PATH until it found one.
The page_fault_user events show what userspace address took a page fault.
Event:
softirq_raise:RCU (3) Total: 252931 Avg: 84310 Max: 243288
Min:4639
Event: softirq_raise:SCHED (2) Total: 241249 Avg: 120624
Max: 239076 Min:2173
|
+ ftrace_raw_event_sched_wakeup_template
(0xffffffff8109d960)
100% (1) time:239076 max:239076 min:0 avg:239076
ttwu_do_wakeup (0xffffffff810a01a2)
ttwu_do_activate.constprop.122 (0xffffffff810a0236)
try_to_wake_up (0xffffffff810a3ec3)
default_wake_function (0xffffffff810a4002)
autoremove_wake_function (0xffffffff810b50fd)
__wake_up_common (0xffffffff810b4958)
__wake_up (0xffffffff810b4cb8)
rb_wake_up_waiters (0xffffffff8112f126)
irq_work_run_list (0xffffffff81157d0f)
irq_work_run (0xffffffff81157d5e)
smp_trace_irq_work_interrupt (0xffffffff810082fc)
trace_irq_work_interrupt (0xffffffff816c7aaa)
irq_exit (0xffffffff8107dd66)
The timings for the softirq_raise events measure the time it took from the raised softirq to the time it executed.
The timings for the softirq_entry events measure the time the softirq took to execute.
The stack traces for the softirqs (and possibly other events) are used when an event has a stack attached to it. This can happen if the profile ran more stacks than just the sched events, or when events are dropped and stacks
To have full control of what gets traced, use the -S option that will have trace-cmd not enable any events or the function_graph tracer. Only the events listed on the command line are shown.
If only the time of kmalloc is needed to be seen, and where it was recorded, using the -S option and enabling function_graph and stack tracing for just the function needed will give the profile of only that function.
---
# trace-cmd profile -S -p function_graph -l '*kmalloc*' -l
'*kmalloc*:stacktrace' sleep 1
task: sshd-11786
Event: func: __kmalloc_reserve.isra.59() (2) Total: 149684
Avg: 74842 Max: 75598 Min:74086
|
+ __alloc_skb (0xffffffff815a8917)
| 67% (2) time:149684 max:75598 min:74086 avg:74842
| __kmalloc_node_track_caller (0xffffffff811c6635)
| __kmalloc_reserve.isra.59 (0xffffffff815a84ac)
| return_to_handler (0xffffffff816c8458)
| sk_stream_alloc_skb (0xffffffff81604ea1)
| tcp_sendmsg (0xffffffff8160592c)
| inet_sendmsg (0xffffffff8162fed1)
| sock_aio_write (0xffffffff8159f9fc)
| do_sync_write (0xffffffff811d694a)
| vfs_write (0xffffffff811d7825)
| SyS_write (0xffffffff811d7adf)
| system_call_fastpath (0xffffffff816c63d2)
|
+ __alloc_skb (0xffffffff815a8917)
33% (1) time:74086 max:74086 min:74086 avg:74086
__alloc_skb (0xffffffff815a8917)
sk_stream_alloc_skb (0xffffffff81604ea1)
tcp_sendmsg (0xffffffff8160592c)
inet_sendmsg (0xffffffff8162fed1)
sock_aio_write (0xffffffff8159f9fc)
do_sync_write (0xffffffff811d694a)
vfs_write (0xffffffff811d7825)
SyS_write (0xffffffff811d7adf)
system_call_fastpath (0xffffffff816c63d2)
[..]
---
To watch the command run but save the output of the profile to a file use --stderr, and redirect stderr to a file
# trace-cmd profile --stderr cyclictest -p 80 -n -t1 2> profile.out
Or simple use -o
# trace-cmd profile -o profile.out cyclictest -p 80 -n -t1
SEE ALSO
trace-cmd(1), trace-cmd-record(1), trace-cmd-report(1), trace-cmd-start(1), trace-cmd-stop(1), trace-cmd-reset(1), trace-cmd-split(1), trace-cmd-list(1), trace-cmd-listen(1)
AUTHOR
Written by Steven Rostedt, <rostedt [AT] goodmis.org [1] >
RESOURCES
https://git.kernel.org/pub/scm/utils/trace-cmd/trace-cmd.git/
COPYING
Copyright (C) 2014 Red Hat, Inc. Free use of this software is granted under the terms of the GNU Public License (GPL).
NOTES
1. |
mailto:rostedt [AT] goodmis.org