1======================================
2HNS3 Performance Monitoring Unit (PMU)
3======================================
4
5HNS3(HiSilicon network system 3) Performance Monitoring Unit (PMU) is an
6End Point device to collect performance statistics of HiSilicon SoC NIC.
7On Hip09, each SICL(Super I/O cluster) has one PMU device.
8
9HNS3 PMU supports collection of performance statistics such as bandwidth,
10latency, packet rate and interrupt rate.
11
12Each HNS3 PMU supports 8 hardware events.
13
14HNS3 PMU driver
15===============
16
17The HNS3 PMU driver registers a perf PMU with the name of its sicl id.::
18
19  /sys/devices/hns3_pmu_sicl_<sicl_id>
20
21PMU driver provides description of available events, filter modes, format,
22identifier and cpumask in sysfs.
23
24The "events" directory describes the event code of all supported events
25shown in perf list.
26
27The "filtermode" directory describes the supported filter modes of each
28event.
29
30The "format" directory describes all formats of the config (events) and
31config1 (filter options) fields of the perf_event_attr structure.
32
33The "identifier" file shows version of PMU hardware device.
34
35The "bdf_min" and "bdf_max" files show the supported bdf range of each
36pmu device.
37
38The "hw_clk_freq" file shows the hardware clock frequency of each pmu
39device.
40
41Example usage of checking event code and subevent code::
42
43  $# cat /sys/devices/hns3_pmu_sicl_0/events/dly_tx_normal_to_mac_time
44  config=0x00204
45  $# cat /sys/devices/hns3_pmu_sicl_0/events/dly_tx_normal_to_mac_packet_num
46  config=0x10204
47
48Each performance statistic has a pair of events to get two values to
49calculate real performance data in userspace.
50
51The bits 0~15 of config (here 0x0204) are the true hardware event code. If
52two events have same value of bits 0~15 of config, that means they are
53event pair. And the bit 16 of config indicates getting counter 0 or
54counter 1 of hardware event.
55
56After getting two values of event pair in usersapce, the formula of
57computation to calculate real performance data is:::
58
59  counter 0 / counter 1
60
61Example usage of checking supported filter mode::
62
63  $# cat /sys/devices/hns3_pmu_sicl_0/filtermode/bw_ssu_rpu_byte_num
64  filter mode supported: global/port/port-tc/func/func-queue/
65
66Example usage of perf::
67
68  $# perf list
69  hns3_pmu_sicl_0/bw_ssu_rpu_byte_num/ [kernel PMU event]
70  hns3_pmu_sicl_0/bw_ssu_rpu_time/     [kernel PMU event]
71  ------------------------------------------
72
73  $# perf stat -g -e hns3_pmu_sicl_0/bw_ssu_rpu_byte_num,global=1/ -e hns3_pmu_sicl_0/bw_ssu_rpu_time,global=1/ -I 1000
74  or
75  $# perf stat -g -e hns3_pmu_sicl_0/config=0x00002,global=1/ -e hns3_pmu_sicl_0/config=0x10002,global=1/ -I 1000
76
77
78Filter modes
79--------------
80
811. global mode
82PMU collect performance statistics for all HNS3 PCIe functions of IO DIE.
83Set the "global" filter option to 1 will enable this mode.
84Example usage of perf::
85
86  $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,global=1/ -I 1000
87
882. port mode
89PMU collect performance statistic of one whole physical port. The port id
90is same as mac id. The "tc" filter option must be set to 0xF in this mode,
91here tc stands for traffic class.
92
93Example usage of perf::
94
95  $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,port=0,tc=0xF/ -I 1000
96
973. port-tc mode
98PMU collect performance statistic of one tc of physical port. The port id
99is same as mac id. The "tc" filter option must be set to 0 ~ 7 in this
100mode.
101Example usage of perf::
102
103  $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,port=0,tc=0/ -I 1000
104
1054. func mode
106PMU collect performance statistic of one PF/VF. The function id is BDF of
107PF/VF, its conversion formula::
108
109  func = (bus << 8) + (device << 3) + (function)
110
111for example:
112  BDF         func
113  35:00.0    0x3500
114  35:00.1    0x3501
115  35:01.0    0x3508
116
117In this mode, the "queue" filter option must be set to 0xFFFF.
118Example usage of perf::
119
120  $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,bdf=0x3500,queue=0xFFFF/ -I 1000
121
1225. func-queue mode
123PMU collect performance statistic of one queue of PF/VF. The function id
124is BDF of PF/VF, the "queue" filter option must be set to the exact queue
125id of function.
126Example usage of perf::
127
128  $# perf stat -a -e hns3_pmu_sicl_0/config=0x1020F,bdf=0x3500,queue=0/ -I 1000
129
1306. func-intr mode
131PMU collect performance statistic of one interrupt of PF/VF. The function
132id is BDF of PF/VF, the "intr" filter option must be set to the exact
133interrupt id of function.
134Example usage of perf::
135
136  $# perf stat -a -e hns3_pmu_sicl_0/config=0x00301,bdf=0x3500,intr=0/ -I 1000
137