| e1fa8594 | 29-Aug-2025 |
Potin Lai <potin.lai@quantatw.com> |
Introduce ignoreFailIfHostOff config setting
Some sensors only provide valid readings when the host is powered on. This change introduces the `ignoreFailIfHostOff` configuration option to differentiate between unavailable readings based on host state.
- Host OFF: Sensor unavailable is acceptable and will not be treated as a failure.
- Host ON (running): Sensor unavailable is unexpected, and the failsafe mode will be triggered.
This ensures that sensors dependent on host power state are handled correctly without causing unnecessary failsafe triggers during host-off.
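The host-state rule described above can be sketched as a small decision function. This is only an illustration of the behavior stated in the commit message, not the project's actual code: the names `SensorConfig` and `should_trigger_failsafe` are hypothetical, and the default of `False` for the option is an assumption (the message only shows that some sensors omit it).

```python
from dataclasses import dataclass


@dataclass
class SensorConfig:
    """Hypothetical per-sensor settings, mirroring the JSON keys in the message."""
    name: str
    # Mirrors "ignoreFailIfHostOff"; a default of False is an assumption.
    ignore_fail_if_host_off: bool = False


def should_trigger_failsafe(cfg: SensorConfig, reading_available: bool,
                            host_on: bool) -> bool:
    """Return True if this sensor's state should push the zone into failsafe."""
    if reading_available:
        # A valid reading is never a failure.
        return False
    if cfg.ignore_fail_if_host_off and not host_on:
        # Host OFF: an unavailable reading is acceptable for this sensor.
        return False
    # Host ON, or the option is not set: unavailable counts as a failure.
    return True


ssd = SensorConfig("HDDBOARD_SSD0_TEMP_C", ignore_fail_if_host_off=True)
hsc = SensorConfig("PDB_P48V_HSC1_TEMP_C")
print(should_trigger_failsafe(ssd, reading_available=False, host_on=False))
print(should_trigger_failsafe(ssd, reading_available=False, host_on=True))
print(should_trigger_failsafe(hsc, reading_available=False, host_on=False))
```

In this sketch a sensor without the option keeps the existing behavior, so an unavailable reading still counts as a failure regardless of host state.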
Tested on Catalina: failsafe mode is not triggered when the host is off.

- config.json example: add `ignoreFailIfHostOff` for sensors only available when the host is on.
```
{
  "sensors": [
    ......
    {
      "name": "HDDBOARD_SSD0_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/HDDBOARD_SSD0_TEMP_C",
      "timeout": 0,
      "ignoreFailIfHostOff": true,
      "ignoreDbusMinMax": true
    },
    {
      "name": "HDDBOARD_SSD2_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/HDDBOARD_SSD2_TEMP_C",
      "timeout": 0,
      "ignoreFailIfHostOff": true,
      "ignoreDbusMinMax": true
    },
    {
      "name": "PDB_P48V_HSC1_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/PDB_P48V_HSC1_TEMP_C",
      "timeout": 0,
      "ignoreDbusMinMax": true
    },
    {
      "name": "PDB_VR_P12V_N1_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/PDB_VR_P12V_N1_TEMP_C",
      "timeout": 0,
      "ignoreFailIfHostOff": true,
      "ignoreDbusMinMax": true
    },
    {
      "name": "PDB_VR_P12V_N2_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/PDB_VR_P12V_N2_TEMP_C",
      "timeout": 0,
      "ignoreFailIfHostOff": true,
      "ignoreDbusMinMax": true
    },
    {
      "name": "IOB0_NIC0_OSFP_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/IOB0_NIC0_OSFP_TEMP_C",
      "timeout": 0,
      "ignoreFailIfHostOff": true,
      "ignoreDbusMinMax": true
    },
    {
      "name": "IOB0_NIC1_OSFP_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/IOB0_NIC1_OSFP_TEMP_C",
      "timeout": 0,
      "ignoreFailIfHostOff": true,
      "ignoreDbusMinMax": true
    },
    {
      "name": "IOB1_NIC0_OSFP_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/IOB1_NIC0_OSFP_TEMP_C",
      "timeout": 0,
      "ignoreFailIfHostOff": true,
      "ignoreDbusMinMax": true
    },
    {
      "name": "IOB1_NIC1_OSFP_TEMP_C",
      "type": "temp",
      "readPath": "/xyz/openbmc_project/sensors/temperature/IOB1_NIC1_OSFP_TEMP_C",
      "timeout": 0,
      "ignoreFailIfHostOff": true,
      "ignoreDbusMinMax": true
    },
    ......
  ],
  "zones": [
    ......
  ]
}
```
Change-Id: I5355c453ca3c6d918c197dcd2cc9119e471d615d
Signed-off-by: Potin Lai <potin.lai@quantatw.com>
|
| 8f73ad76 | 06-Oct-2021 |
Alex.Song <zheng.song@intel.com> |
Make specific UNA sensors not trigger failsafe
By convention, sensors in states such as 'not present' or 'power state not matching' are marked as 'unavailable' on D-Bus. In such states, some specific sensors should not be considered failed and should not trigger the PID 'failsafe'.
A typical example: when a system is powered off, its CPU/DIMM temp sensors are 'unavailable'; these sensors should not be treated as 'failed' and should not trigger the PID 'failsafe'. This is necessary for systems whose fans keep running when the CPU is off.
This feature is configurable per sensor (valid for thermal sensors). It can be enabled by setting the PID controller option "InputUnavailableAsFailed" to 'false' when configuring the PID module via entity-manager, or by setting the sensor option "unavailableAsFailed" to 'false' when configuring the PID module via JSON. (These options are optional and default to 'true'.)
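As a rough illustration of how the option interacts with sensor state, the sketch below encodes the rules stated in this message: the option defaults to true, applies to thermal sensors only, and never masks a genuinely failed sensor. The names `SensorState` and `counts_as_failed`, and the use of the strings "temp" and "fan" for sensor types, are hypothetical choices for this example, not the project's API.

```python
from enum import Enum


class SensorState(Enum):
    """Hypothetical sensor states used only for this sketch."""
    OK = "ok"
    UNAVAILABLE = "unavailable"
    FAILED = "failed"


def counts_as_failed(state: SensorState, sensor_type: str,
                     unavailable_as_failed: bool = True) -> bool:
    """Return True if the sensor should be treated as failed (failsafe input)."""
    if state is SensorState.FAILED:
        # Failed sensors always count, whatever the option says.
        return True
    if state is SensorState.UNAVAILABLE:
        # The option is honoured only for thermal sensors; an unavailable
        # fan still counts as failed.
        if sensor_type == "temp" and not unavailable_as_failed:
            return False
        return True
    return False


print(counts_as_failed(SensorState.UNAVAILABLE, "temp", unavailable_as_failed=False))
print(counts_as_failed(SensorState.UNAVAILABLE, "fan", unavailable_as_failed=False))
print(counts_as_failed(SensorState.FAILED, "temp", unavailable_as_failed=False))
```

Leaving the option at its default keeps the earlier behavior, where any unavailable sensor is treated as failed.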
Tested:
1. On a fan 'always-on' system, enable this feature on CPU temp sensors and power off the system: 'unavailable' CPU temp sensors do not trigger the failsafe mode.
2. 'Unavailable' fans still trigger the failsafe mode.
3. 'Unfunctional' or 'failed' sensors still trigger the failsafe mode.
Signed-off-by: Zheng Song <zheng.song@intel.com>
Change-Id: I1dd1d76466f43e7dcf51c161c96714f1bcfae88d
|