#
7b92372d |
| 19-Mar-2025 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Still create callout without full loc code
When PELs are created early during a power on reset, the service that expands location codes may not be online yet. Previously, callouts just wouldn'
PEL: Still create callout without full loc code
When PELs are created early during a power on reset, the service that expands location codes may not be online yet. Previously, callouts just wouldn't be created if a location code couldn't be expanded.
Fix that so that a callout is still created with the unexpanded location code, as that is still better than nothing. In those cases, the CC/PN/FN fields would also be empty since those are obtained by finding the FRU inventory path from the location code.
Also create a similar callout in the case where the call to get the inventory path from a location code fails.
This applies for callouts created via: - The PEL message registry - The device path lookup tables - JSON FFDC files
Tested: New unit tests that cover these paths pass.
On a system, after stopping vpd-manager.service, will get callouts like: ``` "Callout Section": { "Callout Count": "4", "Callouts": [{ "FRU Type": "Normal Hardware FRU", "Priority": "Mandatory, replace all with this type as a unit", "Location Code": "P0-C5", "Part Number": "", "CCIN": "", "Serial Number": "" }, { "FRU Type": "Normal Hardware FRU", "Priority": "Mandatory, replace all with this type as a unit", "Location Code": "E0", "Part Number": "", "CCIN": "", "Serial Number": "" }, { "FRU Type": "Normal Hardware FRU", "Priority": "Lowest priority replacement", "Location Code": "P0", "Part Number": "", "CCIN": "", "Serial Number": "" }, { "FRU Type": "Normal Hardware FRU", "Priority": "Lowest priority replacement", "Location Code": "P0-T4", "Part Number": "", "CCIN": "", "Serial Number": "" }] } ```
Change-Id: Ia7e94f9d4848f78ba0b00023438c4db4135fed75 Signed-off-by: Matt Spinler <spinler@us.ibm.com>
show more ...
|
#
e5940634 |
| 22-Nov-2024 |
Patrick Williams <patrick@stwcx.xyz> |
extension: use map in create extension
Adjust the Create extension to use a map for metadata instead of vector.
Tested: Unit tests updated and passing.
Signed-off-by: Patrick Williams <patrick@stw
extension: use map in create extension
Adjust the Create extension to use a map for metadata instead of vector.
Tested: Unit tests updated and passing.
Signed-off-by: Patrick Williams <patrick@stwcx.xyz> Change-Id: I9bd62b8dcc8b18a61958ed64de98b52a48333637
show more ...
|
#
083c7049 |
| 14-Oct-2024 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Remove dump status bits from PELs
Stop filling in the bits in the PEL that say there are un-offloaded dumps. These require calls to the dump daemon that can be slow and even time out if the du
PEL: Remove dump status bits from PELs
Stop filling in the bits in the PEL that say there are un-offloaded dumps. These require calls to the dump daemon that can be slow and even time out if the dump daemon is busy.
These aren't parsed out in the peltool output anyway, and there are other ways to determine if there are dumps - someone could just look at the dump D-Bus directly.
This isn't a direct revert of the commit that introduced it because of all the merge conflicts trying to do a revert entailed.
Tested: Can still create PELs.
Change-Id: I975f06ebf3638b39315fdea49393d1941a6f5216 Signed-off-by: Matt Spinler <spinler@us.ibm.com>
show more ...
|
#
075c7923 |
| 16-Aug-2024 |
Patrick Williams <patrick@stwcx.xyz> |
clang-format: re-format for clang-18
clang-format-18 isn't compatible with the clang-format-17 output, so we need to reformat the code with the latest version. The way clang-18 handles lambda forma
clang-format: re-format for clang-18
clang-format-18 isn't compatible with the clang-format-17 output, so we need to reformat the code with the latest version. The way clang-18 handles lambda formatting also changed, so we have made changes to the organization default style format to better handle lambda formatting.
See I5e08687e696dd240402a2780158664b7113def0e for updated style. See Iea0776aaa7edd483fa395e23de25ebf5a6288f71 for clang-18 enablement.
Change-Id: I21d2ca8065f24fd73509229c517f5caf48934b60 Signed-off-by: Patrick Williams <patrick@stwcx.xyz>
show more ...
|
#
2edce4e2 |
| 17-Jan-2024 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Use raw procedure names in PEL msg reg
Start using the raw maintenance procedure names in the PEL message registry, such as BMC0001, instead of the enum values, such as 'bmc_firmware'. This wa
PEL: Use raw procedure names in PEL msg reg
Start using the raw maintenance procedure names in the PEL message registry, such as BMC0001, instead of the enum values, such as 'bmc_firmware'. This way, the script that generates documentation can list the procedure in the section for that PEL. This was requested by the support team.
Note that there is still one user of the enums - the checkstop analysis code uses them when it creates callouts by passing in them via JSON user data files.
There are pointers added to the README and schema to find the available procedures and their descriptions.
Tested: - The unit tests still work which verify the callout contents when PEL callouts are created via the message registry. - Also manually verified PEL procedure callouts look correct when created from the message registry.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I3e8416956120d4dae1ff40592fd0df4f0285d072
show more ...
|
#
da5b76b2 |
| 01-Jun-2023 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Support for CheckstopFlag msg reg field
Similiar to the DeconfigFlag field that was recently added, this one indicates the PEL is for a hardware checkstop and results in a bit in SRC hex word 5
PEL: Support for CheckstopFlag msg reg field
Similiar to the DeconfigFlag field that was recently added, this one indicates the PEL is for a hardware checkstop and results in a bit in SRC hex word 5 being set.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: Ib05de7471ad3e32f48e7f20a5c611abc119fe82a
show more ...
|
#
784b02e7 |
| 25-Apr-2023 |
Matt Spinler <spinler@us.ibm.com> |
PEL:test: Refactor mocked checkDumpStatus usage
When creating an instance of the SRC class in a testcase, it needs a filled in mock of the checkDumpStatus() function. Instead of manually doing that
PEL:test: Refactor mocked checkDumpStatus usage
When creating an instance of the SRC class in a testcase, it needs a filled in mock of the checkDumpStatus() function. Instead of manually doing that everywhere a PEL or SRC class is created, just do it in the constructor of the mock DataInterface class.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I74790c67251465aae87d318ea37891d9eabab5e5
show more ...
|
#
3fe93e96 |
| 14-Apr-2023 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Support new DeconfigFlag msg reg field
Add a new field to the PEL message registry to indicate that some hardware associated with a callout was deconfigured.
"DeconfigFlag": true
This results
PEL: Support new DeconfigFlag msg reg field
Add a new field to the PEL message registry to indicate that some hardware associated with a callout was deconfigured.
"DeconfigFlag": true
This results in bit 6 in the SRC's hex word 5 being set. It is looked at by the code that creates the periodic PELs when there is a system running with deconfigured hardware.
Note: For something to be considered deconfigured, it doesn't necessarily mean that firmware chose to not configure it. It could also mean that it is broken or missing, at least in the case of power supplies or fans.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I3e657cb52416fc50000e299bf2d583db7b67caf8
show more ...
|
#
bad056be |
| 25-Jan-2023 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Handle multiple inv paths per loc code
DataInterface::getInventoryFromLocCode() was only returning a single inventory path from GetFRUsByExpandedLocationCode() even though multiple paths may ha
PEL: Handle multiple inv paths per loc code
DataInterface::getInventoryFromLocCode() was only returning a single inventory path from GetFRUsByExpandedLocationCode() even though multiple paths may have been returned.
Mostly that was fine, except when a processor on a DCM was called out. That would lead to only one processor on the DCM being set to not functional by service_indicators.cpp, so on the web UI the actual CPU called out may not have been marked as unhealthy (health status critical in Redfish).
This commit changes getInventoryFromLocCode() to return all the paths that GetFRUsByExpandedLocationCode() returns, and then makes the corresponding changes in service_indicators.cpp to be able to handle multiple inventory paths per location code when setting them to not functional and creating a critical association.
The other code that was calling this function can just use the first path returned, since in those cases it's just needed to get the VPD information for the PEL, and all the paths would return the same info anyway since they had the same location code.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: Ia16f50881e4a4f84c171ae20b7a99eddcc98ad4f
show more ...
|
#
875b6c7b |
| 20-Oct-2021 |
Vijay Lobo <vijaylobo@gmail.com> |
PEL: Add boot progress code to SRC hex data
Add the first 8 characters from the ASCII string field of the current progress SRC, taken from the xyz.openbmc_project.State.Boot.Raw D-Bus interface, to
PEL: Add boot progress code to SRC hex data
Add the first 8 characters from the ASCII string field of the current progress SRC, taken from the xyz.openbmc_project.State.Boot.Raw D-Bus interface, to SRC hex word 4 when creating a PEL.
This is how the field is defined in the PEL spec, and is to help with debug so that one can know which part of the boot was occurring when the PEL was created. Note that at this point most progress codes are sent down from one of the host firmware subsystems and not created by the BMC.
The field is only inserted into the SRC if those characters are present and represent a valid 4 byte number, such as "C7004000". This is then represented as 0xC7004000 in the SRC word. Otherwise, the word is left at a value of zero.
For example: ... "Valid Word Count": "0x09", "Reference Code": "BD8D1001", "Hex Word 2": "00080455", "Hex Word 3": "2E2D0010", "Hex Word 4": "C7004000", <---Progress code "Hex Word 5": "00000000", "Hex Word 6": "00000005", "Hex Word 7": "00000000", "Hex Word 8": "00000000", "Hex Word 9": "00000000" ...
Signed-off-by: Vijay Lobo <vijaylobo@gmail.com> Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: Iba41e88626c0e081e5759b994e3630ef8b12daf4
show more ...
|
#
3c7ec6d8 |
| 06-May-2022 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Allow pretty symFRU/proc name in callout JSON
When callouts are passed in via JSON FFDC files, allow maintenance procedures and symbolic FRUs to be the pretty names used in the message registry
PEL: Allow pretty symFRU/proc name in callout JSON
When callouts are passed in via JSON FFDC files, allow maintenance procedures and symbolic FRUs to be the pretty names used in the message registry as well as the raw 7 character names.
Do this by attempting to look up the name as a key in the pretty name -> raw name maps, and if found then use the raw name.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I576e56932dd101e32caf495c65de6fed65de43e7
show more ...
|
#
a2d7b775 |
| 07-Mar-2022 |
Mike Capps <mikepcapps@gmail.com> |
PEL: Remove PowerFault from message entries, registries, and code
The PowerFault flag is deprecated and should be removed in both the registry and code references. Its functionality is superseded by
PEL: Remove PowerFault from message entries, registries, and code
The PowerFault flag is deprecated and should be removed in both the registry and code references. Its functionality is superseded by the critical_system severity level.
Signed-off-by: Mike Capps <mikepcapps@gmail.com> Change-Id: I1bc1898d9589e0678b5a37b87289c6588859f2ee
show more ...
|
#
50bfa69a |
| 06-Jan-2022 |
Sumit Kumar <sumit_kumar@in.ibm.com> |
PEL: Pass in subsystem to use with additional data
There is a requirement to pass in the subsystem to use with the additional data field. In the process update the user header code and SRC with the
PEL: Pass in subsystem to use with additional data
There is a requirement to pass in the subsystem to use with the additional data field. In the process update the user header code and SRC with the passed-in subsystem value instead from the message registry.
Signed-off-by: Sumit Kumar <sumit_kumar@in.ibm.com> Change-Id: I08c8c3bab100fec581df9043f4bb27b28cb556e1
show more ...
|
#
39936e34 |
| 13-Nov-2021 |
Zane Shelley <zshelle@us.ibm.com> |
PEL: pad hex words displayed in error message
In the message registry under "Documentation" and "Message", users can include the hex words as part of the error message. This change pads the displaye
PEL: pad hex words displayed in error message
In the message registry under "Documentation" and "Message", users can include the hex words as part of the error message. This change pads the displayed hex words so that the full eight digit hex number is displayed.
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I32c4204a1643b585dc1309f67976f4abb44fb04c
show more ...
|
#
9d43a727 |
| 24-Aug-2021 |
Sumit Kumar <sumit_kumar@in.ibm.com> |
PEL: Update bmc & platform dump status in SRC section
With every PEL creation, the status of bmc and platform dumps - Hardware and Hypervisor would be checked and status bits in SRC section updated
PEL: Update bmc & platform dump status in SRC section
With every PEL creation, the status of bmc and platform dumps - Hardware and Hypervisor would be checked and status bits in SRC section updated accordingly.
Change-Id: I3ec7626611cf330f2ce235a97ee3046c0d32b6ab Signed-off-by: Sumit Kumar <sumit_kumar@in.ibm.com>
show more ...
|
#
ea2873dd |
| 18-Aug-2021 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Change BMCSP01 maint procedure to BMC0001
This name change allows 4 characters of uniqueness instead of just 2.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I12cb883619a3a3c2f84
PEL: Change BMCSP01 maint procedure to BMC0001
This name change allows 4 characters of uniqueness instead of just 2.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I12cb883619a3a3c2f843dd7ccc3db6524dea3459
show more ...
|
#
479b6927 |
| 17-Aug-2021 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Remove 'no_vpd_for_fru' maintenance procedure
This procedure was being used in a callout when the location code could not be found on an inventory item. The code was changed to just not add a
PEL: Remove 'no_vpd_for_fru' maintenance procedure
This procedure was being used in a callout when the location code could not be found on an inventory item. The code was changed to just not add a callout in this case.
This was done for the following reasons: 1) There's no expected reason that the inventory would be missing a location code for a valid inventory path. 2) There wasn't a way to give a hint to the end user about what the callout should be in that case. Usually maintenance procedures should have steps one can take to still do the appropriate replacement. 3) Because of 1), this case indicates a bad inventory path passed in by the caller. Theoretically there is a way to have a 'bmc_code' callout, but that wouldn't be appropriate to add in this log since the intent of this one is for another problem. 3a) There is no way to create a new error log from inside a PEL section, and I don't think this one case warrants coming up with a way to do so. 4) The inventory path in question is still being added into a UserData section so that development can debug it.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I0757080f04942807cc34029d8667387db6b959fe
show more ...
|
#
0d92b528 |
| 16-Jun-2021 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Handle location codes for connectors
There are cases where PELs need to callout physical system connectors, such as a USB or other cable connector. This would be done by specifying the connect
PEL: Handle location codes for connectors
There are cases where PELs need to callout physical system connectors, such as a USB or other cable connector. This would be done by specifying the connector's location code in either the message registry JSON or device callout JSON. The problem is that the connector may not be modeled by the inventory, and even if it was, there would need to be a way to get the serial number, etc for the FRU that connector is on to use in the callout
Instead of forcing all connectors to be modeled in the inventory along with a copy of the SN/FN/CC from the parent FRU just to do a callout, this commit will just pop off the connector segment of the location code, a '-Tx', before trying to expand it or getting its related inventory item. That way, it will be operating on the parent FRU which would be in the inventory with its SN/FN/CC. In the case of the location code expansion, the connector segment will then just be re-appended afterwards.
This commit also makes a change to the device path callouts code to expand the location code of the callout from the device path JSON instead of trying to look it up later, which would have lost the connector portion.
Note that when a cable itself wants to be called out, there usually would need be a symbolic FRU specified saying which cable it is, along with the connector location codes. In that case, the SN/FN/CC of the parent FRU would not show up in the PEL.
Change-Id: I8e0eb58dfe630858f75e64b11ac13432bff4d2d1
show more ...
|
#
b41fa54b |
| 29-May-2021 |
William A. Kennington III <wak@google.com> |
test: openpower-pels: Fix unscoped use of DoAll
This is in the global scope on old releases of googletest but is fixed and inaccessible when tested against newer ones.
Change-Id: I139f2455bb804e1c7
test: openpower-pels: Fix unscoped use of DoAll
This is in the global scope on old releases of googletest but is fixed and inaccessible when tested against newer ones.
Change-Id: I139f2455bb804e1c711c5eec3159164d9ac138fa Signed-off-by: William A. Kennington III <wak@google.com>
show more ...
|
#
f3702bb3 |
| 09-Apr-2021 |
Vijay Lobo <vijaylobo@gmail.com> |
PEL: Support creator supplied power fault flag
User can supply POWER_THERMAL_CRITICAL_FAULT=TRUE as part of AdditionalData entry in the event log to set the power fault bit in PEL
Tested: I ran uni
PEL: Support creator supplied power fault flag
User can supply POWER_THERMAL_CRITICAL_FAULT=TRUE as part of AdditionalData entry in the event log to set the power fault bit in PEL
Tested: I ran unit test using docker. Also tested manually by setting D-bus event log
Change-Id: Ifa03f091a7d1efcc67f181c2be1cc0a6240891e1 Signed-off-by: Vijay Lobo <vijaylobo@gmail.com>
show more ...
|
#
1a1b0dfb |
| 23-Nov-2020 |
Harisuddin Mohamed Isa <harisuddin@gmail.com> |
peltool: Print the Words6To9 'Description' field
Include the 'Description' field from the Words6To9 registry entry when printing SRC error details.
Changed from this:
"Error Details": { "Messa
peltool: Print the Words6To9 'Description' field
Include the 'Description' field from the Words6To9 registry entry when printing SRC error details.
Changed from this:
"Error Details": { "Message": "There was a failure when reading a sensor device", "CALLOUT_ERRNO": "0x5" }
to this:
"Error Details": { "Message": "There was a failure when reading a sensor device", "CALLOUT_ERRNO": [ "0x5", "errno of the failure" ] }
Signed-off-by: Harisuddin Mohamed Isa <harisuddin@gmail.com> Change-Id: Icc107b0516062eb891f14f3ff3f3cffd316c6d77
show more ...
|
#
53ef1552 |
| 14-Oct-2020 |
Miguel Gomez <mgomez@mx1.ibm.com> |
Specify PEL callout priority and sort callouts.
Allow the priority to be passed in at creation time by calling the CALLOUT_PRIORITY keyword. When creating a PEL, callouts will always be sorted by pr
Specify PEL callout priority and sort callouts.
Allow the priority to be passed in at creation time by calling the CALLOUT_PRIORITY keyword. When creating a PEL, callouts will always be sorted by priority instead of creation time, the order is as follows: H,M,A,B,C,L.
Signed-off-by: Miguel Gomez <mgomez@mx1.ibm.com> Change-Id: I84345aaf3fad7b2e4958b698ab761966f499986b
show more ...
|
#
1ab6696f |
| 29-Oct-2020 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Use the real system names property
There is now a 'Names' property on the new xyz.openbmc_project.Configuration.IBMCompatibleSystem interface that the DataInterface::getSystemNames function wil
PEL: Use the real system names property
There is now a 'Names' property on the new xyz.openbmc_project.Configuration.IBMCompatibleSystem interface that the DataInterface::getSystemNames function will read. The existing code was mostly a placeholder until the actual interface showed up on D-Bus.
Since the path that holds the interface is system specific, and the PEL code should only rarely need this, the code was changed to read it on demand instead of at caching it startup.
Since getSystemNames() no longer returns a cached value, the signature was changed slightly to remove the reference from the returned std::vector<std::string> value.
The UserHeader constructor used to make a call to getSystemNames every time, and that was changed to only call it when actually necessary, which is when there is a severity value defined in the message registry that depends on the system name.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I1d4f307ef38287b631f94dae2b8929307d129029
show more ...
|
#
f00f9d0f |
| 23-Oct-2020 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Dynamic location code for registry callout
The PEL callout type 'symbolic FRU with trusted location code' previously had both the symoblic FRU name and the location code specified in the PEL me
PEL: Dynamic location code for registry callout
The PEL callout type 'symbolic FRU with trusted location code' previously had both the symoblic FRU name and the location code specified in the PEL message registry. However, there are cases where the location code is not known until runtime so it cannot be specified in the registry.
To solve this, create a boolean 'UseInventoryLocCode` key that can be used in the registry callout entry to specify that the location code to use for the callout should come from the FRU passed in using the CALLOUT_INVENTORY_PATH entry in the AdditionalData event log property. In this case, that FRU will not be added as a normal FRU callout as it would normally be otherwise. The registry that uses this must be the first one in the callout list.
For example: { "Priority": "high", "SymbolicFRUTrusted": "air_mover", "UseInventoryLocCode": true },
along with CALLOUT_INVENTORY_PATH=<processor 0 path>
would result in a symbolic FRU callout with the trusted location code being the processor 0 location code.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: Iaf65c523803662313f2ad5e197262b9f4e722332
show more ...
|
#
afa2c799 |
| 27-Aug-2020 |
Matt Spinler <spinler@us.ibm.com> |
PEL: Look for JSON Guard & Deconfig fields
There are optional 'Guarded' and 'Deconfigured' fields in the callout JSON. If these are set, there are some bits in the SRC that should be set.
Signed-o
PEL: Look for JSON Guard & Deconfig fields
There are optional 'Guarded' and 'Deconfigured' fields in the callout JSON. If these are set, there are some bits in the SRC that should be set.
Signed-off-by: Matt Spinler <spinler@us.ibm.com> Change-Id: I1bde74094fc596f383d683722d769615cba56975
show more ...
|