e36866c3 | 31-Oct-2022 |
Caleb Palmer <cnpalmer@us.ibm.com> |
Add auto-generated json RAS data and supporting changes
Moving forward we want to use json RAS data files that have been auto-generated instead of maintaining the json itself. This updates the curre
Add auto-generated json RAS data and supporting changes
Moving forward we want to use json RAS data files that have been auto-generated instead of maintaining the json itself. This updates the current json RAS data to version 2 and makes accompanying changes in the RAS data parser and schema.
Change-Id: I1278c65f6479437630de5b9d3440d4a19f42a1f6 Signed-off-by: Caleb Palmer <cnpalmer@us.ibm.com>
show more ...
|
3a80c983 | 21-Oct-2022 |
Ben Tyner <ben.tyner@ibm.com> |
Attn: Add ten second delay during SBE attention handling
A 10 second wait is introduced in the attention handler path when a SBE attention is detected. This wait has been shown to be adequate to all
Attn: Add ten second delay during SBE attention handling
A 10 second wait is introduced in the attention handler path when a SBE attention is detected. This wait has been shown to be adequate to allow the power fault handling logic to commit a PEL and gracefully power down the host before the SBE attention is handled. The SBE attention handling will be preempted by the host power down sequence. No SBE attention PEL will be committed.
Signed-off-by: Ben Tyner <ben.tyner@ibm.com> Change-Id: I91a313712b67eec0ec10eb3f37e0c56fda64f200
show more ...
|
d04d6423 | 23-Mar-2022 |
Ben Tyner <ben.tyner@ibm.com> |
Attn: Update and revise attention handler markdown
Updated the markdown document to reflect more accurately and with more detail the attention handler design.
Signed-off-by: Ben Tyner <ben.tyner@ib
Attn: Update and revise attention handler markdown
Updated the markdown document to reflect more accurately and with more detail the attention handler design.
Signed-off-by: Ben Tyner <ben.tyner@ibm.com> Change-Id: I8ac7aa9783e86893ba7dc3f6c0cb53c5a44481df
show more ...
|
21cc6270 | 05-Oct-2022 |
Ben Tyner <ben.tyner@ibm.com> |
Capture PRD scratch registers in attention handler
PRD uses some scratch registers to debug scenarios when analysis may have been interrupted. Attention handler will capture these registers in case
Capture PRD scratch registers in attention handler
PRD uses some scratch registers to debug scenarios when analysis may have been interrupted. Attention handler will capture these registers in case the analyzer does not.
Signed-off-by: Ben Tyner <ben.tyner@ibm.com> Change-Id: I2882afea1299e89ed5b5ad50441d9ca2eea13e96
show more ...
|
329dbbde | 03-Oct-2022 |
Caleb Palmer <cnpalmer@us.ibm.com> |
Adjust root cause filtering for IUE thresholds
After handling an IUE threshold, a channel fail will be initiated by firmware. If that channel fail causes a system checkstop, we want to blame the IUE
Adjust root cause filtering for IUE thresholds
After handling an IUE threshold, a channel fail will be initiated by firmware. If that channel fail causes a system checkstop, we want to blame the IUE FIR bits as the root cause.
Change-Id: Idd28b0b4310b83b97258755bc8da0dad1f58d2a6 Signed-off-by: Caleb Palmer <cnpalmer@us.ibm.com>
show more ...
|
7a465259 | 09-Sep-2022 |
Caleb Palmer <cnpalmer@us.ibm.com> |
Add FFDC for signatures stored in scratch registers
If analysis was interrupted by a system checkstop there may exist an error signature within two Hostboot scratch regs that indicates the signature
Add FFDC for signatures stored in scratch registers
If analysis was interrupted by a system checkstop there may exist an error signature within two Hostboot scratch regs that indicates the signature from that analysis. This commit adds support to add that signature as FFDC to the PEL if it exists to indicate that a prior analysis was interrupted such that we may be missing a PEL for that signature.
Change-Id: I53216e2c7910c69c4e7e74010a5c0045b793bfde Signed-off-by: Caleb Palmer <cnpalmer@us.ibm.com>
show more ...
|
4e4582ba | 01-Apr-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Stop ATTN service before instruction stop
Ensuring that ATTN service is stopped after PHYP has completed the power off process. PHYP must be done before starting instruction stop. So that is a good
Stop ATTN service before instruction stop
Ensuring that ATTN service is stopped after PHYP has completed the power off process. PHYP must be done before starting instruction stop. So that is a good target to use.
Tested: Verify that during soft-off attention handler will service attentions (inject error during graceful shutdown).
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I9611546d3133c444fe62500fc76750558945baf7
show more ...
|
1d63b6f3 | 04-Aug-2022 |
Patrick Williams <patrick@stwcx.xyz> |
MAINTAINERS: remove file
The MAINTAINERS file is deprecated in favor of OWNERS.
Signed-off-by: Patrick Williams <patrick@stwcx.xyz> Change-Id: I765ff4819bfd6395f8b1d517b460edb16ac5be62
|
e212fb06 | 22-Jul-2022 |
Patrick Williams <patrick@stwcx.xyz> |
sdbusplus: use shorter type aliases
The sdbusplus headers provide shortened aliases for many types. Switch to using them to provide better code clarity and shorter lines. Possible replacements are
sdbusplus: use shorter type aliases
The sdbusplus headers provide shortened aliases for many types. Switch to using them to provide better code clarity and shorter lines. Possible replacements are for: * bus_t * exception_t * manager_t * match_t * message_t * object_t * slot_t
Signed-off-by: Patrick Williams <patrick@stwcx.xyz> Change-Id: I9f166640fbf9d4ee3fb24b542a73237cdf54f775
show more ...
|
3de96454 | 22-Jul-2022 |
Patrick Williams <patrick@stwcx.xyz> |
OWNERS: switch 'matches' to 'matchers'
The original OWNERS template had a mistake which used 'matches' instead of the field supported by the Gerrit plugin 'matchers'. Update the OWNERS file to have
OWNERS: switch 'matches' to 'matchers'
The original OWNERS template had a mistake which used 'matches' instead of the field supported by the Gerrit plugin 'matchers'. Update the OWNERS file to have the correct field.
Signed-off-by: Patrick Williams <patrick@stwcx.xyz> Change-Id: I366d259d989fff2adce57a83943e6c8bd8a1b4f6
show more ...
|
9b41feba | 13-Jul-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Revert clang-format to align with openbmc/docs
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I50db2f1021b7f55711ad9bdcc318f63ce4109c62 |
a9f85590 | 28-Jun-2022 |
Ben Tyner <ben.tyner@ibm.com> |
Attention handler configuration file support
The default behavior of the attention handler can be overriden by modifying the config file contents. The behavior can also be defined by the associated
Attention handler configuration file support
The default behavior of the attention handler can be overriden by modifying the config file contents. The behavior can also be defined by the associated build option. The default behavior remains unchanged. The intent of this change is to allow other components to change the attention handler behavior by updating the configuration file and restarting the attention handler service.
Signed-off-by: Ben Tyner <ben.tyner@ibm.com> Change-Id: I8f73004c926a3a070c5feb690ee5684429f922c5
show more ...
|
fc7e2476 | 24-Jun-2022 |
Zane Shelley <zshelle@us.ibm.com> |
CORE_FIR recoverables could be blamed as checkstop root cause
If a CORE_FIR recoverable attention fails recovery, it will trigger a core unit checkstop attention via another bit. All core unit check
CORE_FIR recoverables could be blamed as checkstop root cause
If a CORE_FIR recoverable attention fails recovery, it will trigger a core unit checkstop attention via another bit. All core unit checkstop attentions have the potential to trigger a system checkstop attention. Therefore, all CORE_FIR recoverable attentions could be blamed a system checkstop root cause attentions.
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: Ib2f3916218b4dce88797f645a302716ef4fd4d49
show more ...
|
b82cbf75 | 27-Jun-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Update to clang-format-14
Required because the Jenkins CI tools have moved to v14.
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I3cf4df1b45325545a423bdcb810040724a598ec5 |
513f64aa | 15-Jun-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Handling for host detected LPC timeout
For reasons not explained yet, hardware will not initiate an LPC timeout attention via NCU timeout FIR bit as we expected. When the host firmware detects an LP
Handling for host detected LPC timeout
For reasons not explained yet, hardware will not initiate an LPC timeout attention via NCU timeout FIR bit as we expected. When the host firmware detects an LPC timeout, it will manually set N1_LOCAL_FIR[61] to force a system checkstop. The service response for this bit will be to call out the hardware as if there was a hardware reported LPC timeout.
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I863e8aa3ef50a4b18b5106b3a45c4cf81b2c7808
show more ...
|
4efca280 | 10-Jun-2022 |
Andrew Geissler <geissonator@yahoo.com> |
include what you use (array)
New yocto rebase compiler has been failing on missing includes so add them in.
Signed-off-by: Andrew Geissler <geissonator@yahoo.com> Change-Id: Iebd2fcc5d248528e04d3c5
include what you use (array)
New yocto rebase compiler has been failing on missing includes so add them in.
Signed-off-by: Andrew Geissler <geissonator@yahoo.com> Change-Id: Iebd2fcc5d248528e04d3c58b4d4ecd2f5694f4aa
show more ...
|
ed3ab8f9 | 24-May-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Fix outdate comment in analyzer filter support
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I5e14eb82a4017ed794314d2800ea88dd0d706942 |
cd6373d3 | 12-May-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Ignore analysis of OCMBs that have been masked
Attentions from OCMBs chip will flow through their connected processor chips. We should not do analysis of those attentions if they are masked on the c
Ignore analysis of OCMBs that have been masked
Attentions from OCMBs chip will flow through their connected processor chips. We should not do analysis of those attentions if they are masked on the connected processor chip regardless if the OCMB chip shows any active attentions. This will take care of scenarios like a channel failure attention that has already been handled and masked by the host firmware.
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I2c170ea4770ad3a229c1c65fa50b056fc8a6e4b2
show more ...
|
026e5a3f | 05-May-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Avoid guarding on TOD interfaces errors
The error could be anywhere between the two processors in the interface. Fatally guarding the MDMT will cause system outage until service is done. Instead, do
Avoid guarding on TOD interfaces errors
The error could be anywhere between the two processors in the interface. Fatally guarding the MDMT will cause system outage until service is done. Instead, do not guard on the TOD interface errors to avoid outage.
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I446917bad985e5143657398b2fbadacf6e8c4a9d
show more ...
|
d28d5f8b | 28-Apr-2022 |
austinfcui <austinfcui@gmail.com> |
Add Attn config option for clearing Attn Interrupts
Added attn config option Added UT for attn config object Added UT for cli that uses attn config object
Signed-off-by: austinfcui <austinfcui@gmai
Add Attn config option for clearing Attn Interrupts
Added attn config option Added UT for attn config object Added UT for cli that uses attn config object
Signed-off-by: austinfcui <austinfcui@gmail.com> Change-Id: I0e30ebf4d94de676814bf5e173691c4d1ee362cb
show more ...
|
e81eb15d | 19-Apr-2022 |
austinfcui <austinfcui@gmail.com> |
Add a UT function for getActiveChips() of util/pdbg files
Signed-off-by: austinfcui <austinfcui@gmail.com> Change-Id: I22fed946eaa7f4c11c52df9e2c5aa3ab734b033f |
7bf1bfa5 | 27-Apr-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Enable LPC timeout handling
It turns out the plugin exists, but nothing in the RAS data was calling the plugin.
Change-Id: I9d35a61064e5f412f216ffbea96597b4d691a98a |
8f07b2e6 | 21-Apr-2022 |
Zane Shelley <zshelle@us.ibm.com> |
Add chip at fault to TOD secondary error callout list
The error could be anywhere between the chip at fault and the clock source chip. So both chips should be added to the callout list.
Signed-off-
Add chip at fault to TOD secondary error callout list
The error could be anywhere between the chip at fault and the clock source chip. So both chips should be added to the callout list.
Signed-off-by: Zane Shelley <zshelle@us.ibm.com> Change-Id: I2a9e9b7b03ea300fc8ea92fcbd41080737f862d7
show more ...
|
5dbebde0 | 12-Apr-2022 |
austinfcui <austinfcui@gmail.com> |
Configure UT test static lib in base meson.build
Fix problem of UT GCOV converages not increasing
Signed-off-by: austinfcui <austinfcui@gmail.com> Change-Id: I070fedf7cd268a541fd61b05aca05b9759a4a6
Configure UT test static lib in base meson.build
Fix problem of UT GCOV converages not increasing
Signed-off-by: austinfcui <austinfcui@gmail.com> Change-Id: I070fedf7cd268a541fd61b05aca05b9759a4a67a
show more ...
|
7ec2dd11 | 13-Apr-2022 |
austinfcui <austinfcui@gmail.com> |
Eliminate warning messages from test-attention.cpp file
Signed-off-by: austinfcui <austinfcui@gmail.com> Change-Id: Ib933179c8d11486333bfb60a477a3a47dfc7e685 |