==========================
PCIe Device AER statistics
==========================
These attributes show up under all the devices that are AER capable. These
statistical counters indicate the errors "as seen/reported by the device".
Note that this may mean that if an endpoint is causing problems, the AER
counters may increment at its link partner (e.g. root port) because the
errors may be "seen" / reported by the link partner and not the
problematic endpoint itself (which may report all counters as 0 as it never
saw any problems).

Where:          /sys/bus/pci/devices/<dev>/aer_dev_correctable
Date:           July 2018
Kernel Version: 4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    List of correctable errors seen and reported by this
                PCI device using ERR_COR. Note that since multiple errors may
                be reported using a single ERR_COR message, TOTAL_ERR_COR
                at the end of the file may not match the actual total of all
                the errors in the file. Sample output:
-------------------------------------------------------------------------
localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_correctable
Receiver Error 2
Bad TLP 0
Bad DLLP 0
RELAY_NUM Rollover 0
Replay Timer Timeout 0
Advisory Non-Fatal 0
Corrected Internal Error 0
Header Log Overflow 0
TOTAL_ERR_COR 2
-------------------------------------------------------------------------

Where:          /sys/bus/pci/devices/<dev>/aer_dev_fatal
Date:           July 2018
Kernel Version: 4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    List of uncorrectable fatal errors seen and reported by this
                PCI device using ERR_FATAL. Note that since multiple errors may
                be reported using a single ERR_FATAL message, TOTAL_ERR_FATAL
                at the end of the file may not match the actual total of all
                the errors in the file. Sample output:
-------------------------------------------------------------------------
localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_fatal
Undefined 0
Data Link Protocol 0
Surprise Down Error 0
Poisoned TLP 0
Flow Control Protocol 0
Completion Timeout 0
Completer Abort 0
Unexpected Completion 0
Receiver Overflow 0
Malformed TLP 0
ECRC 0
Unsupported Request 0
ACS Violation 0
Uncorrectable Internal Error 0
MC Blocked TLP 0
AtomicOp Egress Blocked 0
TLP Prefix Blocked Error 0
TOTAL_ERR_FATAL 0
-------------------------------------------------------------------------

Where:          /sys/bus/pci/devices/<dev>/aer_dev_nonfatal
Date:           July 2018
Kernel Version: 4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    List of uncorrectable nonfatal errors seen and reported by this
                PCI device using ERR_NONFATAL. Note that since multiple errors
                may be reported using a single ERR_NONFATAL message,
                TOTAL_ERR_NONFATAL at the end of the file may not match the
                actual total of all the errors in the file. Sample output:
-------------------------------------------------------------------------
localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_nonfatal
Undefined 0
Data Link Protocol 0
Surprise Down Error 0
Poisoned TLP 0
Flow Control Protocol 0
Completion Timeout 0
Completer Abort 0
Unexpected Completion 0
Receiver Overflow 0
Malformed TLP 0
ECRC 0
Unsupported Request 0
ACS Violation 0
Uncorrectable Internal Error 0
MC Blocked TLP 0
AtomicOp Egress Blocked 0
TLP Prefix Blocked Error 0
TOTAL_ERR_NONFATAL 0
-------------------------------------------------------------------------

============================
PCIe Rootport AER statistics
============================
These attributes show up under only the rootports (or root complex event
collectors) that are AER capable. These indicate the number of error messages as
"reported to" the rootport. Please note that the rootports also transmit
(internally) the ERR_* messages for errors seen by the internal rootport PCI
device, so these counters include them and are thus cumulative of all the error
messages on the PCI hierarchy originating at that root port.

Where:          /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_cor
Date:           July 2018
Kernel Version: 4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    Total number of ERR_COR messages reported to rootport.

Where:          /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_fatal
Date:           July 2018
Kernel Version: 4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    Total number of ERR_FATAL messages reported to rootport.

Where:          /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_nonfatal
Date:           July 2018
Kernel Version: 4.19.0
Contact:        linux-pci@vger.kernel.org, rajatja@google.com
Description:    Total number of ERR_NONFATAL messages reported to rootport.
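
Example: reading the per-device counter files
---------------------------------------------
The aer_dev_correctable, aer_dev_fatal and aer_dev_nonfatal files described
earlier share one layout: an error name, a space, and a count on each line,
with a TOTAL_ERR_* line at the end. The following is a minimal userspace
sketch of how such a file could be parsed. It is not part of the kernel; the
function names parse_aer_counters() and read_aer_counters() are hypothetical,
and the code assumes the layout shown in the sample outputs.

```python
def parse_aer_counters(text):
    """Parse the text of an aer_dev_* file.

    Returns (counters, total): a dict of per-error counts, and the value
    of the trailing TOTAL_ERR_* line (None if that line is absent).
    """
    counters = {}
    total = None
    for line in text.splitlines():
        # Error names may contain spaces ("Bad TLP"), so split only once,
        # from the right: the count is always the last field.
        parts = line.rsplit(None, 1)
        if len(parts) != 2:
            continue  # skip blank or unexpected lines
        name, value = parts
        count = int(value)
        if name.startswith("TOTAL_ERR_"):
            total = count
        else:
            counters[name] = count
    return counters, total


def read_aer_counters(path):
    """Read and parse one counter file; 'path' is supplied by the caller."""
    with open(path) as f:
        return parse_aer_counters(f.read())


# Input copied from the aer_dev_correctable sample output.
SAMPLE = """\
Receiver Error 2
Bad TLP 0
Bad DLLP 0
RELAY_NUM Rollover 0
Replay Timer Timeout 0
Advisory Non-Fatal 0
Corrected Internal Error 0
Header Log Overflow 0
TOTAL_ERR_COR 2
"""

counters, total = parse_aer_counters(SAMPLE)
for name, count in counters.items():
    if count:
        print(name, count)
# The two values need not agree: one ERR_* message may carry several errors.
print("sum of counters:", sum(counters.values()), "reported total:", total)
```

The TOTAL_ERR_* value is kept separate from the per-error counts on purpose:
because a single message may report several errors, a tool should not treat a
mismatch between the sum and the total as a parsing failure.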