I now get twenty to thirty occurrences per day of the following message in the log file:
Quote
Feb 3 20:32:13 firewall kernel: MCA: Bank 4, Status 0xdc094000a6080a13
Feb 3 20:32:13 firewall kernel: MCA: Global Cap 0x0000000000000106, Status 0x0000000000000000
Feb 3 20:32:13 firewall kernel: MCA: Vendor "AuthenticAMD", ID 0x730, APIC ID 0
Feb 3 20:32:13 firewall kernel: MCA: CPU 0 COR OVER BUSLG Responder RD Memory
Feb 3 20:32:13 firewall kernel: MCA: Address 0x9224b940
Feb 3 20:32:13 firewall kernel: MCA: Misc 0xc01a0ffe01000000
With the command "mcelog --k8 --ascii" the event decodes to
Quote
CPU 0 4 northbridge
MISC c01a0ffe01000000 ADDR 9224b940
Northbridge RAM Chipkill ECC error
Chipkill ECC syndrome = a612
bit46 = corrected ecc error
bit59 = misc error valid
bit62 = error overflow (multiple errors)
bus error 'local node response, request didn't time out
generic read mem transaction
memory access, level generic'
STATUS dc094000a6080a13 MCGSTATUS 0
MCGCAP 106 APICID 0 SOCKETID 0
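For anyone who wants to check the decode by hand, here is a minimal Python sketch that pulls the same fields out of the Status value. The bit positions for 46, 59 and 62 come from the mcelog output above; the remaining flag positions, the syndrome layout (bits 24-31 as the high byte, bits 47-54 as the low byte) and the bus error code layout are my assumptions about the AMD K8-style northbridge register, and the function and key names are made up for illustration.

```python
# Status value copied from the kernel log line above.
STATUS = 0xDC094000A6080A13


def bit(value, n):
    """Return bit n of value as 0 or 1."""
    return (value >> n) & 1


def decode_status(status):
    """Decode a few fields of an MCA bank 4 status word.

    Bits 46, 59 and 62 are named in the mcelog output; the other
    positions are assumed from the K8-style register layout.
    """
    error_code = status & 0xFFFF  # low 16 bits hold the error code
    return {
        "valid": bool(bit(status, 63)),
        "overflow": bool(bit(status, 62)),       # multiple errors seen
        "uncorrected": bool(bit(status, 61)),
        "misc_valid": bool(bit(status, 59)),
        "addr_valid": bool(bit(status, 58)),
        "corrected_ecc": bool(bit(status, 46)),
        # Assumed Chipkill syndrome layout: high byte in bits 24-31,
        # low byte in bits 47-54.
        "syndrome": (((status >> 24) & 0xFF) << 8) | ((status >> 47) & 0xFF),
        # Assumed bus error code format: 0000 1PPT RRRR IILL
        "bus_error": (error_code >> 11) == 1,
        "timed_out": bool(bit(error_code, 8)),
    }


if __name__ == "__main__":
    for name, value in decode_status(STATUS).items():
        shown = hex(value) if name == "syndrome" else value
        print(f"{name}: {shown}")
```

With the value from my log this gives a corrected, not uncorrected, ECC error with the overflow flag set and the same syndrome that mcelog reports, so the two decodes agree.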
The apu2 runs the newest BIOS, v4.11.0.3.
I have not experienced any crashes or service failures on this device.
Does this event mean that the memory is about to fail, and should I replace the device as soon as possible?
I'd direct this question to PC Engines support team if I were you: support@pcengines.ch
Quote from: dave on February 11, 2020, 02:04:03 PM
I'd direct this question to PC Engines support team if I were you: support@pcengines.ch
Many thanks. A very good idea.
I have emailed support@pcengines.ch. If I get a proper answer, I will post it in the forum, too.