Page 1 of 1

Recently, a log called "single-bit ECC" has occurred a few times.

Posted: 2023/05/23 06:45:25
by wlsghks213
kernel: {23}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 4
kernel: {23}[Hardware Error]: It has been corrected by h/w and requires no further action
kernel: {23}[Hardware Error]: event severity: corrected
kernel: {23}[Hardware Error]: Error 0, type: corrected
kernel: {23}[Hardware Error]: fru_text: A3
kernel: {23}[Hardware Error]: section_type: memory error
kernel: {23}[Hardware Error]: error_status: 0x0000000000000400
kernel: {23}[Hardware Error]: physical_address: 0x00000012ef189f80
kernel: {23}[Hardware Error]: node: 0 card: 2 module: 0 rank: 0 bank: 3 device: 14 row: 38801 column: 504
kernel: {23}[Hardware Error]: error_type: 2, single-bit ECC
kernel: {23}[Hardware Error]: DIMM location: not present. DMI handle: 0x0000
kernel: {24}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 65534
kernel: {24}[Hardware Error]: It has been corrected by h/w and requires no further action
kernel: {24}[Hardware Error]: event severity: corrected
kernel: {24}[Hardware Error]: Error 0, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 1, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 2, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 3, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 4, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 5, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 6, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 7, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 8, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 9, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 10, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 11, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: mce: [Hardware Error]: Machine check events logged







It happened about 5 times in a month.

Is there a real hardware problem?

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Posted: 2023/05/23 11:19:18
by TrevorH
Yes.

Try installing the 'edac-util' package then run edac-util -v and it may tell you some more info.

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Posted: 2023/05/23 15:57:45
by Whoever
TrevorH wrote:
2023/05/23 11:19:18
run edc-util -v and it may tell you some more info.
Shouldn't that be "edac-util -v"?

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Posted: 2023/05/23 16:15:28
by TrevorH
It should and is now, thanks.

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Posted: 2023/05/25 07:28:01
by wlsghks213
When the command is entered, the output is as follows

# edac-util -v
edac-util: Error: No memory controller data found.

I wonder if there is an actual hardware error

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Posted: 2023/05/25 17:06:12
by TrevorH
Do you have ECC RAM installed?

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Posted: 2023/05/26 00:32:55
by wlsghks213
yes. it is