Recently, a log called "single-bit ECC" has occurred a few times.
Posted: 2023/05/23 06:45:25
kernel: {23}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 4
kernel: {23}[Hardware Error]: It has been corrected by h/w and requires no further action
kernel: {23}[Hardware Error]: event severity: corrected
kernel: {23}[Hardware Error]: Error 0, type: corrected
kernel: {23}[Hardware Error]: fru_text: A3
kernel: {23}[Hardware Error]: section_type: memory error
kernel: {23}[Hardware Error]: error_status: 0x0000000000000400
kernel: {23}[Hardware Error]: physical_address: 0x00000012ef189f80
kernel: {23}[Hardware Error]: node: 0 card: 2 module: 0 rank: 0 bank: 3 device: 14 row: 38801 column: 504
kernel: {23}[Hardware Error]: error_type: 2, single-bit ECC
kernel: {23}[Hardware Error]: DIMM location: not present. DMI handle: 0x0000
kernel: {24}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 65534
kernel: {24}[Hardware Error]: It has been corrected by h/w and requires no further action
kernel: {24}[Hardware Error]: event severity: corrected
kernel: {24}[Hardware Error]: Error 0, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 1, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 2, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 3, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 4, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 5, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 6, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 7, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 8, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 9, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 10, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 11, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: mce: [Hardware Error]: Machine check events logged
It happened about 5 times in a month.
Is there a real hardware problem?
kernel: {23}[Hardware Error]: It has been corrected by h/w and requires no further action
kernel: {23}[Hardware Error]: event severity: corrected
kernel: {23}[Hardware Error]: Error 0, type: corrected
kernel: {23}[Hardware Error]: fru_text: A3
kernel: {23}[Hardware Error]: section_type: memory error
kernel: {23}[Hardware Error]: error_status: 0x0000000000000400
kernel: {23}[Hardware Error]: physical_address: 0x00000012ef189f80
kernel: {23}[Hardware Error]: node: 0 card: 2 module: 0 rank: 0 bank: 3 device: 14 row: 38801 column: 504
kernel: {23}[Hardware Error]: error_type: 2, single-bit ECC
kernel: {23}[Hardware Error]: DIMM location: not present. DMI handle: 0x0000
kernel: {24}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 65534
kernel: {24}[Hardware Error]: It has been corrected by h/w and requires no further action
kernel: {24}[Hardware Error]: event severity: corrected
kernel: {24}[Hardware Error]: Error 0, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 1, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 2, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 3, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 4, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 5, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 6, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 7, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 8, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 9, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 10, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 11, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: mce: [Hardware Error]: Machine check events logged
It happened about 5 times in a month.
Is there a real hardware problem?