Recently, a log called "single-bit ECC" has occurred a few times.

wlsghks213
Posts: 3
Joined: 2023/05/23 05:04:59

Post by wlsghks213 » 2023/05/23 06:45:25

kernel: {23}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 4
kernel: {23}[Hardware Error]: It has been corrected by h/w and requires no further action
kernel: {23}[Hardware Error]: event severity: corrected
kernel: {23}[Hardware Error]: Error 0, type: corrected
kernel: {23}[Hardware Error]: fru_text: A3
kernel: {23}[Hardware Error]: section_type: memory error
kernel: {23}[Hardware Error]: error_status: 0x0000000000000400
kernel: {23}[Hardware Error]: physical_address: 0x00000012ef189f80
kernel: {23}[Hardware Error]: node: 0 card: 2 module: 0 rank: 0 bank: 3 device: 14 row: 38801 column: 504
kernel: {23}[Hardware Error]: error_type: 2, single-bit ECC
kernel: {23}[Hardware Error]: DIMM location: not present. DMI handle: 0x0000
kernel: {24}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 65534
kernel: {24}[Hardware Error]: It has been corrected by h/w and requires no further action
kernel: {24}[Hardware Error]: event severity: corrected
kernel: {24}[Hardware Error]: Error 0, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 1, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 2, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 3, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 4, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 5, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 6, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 7, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 8, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 9, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 10, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: {24}[Hardware Error]: Error 11, type: corrected
kernel: {24}[Hardware Error]: section type: unknown, 330f1140-72a5-11df-9690-0002a5d5c51b
kernel: mce: [Hardware Error]: Machine check events logged

This has happened about five times in a month.

Does it indicate a real hardware problem?
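
To judge whether the same module keeps reporting errors, it can help to tally the records per DIMM label. The following is a minimal Python sketch, assuming the log lines keep the "{N}[Hardware Error]: key: value" layout shown above; summarize_memory_errors is a hypothetical helper name, and the record number in braces is assumed to be unique within the text you feed it (it may repeat across reboots).

```python
import re
from collections import Counter

# Matches lines such as "kernel: {23}[Hardware Error]: fru_text: A3".
LINE_RE = re.compile(r"\{(\d+)\}\[Hardware Error\]:\s*(.*)")


def summarize_memory_errors(log_text):
    """Count memory-error records per (fru_text, error_type) pair."""
    records = {}
    for line in log_text.splitlines():
        match = LINE_RE.search(line)
        if not match:
            continue  # not a numbered hardware-error line
        record_id, body = match.group(1), match.group(2)
        fields = records.setdefault(record_id, {})
        key, sep, value = body.partition(": ")
        if sep:
            # Keep the first value seen for each key within a record.
            fields.setdefault(key.strip(), value.strip())

    counts = Counter()
    for fields in records.values():
        # Only records explicitly marked as memory errors are counted;
        # the "section type: unknown" records are skipped.
        if fields.get("section_type") == "memory error":
            label = fields.get("fru_text", "unknown")
            kind = fields.get("error_type", "unknown")
            counts[(label, kind)] += 1
    return counts
```

Feeding it the saved kernel log text (for example the contents of /var/log/messages) returns a counter keyed by DIMM label and error type, so repeated hits on the same label, such as A3 here, stand out.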

TrevorH
Site Admin
Posts: 33267
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Post by TrevorH » 2023/05/23 11:19:18

Yes.

Try installing the 'edac-util' package, then run edac-util -v and it may tell you some more info.
The future appears to be RHEL or Debian. I think I'm going Debian.
Info for USB installs on http://wiki.centos.org/HowTos/InstallFromUSBkey
CentOS 5 and 6 are deadest, do not use them.
Use the FAQ Luke

Whoever
Posts: 1363
Joined: 2013/09/06 03:12:10

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Post by Whoever » 2023/05/23 15:57:45

TrevorH wrote:
2023/05/23 11:19:18
run edc-util -v and it may tell you some more info.
Shouldn't that be "edac-util -v"?

TrevorH
Site Admin
Posts: 33267
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Post by TrevorH » 2023/05/23 16:15:28

It should, and it is now, thanks.

wlsghks213
Posts: 3
Joined: 2023/05/23 05:04:59

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Post by wlsghks213 » 2023/05/25 07:28:01

When I enter the command, the output is as follows:

# edac-util -v
edac-util: Error: No memory controller data found.

I wonder whether there is an actual hardware error.
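
As far as I understand it, edac-util takes its numbers from the kernel's EDAC entries in sysfs, so "No memory controller data found" most likely means no EDAC driver has registered a memory controller there; on its own it says nothing about whether the RAM is ECC. A minimal Python sketch to look at the same place directly, assuming the usual /sys/devices/system/edac/mc layout with ce_count and ue_count files per controller (read_edac_counts is a hypothetical helper name):

```python
from pathlib import Path


def read_edac_counts(root="/sys/devices/system/edac/mc"):
    """Return {controller: {"ce_count": int, "ue_count": int}} from sysfs.

    An empty dict means no memory controller directories were found.
    """
    counts = {}
    base = Path(root)
    if not base.is_dir():
        return counts
    for mc_dir in sorted(base.glob("mc[0-9]*")):
        entry = {}
        for name in ("ce_count", "ue_count"):
            try:
                entry[name] = int((mc_dir / name).read_text().strip())
            except (OSError, ValueError):
                entry[name] = None  # file missing or not a number
        counts[mc_dir.name] = entry
    return counts
```

If this returns an empty dict on the affected machine, the corrected errors are probably only being reported through the firmware (APEI/GHES) path seen in the log above, and the kernel log remains the place to count them.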

TrevorH
Site Admin
Posts: 33267
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Post by TrevorH » 2023/05/25 17:06:12

Do you have ECC RAM installed?

wlsghks213
Posts: 3
Joined: 2023/05/23 05:04:59

Re: Recently, a log called "single-bit ECC" has occurred a few times.

Post by wlsghks213 » 2023/05/26 00:32:55

Yes, it is.