Could it be that the edac_mce_amd kernel module in the initrd is an older version than the one
that is available once the machine has booted into the real OS?
Steve Timm
-----Original Message-----
From: [log in to unmask] [mailto:[log in to unmask]] On Behalf Of Konstantin Olchanski
Sent: Thursday, February 07, 2013 7:10 PM
To: Yasha Karant
Cc: scientific-linux-users
Subject: Re: AMD edac_mce_amd kernel module question
On Tue, Feb 05, 2013 at 09:22:21PM -0800, Yasha Karant wrote:
> SL 6x X86-64 on an AMD CPU. During boot, the edac_mce_amd kernel
> module is indicated as not being loaded. However, lsmod as well as a
> direct viewing of /proc/modules shows that the module is loaded and
> live. Evidence below. Is this consistent? Is the module actually
> active?
>
> kernel: 2.6.32-279.el6.x86_64 #1 SMP from uname -a
>
> From /var/log/boot.log:
>
> AMD Processor family 16: Please load edac_mce_amd module.
> CPU is unsupported
>
I confirm that in SL6, the EDAC modules load and work correctly in the default configuration. (I did not have to do anything special; they "just worked" after the normal installation.)
How do I know they work? On both dual AMD Opteron machines still alive (the 1st generation, single-core ones), the memory subsystem is iffy and I often see messages from EDAC/MCE about ECC correcting memory bits.
In Yasha's case, I would be suspicious of the "CPU is unsupported" message. Perhaps that's the real problem. But he has not told us what CPU he has, so I cannot check the EDAC compatibility and supported CPU list for him.
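For anyone who wants to script the check Yasha did by hand (looking the module up in /proc/modules), here is a minimal sketch. It assumes the usual /proc/modules layout of name, size, reference count, dependencies, state, and address on each line. The helper name module_state is made up for illustration and is not part of any tool.

```python
def module_state(name, proc_modules_text):
    """Return the state field for a module (e.g. 'Live'), or None if absent.

    Assumes each line of /proc/modules has the fields:
    name, size, refcount, dependencies, state, address.
    """
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if len(fields) >= 5 and fields[0] == name:
            return fields[4]
    return None


if __name__ == "__main__":
    # Read the live module list from the running kernel.
    with open("/proc/modules") as f:
        state = module_state("edac_mce_amd", f.read())
    if state is None:
        print("edac_mce_amd is not loaded")
    else:
        print("edac_mce_amd state:", state)
```

A state of "Live" only says the module is loaded now. It says nothing about whether it was already loaded when the boot-time message was printed, and nothing about whether the CPU is supported.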
--
Konstantin Olchanski
Data Acquisition Systems: The Bytes Must Flow!
Email: olchansk-at-triumf-dot-ca
Snail mail: 4004 Wesbrook Mall, TRIUMF, Vancouver, B.C., V6T 2A3, Canada