[Mageia-sysadm] Many Corrected Memory Errors on jonund

Damien Lallement mageia at damsweb.net
Thu Aug 23 15:02:34 CEST 2012


Le 23/08/2012 13:57, nicolas vigier a écrit :
> On Thu, 23 Aug 2012, Pascal Terjan wrote:
>
>> MCE 0
>> CPU 0 BANK 8
>> MISC 920da04000000283 ADDR 19425300
>> TIME 1345722282 Thu Aug 23 13:44:42 2012
>> MCG status:
>> MCi status:
>> Corrected error
>> MCi_MISC register valid
>> MCi_ADDR register valid
>> MCA: MEMORY CONTROLLER RD_CHANNELunspecified_ERR
>> Transaction: Memory read error
>> Memory read ECC error
>> Memory corrected error count (CORE_ERR_CNT): 1
>> Memory transaction Tracker ID (RTId): 83
>> Memory DIMM ID of error: 0
>> Memory channel ID of error: 0
>> Memory ECC syndrome: 920da040
>> STATUS 8c0000400001009f MCGSTATUS 0
>> MCGCAP 1c09 APICID 0 SOCKETID 0
>> CPUID Vendor Intel Family 6 Model 44
>>
>> I have removed it from bs now, but will probably reenable it as they
>> are all corrected (yeah ECC), but we may consider replacing that RAM
>
> Ok. So we can run memtest86 on this server next week to find the wrong
> RAM and remove it.

For now, I made a memtester check.
Still in progress...

> I added this to the list :
> https://wiki.mageia.org/en/Marseille_3#Things_to_do_in_Marseille

Great!
Perhaps it will be a good idea to run it on each server once we are in 
da place.
-- 
Damien Lallement
twitter: damsweb - IRC: damsweb/coincoin


More information about the Mageia-sysadm mailing list