I used two 8G ecc ram proxmox servers , and One of them has been having some problems lately, the dmesg shows system auto corrected the error
the proxmox is still alived , but one of VMs on the host will have a kernel panic durning the correction.
I don't know why , the other VM does'nt effect by the correction , only specified one. they were all the same OS (ubuntu 14.04)
I will replace the memory for further test , but strill curious about why the correction will make vm kernel panic ?
Code:
[15327239.518589] {4}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 1
[15327239.518590] {4}[Hardware Error]: It has been corrected by h/w and requires no further action
[15327239.518591] {4}[Hardware Error]: event severity: corrected
[15327239.518592] {4}[Hardware Error]: Error 0, type: corrected
[15327239.518592] {4}[Hardware Error]: fru_text: CorrectedErr
[15327239.518593] {4}[Hardware Error]: section_type: memory error
[15327239.518594] {4}[Hardware Error]: node: 0 device: 1
[15327239.518594] {4}[Hardware Error]: error_type: 2, single-bit ECC
[15327239.518596] ghes_edac: Internal error: Can't find EDAC structure
the proxmox is still alived , but one of VMs on the host will have a kernel panic durning the correction.
I don't know why , the other VM does'nt effect by the correction , only specified one. they were all the same OS (ubuntu 14.04)
I will replace the memory for further test , but strill curious about why the correction will make vm kernel panic ?