Our Proxmox server crashes intermittently. The time between crashes varies from several days to weeks or even months.
To rule out hardware issues, we've updated the iDRAC drivers and firmware. We've also set the BIOS system profile to performance and disabled C-states. In Proxmox, we've disabled EDAC as well.
Despite these measures, we haven't found any logs that explain the crashes. The only clue is an iDRAC message reporting a machine check error on CPU 1.
Has anyone experienced the same issue, or can anyone suggest what to check?
++++++++++++++++++++++++++++++++++++++++++
PVE Manager Version: pve-manager/7.1-7/df5740ad
Server: Dell PowerEdge R7515