Hi forum,
we have a 4 node setup based on Supermicro Superservers with the latest PVE5.4 and Ceph. For about 2 months now, we observe sudden reboots of single nodes about once per week without any hints in the logs (messages, syslog, kern.log). It seems like the node is running without any trouble and suddenly decides to do a hard reboot. No kernel panic, no watchdog message - nothing.
Can someone give us a hint on how to debug this issue?
Hauke
we have a 4 node setup based on Supermicro Superservers with the latest PVE5.4 and Ceph. For about 2 months now, we observe sudden reboots of single nodes about once per week without any hints in the logs (messages, syslog, kern.log). It seems like the node is running without any trouble and suddenly decides to do a hard reboot. No kernel panic, no watchdog message - nothing.
Can someone give us a hint on how to debug this issue?
Hauke