I conducted a test production POC with a 3-node Proxmox cluster and decided to modify the MTU settings. The process involved changing the MTU on node 3 first, applying the changes, then moving to node 2 for a similar adjustment. Unexpectedly, the network applied a reboot across all 3 nodes, resulting in the downtime of over 50 VMs. Fortunately, critical VMs were under the POC umbrella.
To analyze the root cause, I discovered that the team failed to place nodes in maintenance mode, and the MTU misconfiguration on 2 nodes caused a simultaneous restart. I'm currently seeking logs and assistance to identify the root cause of the Proxmox issue that led to the simultaneous reboot of all 3 nodes.
please assist me
Thanks and Regards
To analyze the root cause, I discovered that the team failed to place nodes in maintenance mode, and the MTU misconfiguration on 2 nodes caused a simultaneous restart. I'm currently seeking logs and assistance to identify the root cause of the Proxmox issue that led to the simultaneous reboot of all 3 nodes.
please assist me
Thanks and Regards