Hello,
I have a 5 node cluster with CEPH and HA. There is an issue that requires the install of latest Intel 10gndrivers after the kernel is updated as It reverts to an older version. So for now we update and install the newer drivers after reboot.
During a maintenance window I did this for the first node and when the network went down, it triggered the rest of the nodes to reboot. We had quorum with 4 nodes, minimum set to 3.
I saw other threads with similar issues pointing at 5.x and corosync 2.x being the issue. But I use 6.x and confirmed corosync 3.x to be installed. I couldn't find anything of value in the logs to tell me why the other nodes decided to reboot. Any help would be appreciated.
I have a 5 node cluster with CEPH and HA. There is an issue that requires the install of latest Intel 10gndrivers after the kernel is updated as It reverts to an older version. So for now we update and install the newer drivers after reboot.
During a maintenance window I did this for the first node and when the network went down, it triggered the rest of the nodes to reboot. We had quorum with 4 nodes, minimum set to 3.
I saw other threads with similar issues pointing at 5.x and corosync 2.x being the issue. But I use 6.x and confirmed corosync 3.x to be installed. I couldn't find anything of value in the logs to tell me why the other nodes decided to reboot. Any help would be appreciated.