[SOLVED] Prevent Corosync from restarting server

NotMe

New Member
Feb 4, 2022
3
0
1
53
Hello,
I have troubles with restarting server by corosync, while network is unstable. Can I solve this problem without adding a seperate interfaces for corosync? Unfortunately, we encounter this problem during DDoS attacks before mitigation becomes effective (10-20s).

Code:
Oct 27 18:44:46 myserver corosync[865560]:   [TOTEM ] Retransmit List: 46 54 5a 61 6b 1 5 a b d e 10 11 34 48 62 6c 71 72 77 78 8c
Oct 27 18:44:46 myserver corosync[865560]:   [TOTEM ] Retransmit List: 11 34 48 62 1 5 a b 3c 42 46 54 5a 61 63 6b 6c 71 72 77 78 83 8c 8e
Oct 27 18:44:46 myserver corosync[865560]:   [TOTEM ] Retransmit List: 63 6b 6c 8a 1 5 a b d e 10 11 34 3c 42 71 72 77 78 83 8c 8e
Oct 27 18:44:46 myserver corosync[865560]:   [TOTEM ] Retransmit List: 34 3c 42 71 8f 1 a b 46 48 54 5a 61 62 63 6b 72 77 78 83 8c 8e
Oct 27 18:44:46 myserver corosync[865560]:   [TOTEM ] Retransmit List: 61 62 63 6b 8f 1 a b d e 10 11 3c 42 46 48 6c 71 72 77 78 83 8c 8e 90
Oct 27 18:44:46 myserver corosync[865560]:   [TOTEM ] Retransmit List: 46 48 6c 71 1 a b d e 54 5a 61 62 63 6b 72 77 78 83 8c 8e 90
Oct 27 18:44:48 myserver corosync[865560]:   [TOTEM ] Retransmit List: 61 62 63 6b 1 a b d 10 11 3c 42 46 48 6c 71 77 78 83 8c 8e 90
Oct 27 18:44:54 myserver pvestatd[1494]: status update time (13.120 seconds)
Oct 27 18:44:56 myserver corosync[865560]:   [TOTEM ] Retransmit List: 48 6c 71 77 93 1 a b d e 10 11 46 54 5a 61 62 63 6b 78 83 84 8c 8e
Oct 27 18:44:56 myserver corosync[865560]:   [TOTEM ] Retransmit List: 61 62 63 93 1 a b d e 3c 42 46 48 6b 6c 71 77 78 83 84 89 8c
Oct 27 18:44:56 myserver corosync[865560]:   [TOTEM ] Retransmit List: 6b 6c 71 93 1 a b d 10 11 46 54 5a 61 62 63 77 78 83 84 89 8a 8c
Oct 27 18:44:56 myserver corosync[865560]:   [TOTEM ] Retransmit List: 62 63 93 1 a b d e 10 11 3c 42 46 48 6b 6c 71 77 78 83 84 89 8a
Oct 27 18:44:58 myserver corosync[865560]:   [TOTEM ] Retransmit List: 6b 6c 71 93 1 a b d e 10 46 54 5a 61 62 63 77 78 83 84 89 8a
Oct 27 18:44:59 myserver corosync[865560]:   [TOTEM ] Retransmit List: 61 62 63 93 1 a b d e 11 3c 42 46 48 6b 6c 71 77 78 83 84 89
Oct 27 18:44:59 myserver corosync[865560]:   [TOTEM ] Retransmit List: 48 6b 6c 71 93 1 a b d e 10 11 46 54 5a 61 62 63 77 78 83 84 89 8a
Oct 27 18:44:59 myserver corosync[865560]:   [TOTEM ] Retransmit List: 61 62 63 93 1 a b d e 3c 42 46 48 6b 6c 71 77 78 83 84 89 8a
Oct 27 18:44:59 myserver corosync[865560]:   [TOTEM ] Retransmit List: 6b 6c 71 93 1 a b d 10 11 46 54 5a 61 62 63 77 78 83 84 89 8a 8c
Oct 27 18:44:59 myserver corosync[865560]:   [TOTEM ] Retransmit List: 62 63 93 1 a b d e 10 11 3c 42 46 48 6b 6c 71 77 78 83 84 89 8a
Oct 27 18:45:01 myserver corosync[865560]:   [TOTEM ] Retransmit List: 48 6b 6c 71 1 a b d e 10 54 5a 61 62 63 77 78 83 84 89 8a 8c 97
Oct 27 18:45:01 myserver corosync[865560]:   [TOTEM ] Retransmit List: 61 62 63 77 78 1 a b d e 11 42 48 6b 6c 71 83 84 89 8a 8c 97
Oct 27 18:45:01 myserver corosync[865560]:   [TOTEM ] Retransmit List: 48 6b 6c 71 83 1 a b d e 10 54 5a 61 62 63 77 78 84 89 8a 8c 8e 97
Oct 27 18:45:01 myserver corosync[865560]:   [TOTEM ] Retransmit List: 62 63 77 78 1 a b d e 11 42 48 6b 6c 71 83 84 89 8a 8c 8e 8f 97
Oct 27 18:45:01 myserver corosync[865560]:   [TOTEM ] Retransmit List: 6c 71 83 1 a b d 10 54 5a 61 62 63 77 78 84 89 8a 8c 8e 8f 97
Oct 27 18:45:02 myserver corosync[865560]:   [TOTEM ] Retransmit List: 63 77 78 1 a b d e 10 11 42 48 54 6b 6c 71 84 89 8a 8c 8e 8f 97
Oct 27 18:45:02 myserver corosync[865560]:   [TOTEM ] Retransmit List: 6b 6c 71 84 1 a b d e 10 11 5a 61 62 63 77 89 8a 8c 8e 8f 97
Oct 27 18:45:02 myserver corosync[865560]:   [TOTEM ] Retransmit List: 61 62 63 77 89 1 a b d e 42 48 54 6b 6c 71 78 8a 8c 8e 8f 97
Oct 27 18:45:02 myserver corosync[865560]:   [TOTEM ] Retransmit List: 54 6b 6c 71 78 1 a b d e 10 11 5a 61 62 63 77 83 84 8a 8c 8e 8f 97
Oct 27 18:45:02 myserver corosync[865560]:   [TOTEM ] Retransmit List: 62 63 77 83 1 a b d e 42 48 54 6b 6c 71 78 84 89 8a 8c 8e 8f 97
Oct 27 18:45:02 myserver corosync[865560]:   [TOTEM ] Retransmit List: 6c 71 78 1 a b d 10 11 5a 61 62 63 77 83 84 89 8a 8c 8e 8f 97
Oct 27 18:45:02 myserver corosync[865560]:   [TOTEM ] Retransmit List: 63 77 83 1 a b d e 10 11 42 48 54 6b 6c 71 84 89 8a 8c 8e 8f 97
Oct 27 18:45:04 myserver corosync[865560]:   [TOTEM ] Retransmit List: 6b 6c 71 84 1 a b d e 10 11 5a 61 62 63 77 89 8a 8c 8e 8f 97
Oct 27 18:45:06 myserver corosync[865560]:   [TOTEM ] Retransmit List: 61 62 63 77 89 1 a b d e 42 48 54 6b 6c 71 78 8a 8c 8e 8f 97
Oct 27 18:45:07 myserver watchdog-mux[744]: client watchdog expired - disable watchdog updates
Oct 27 18:45:07 myserver pvestatd[1494]: status update time (13.144 seconds)
Oct 27 18:45:11 myserver corosync[865560]:   [TOTEM ] Retransmit List: 54 6b 6c 71 78 1 a b d e 10 11 5a 61 62 77 8a 8c 8e 8f 97 98 9a 9b
Oct 27 18:45:11 myserver corosync[865560]:   [TOTEM ] Retransmit List: 62 8c 77 8a 8e 8f 92 97 98 9a 9e a0 a1 a3 a4
-- Reboot --

Thanks.
 
I'm afraid that as long as corosync is on the same network as public traffic, you will be susceptible to this kind of problem. Do you maybe have other networks for storage? You could use them as additional fallback links, so you don't have to have your own dedicated network for Corosync, but sill can fallback to some other network in case the public network gets overlaoded. Generally speaking though, it is not a really good idea to have Corosync share its network with public traffic.
 
I'm afraid that as long as corosync is on the same network as public traffic, you will be susceptible to this kind of problem. Do you maybe have other networks for storage? You could use them as additional fallback links, so you don't have to have your own dedicated network for Corosync, but sill can fallback to some other network in case the public network gets overlaoded. Generally speaking though, it is not a really good idea to have Corosync share its network with public traffic.

Thank you for your reply. I'll add new fail over interfaces. I hope, it helps. Kind regards.