hi fabian,packages are now on pve-no-subscription, please see the following extra information if you have been affected by a full-cluster-crash with the mentioned symptoms (one node rebooting/upgrading/restarting corosync -> cpg_join/cpg_send_message retry log entries followed by watchdog expiring on all nodes):
https://bugzilla.proxmox.com/show_bug.cgi?id=3672
for people who triggered this easily because of their particular load/network situation, it might be advisable to follow the following procedure to avoid triggering it again when installing the fixed versions:
stop the HA services (first LRM on all nodes, then CRM on all nodes - this disables HA, but also disarms the watchdog so you don't risk fencing)
then upgrade all nodes (this will automatically restart corosync, which will pick up the fixed libknet as well)
then start the HA services again (again first LRM on all nodes, then CRM) to re-enable HA features
i seem to have faced the same situation here concerning the thread we are discussing . possible ?
https://forum.proxmox.com/threads/3...n-1000-email-in-5-minutes.122354/#post-531832