Hi,
Our setup:
Two "big" Proxmox 5.3.8 servers with ZFS filesystems and HA configured for 3 VMs.
One old server that serves only as a quorum server.
Steps to reproduce:
- We have new switches, so we just wanted to move the network cables from the old switch to the new one (with a few seconds of interruption).
- We first unplugged the third server (the quorum server) so that HA would not trigger any VM migrations.
- Unplug the network of server-2 for 5 seconds.
- About 20 seconds later, server-1 crashes hard and restarts.
- Feb 22 13:13:23 proxmox-1 corosync[6597]: [MAIN ] Completed service synchronization, ready to provide service.
- So far fine: both servers are back in sync, seeing and knowing each other.
- Next log line after the hard crash:
- Feb 22 13:15:18 proxmox-1 systemd[1]: Starting Flush Journal to Persistent Storage...
And here the same effect:
Looking at the monitor/shell of server-1, there is not even a kernel panic message on the screen before the hard power-down...
Steps to prevent:
- No problem when the third server (the quorum server) stays plugged in.
- No crash if the HA group and all HA configurations are deleted beforehand.
- The cluster and replication jobs can stay active.
- The quorum server can be unplugged as long as HA is deleted.
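For anyone following along, here is a rough sketch of the vote arithmetic behind the steps above. It assumes that each of the three nodes has exactly one vote and that quorum means a strict majority of the expected votes (the usual corosync default, as far as I know). The function `has_quorum` and the scenario names are made up for illustration; they are not a Proxmox or corosync API.

```python
def has_quorum(visible_votes: int, expected_votes: int) -> bool:
    """Majority rule: a node is quorate if it sees more than half of the expected votes."""
    return visible_votes >= expected_votes // 2 + 1


# Assumption: server-1, server-2 and the quorum server have one vote each.
EXPECTED_VOTES = 3

# Votes that server-1 can see (counting itself) in each scenario.
scenarios = {
    "all plugged in": 3,
    "quorum server unplugged": 2,
    "only server-2 unplugged": 2,
    "quorum server and server-2 unplugged": 1,
}

results = {name: has_quorum(votes, EXPECTED_VOTES) for name, votes in scenarios.items()}

for name, quorate in results.items():
    state = "has quorum" if quorate else "has lost quorum"
    print(f"{name}: server-1 {state}")
```

In this sketch, server-1 is left without a majority only in the combination from the steps to reproduce. Whether that is really what makes the HA stack reset the node is an assumption on my side, nothing in the logs confirms it.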
As there is no log at all, and since it is reproducible and a little bit unbelievable, here is the hand-made ;-) video:
drive.google.com/file/d/1zxmtd-WyMO61FTP7Bq8R-jappPmWJaqM/view?usp=sharing
Daniel