Hello
Since installing in September, I have been getting random reboots on my 4-node cluster.
Each node reboots once a week.
There is no reason given in the logs. I have put corosync in debug mode, but still nothing.
The only thing I have found is that when I test with omping, it loses 1 packet out of 600.
But nothing about the watchdog is logged.
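To put the omping figure in perspective, here is a minimal sketch of the arithmetic. The function name `loss_percent` is just something made up for illustration, and the 600/1 numbers are the ones from my test above. It only computes the loss rate; it does not say whether that rate is enough to cause a reboot.

```python
# Loss rate for an omping-style test: packets sent vs. packets lost.
# loss_percent is a hypothetical helper, not part of omping or Proxmox.
def loss_percent(sent: int, lost: int) -> float:
    """Return the percentage of packets lost out of those sent."""
    if sent <= 0:
        raise ValueError("sent must be positive")
    if lost < 0 or lost > sent:
        raise ValueError("lost must be between 0 and sent")
    return 100.0 * lost / sent


if __name__ == "__main__":
    # Figures from the test described above: 600 sent, 1 lost.
    print(f"{loss_percent(600, 1):.2f}% loss")
```

So the loss is small, well under 1%.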
I'm using Intel cards with the ixgbe driver. I have seen posts on the forum about this, but nothing is clear about a solution.
I'm stuck because I don't know where to look or what to do to help determine the problem.
I hope someone can help.
Zorg
Code:
pve-manager: 5.3-5 (running version: 5.3-5/97ae681d)
pve-kernel-4.15: 5.2-12
pve-kernel-4.15.18-9-pve: 4.15.18-30
pve-kernel-4.15.17-3-pve: 4.15.17-14
ceph: 12.2.10-1~bpo90+1
corosync: 2.4.4-pve1
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
libjs-extjs: 6.0.1-2
libpve-access-control: 5.1-3
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-43
libpve-guest-common-perl: 2.0-18
libpve-http-server-perl: 2.0-11
libpve-storage-perl: 5.0-33
libqb0: 1.0.3-1~bpo9
lvm2: 2.02.168-pve6
lxc-pve: 3.0.2+pve1-5
lxcfs: 3.0.2-2
novnc-pve: 1.0.0-2
proxmox-widget-toolkit: 1.0-22
pve-cluster: 5.0-31
pve-container: 2.0-31
pve-docs: 5.3-1
pve-edk2-firmware: 1.20181023-1
pve-firewall: 3.0-16
pve-firmware: 2.0-6
pve-ha-manager: 2.0-5
pve-i18n: 1.0-9
pve-libspice-server1: 0.14.1-1
pve-qemu-kvm: 2.12.1-1
pve-xtermjs: 1.0-5
qemu-server: 5.0-43
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3