Hi,
In our 3-nodes cluster, the VMs are backuped once a week each saturdays night. (The local storage is the destination of these backups)
However, regularly, maybe one out of four or five times, one of the three nodes crashes and restart by itself.
I checked the /var/lib/vz/dump folder, it seems this is always during the same VM backup job the node crashes. I found .dat files and temp folders for the VM number 200.
I also checked /var/logs (attached) if there are any signs of errors, and I seen that corosync lost network links a few minutes before crash. (0:13)
I heard about a similar bug that the overload of network links during backup could make corosync struggling, but I thought it was fixed. (We run Proxmox VE 6.0-9) For information, the VMs running on a network SAN (iSCSI), so indeed the backup jobs generates necessarily network traffic.
Thanks in advance for your help.
(And apologies for bad english )
				
			In our 3-nodes cluster, the VMs are backuped once a week each saturdays night. (The local storage is the destination of these backups)
However, regularly, maybe one out of four or five times, one of the three nodes crashes and restart by itself.
I checked the /var/lib/vz/dump folder, it seems this is always during the same VM backup job the node crashes. I found .dat files and temp folders for the VM number 200.
I also checked /var/logs (attached) if there are any signs of errors, and I seen that corosync lost network links a few minutes before crash. (0:13)
I heard about a similar bug that the overload of network links during backup could make corosync struggling, but I thought it was fixed. (We run Proxmox VE 6.0-9) For information, the VMs running on a network SAN (iSCSI), so indeed the backup jobs generates necessarily network traffic.
Thanks in advance for your help.
(And apologies for bad english )
 
	 
	