Hi
For some time now, we have had a strange issue with the VM running our Zabbix monitoring server.
Zabbix runs as a VM on our PVE cluster and monitors all running virtual machines on the cluster. Zabbix VM is running CentOS 7 as the guest OS.
Whenever a backup of this VM is triggered, it seems to loose connectivity to all virtual machines across all nodes and thus kicking off 1000+ email alerts.
This behavior has been seen on backups prior to the new Proxmox Backup Server as well as after implementing the Proxmox Backup Server solution.
The Zabbix VM has been located on various different hosts throughout its lifecycle.
The VM also remains responsive during the backup
What we have found is that whenever the backup job executes "freeze" or "thaw" of the VM disk, The freeze seems to trigger connectivity issues and thaw restores it. We then thought that it had something to do with maybe a conflict with the qemu guest agent, so we disabled it on the vm configuration in Proxmox and disabled the related services in the guest OS. Still same result.
Has anyone else experienced similar issues
For now we have configured a manitenance window in Zabbix that roughly matches the window in which backup of this specific VM runs
For some time now, we have had a strange issue with the VM running our Zabbix monitoring server.
Zabbix runs as a VM on our PVE cluster and monitors all running virtual machines on the cluster. Zabbix VM is running CentOS 7 as the guest OS.
Whenever a backup of this VM is triggered, it seems to loose connectivity to all virtual machines across all nodes and thus kicking off 1000+ email alerts.
This behavior has been seen on backups prior to the new Proxmox Backup Server as well as after implementing the Proxmox Backup Server solution.
The Zabbix VM has been located on various different hosts throughout its lifecycle.
The VM also remains responsive during the backup
What we have found is that whenever the backup job executes "freeze" or "thaw" of the VM disk, The freeze seems to trigger connectivity issues and thaw restores it. We then thought that it had something to do with maybe a conflict with the qemu guest agent, so we disabled it on the vm configuration in Proxmox and disabled the related services in the guest OS. Still same result.
Has anyone else experienced similar issues
For now we have configured a manitenance window in Zabbix that roughly matches the window in which backup of this specific VM runs