Hi,
I am encountering some problems making backups on many servers running ZFS.
The PVE cluster is made of 4 nodes, every node have a ZFS mirror pool on 2 NVMe SSD drives and a full backup is made once per week at 00:30 on different days per each node:
Every time a backup is started I see some outages on some virtual machines which remains hanged for a while, and backups takes about 2 hours normally.
This happens on all nodes when a backup is made.
Last night a backup on one node took four times more (about 8 hours!) and I saw 8 to ~12 IO delay during the backup job (from 2:30 to 8:30), and during this period some virtual machines were randomly hanged:
I have the same behaviour during a restore of a virtual machine from backup (the 5.34 IO delay peak at 16:30 on the above graph).
Could you help me to understand what's going on, please?
Thank you very much!
I am encountering some problems making backups on many servers running ZFS.
The PVE cluster is made of 4 nodes, every node have a ZFS mirror pool on 2 NVMe SSD drives and a full backup is made once per week at 00:30 on different days per each node:
Every time a backup is started I see some outages on some virtual machines which remains hanged for a while, and backups takes about 2 hours normally.
This happens on all nodes when a backup is made.
Last night a backup on one node took four times more (about 8 hours!) and I saw 8 to ~12 IO delay during the backup job (from 2:30 to 8:30), and during this period some virtual machines were randomly hanged:
I have the same behaviour during a restore of a virtual machine from backup (the 5.34 IO delay peak at 16:30 on the above graph).
Could you help me to understand what's going on, please?
Thank you very much!