Hi all,
We are currently running a PVE cluster with multiple machines and serve about 15 LXC containers and 10 VMs.
We are using PVE replication (every 15min) to onsite PVE machines and znapzend to create hourly snapshots that are also replicated off site.
This should protect us against a variety of worst case scenarios apart from one: A severe ZFS bug that would render both the production systems and snapshots (ie. backups) useless.
I now wonder if we should run PVE backups of VMs and containers on top of the backups. I don't like the idea because, with nearly 3TB of data, this would need a lot of additional storage plus create a lot of IO when all backups run, often causing PVE sync time-outs.
What would be the general opinion and risk assessment regarding reliability of ZFS, zfs send and snapshots?
We are currently running a PVE cluster with multiple machines and serve about 15 LXC containers and 10 VMs.
We are using PVE replication (every 15min) to onsite PVE machines and znapzend to create hourly snapshots that are also replicated off site.
This should protect us against a variety of worst case scenarios apart from one: A severe ZFS bug that would render both the production systems and snapshots (ie. backups) useless.
I now wonder if we should run PVE backups of VMs and containers on top of the backups. I don't like the idea because, with nearly 3TB of data, this would need a lot of additional storage plus create a lot of IO when all backups run, often causing PVE sync time-outs.
What would be the general opinion and risk assessment regarding reliability of ZFS, zfs send and snapshots?