Hey,
I've recently updated from PVE 7.0 to PVE 7.1, since then I've been getting IO issues in my VMs, docker images getting downloaded and then being corrupt or deb files being corrupted when trying to upgrade packages in VMs.
I thought the issue was my two NVMe disks dying so I've had them replaced with some new ones (they were for boot), then a fresh install of PVE 7.1, restored all my VMs from backup (Thank you PBS!), and the issues look to be happening again, at least currently console is getting buffer I/O errors. Also its happening on VMs that are running off HDDs, so it's not the NVMe's again, and maybe it never was.
I've got 2 ZFS mirrors.
First zfs pool which is called rpool which is created and installed using PVE installer, using the 2 NVME drives.
Second zfs pool is called hdd, using 2 4TB HDDs in a mirror.
All my Disc are Data Center grade.
2x WDC CL SN720 SDAQNTW-512G-2000
2x HGST_HUS726T4TALA6L1
ZFS isn't showing any errors even after multiple scrubs, so I'm wondering are the errors at the qemu layer?
I use the virtio driver for disks and networking.
SMART passes on all drives.
Anyone getting issues?
Here's a screenshot from the console, VMs were started within an hour of each other.
VM 1 and 2, were started 17 or so hours ago, VM 3 recently started and also errors.
VM 1 (NVMe ZFS Mirror) 14ish hours up
VM 2 (NVMe ZFS Mirror) 14ish hours up
VM 3 (HDD ZFS Mirror), recently start and up 2 hours at this point
I've recently updated from PVE 7.0 to PVE 7.1, since then I've been getting IO issues in my VMs, docker images getting downloaded and then being corrupt or deb files being corrupted when trying to upgrade packages in VMs.
I thought the issue was my two NVMe disks dying so I've had them replaced with some new ones (they were for boot), then a fresh install of PVE 7.1, restored all my VMs from backup (Thank you PBS!), and the issues look to be happening again, at least currently console is getting buffer I/O errors. Also its happening on VMs that are running off HDDs, so it's not the NVMe's again, and maybe it never was.
I've got 2 ZFS mirrors.
First zfs pool which is called rpool which is created and installed using PVE installer, using the 2 NVME drives.
Second zfs pool is called hdd, using 2 4TB HDDs in a mirror.
All my Disc are Data Center grade.
2x WDC CL SN720 SDAQNTW-512G-2000
2x HGST_HUS726T4TALA6L1
ZFS isn't showing any errors even after multiple scrubs, so I'm wondering are the errors at the qemu layer?
I use the virtio driver for disks and networking.
SMART passes on all drives.
Anyone getting issues?
Here's a screenshot from the console, VMs were started within an hour of each other.
VM 1 and 2, were started 17 or so hours ago, VM 3 recently started and also errors.
VM 1 (NVMe ZFS Mirror) 14ish hours up
VM 2 (NVMe ZFS Mirror) 14ish hours up
VM 3 (HDD ZFS Mirror), recently start and up 2 hours at this point