Hi,
Brand new servers with PVE 5 installed. We started pushing more and more load to the servers and we are experiencing:
* IO and CPU stalls on our monitoring station (Centos 7) even when load reported by the machine is under 1
* IO stalls around 1am on our SQL Server (Windows 2019) which sometimes impact the log files and corrupts it and blocks the backups....
* IO Stalls on other servers where they need an hour to just do a simple reboot whether windows or linux.
Am still looking on the hardware side of things but:
* ZFS which is underlying the VM disks reports no errors, same for disk SMARTs
So, where can I gather more information in PVE about stalls and performance statistics (What is the default location for those log files)
And does anyone ever had that issue and knows how they fixed it?
Thanks
Brand new servers with PVE 5 installed. We started pushing more and more load to the servers and we are experiencing:
* IO and CPU stalls on our monitoring station (Centos 7) even when load reported by the machine is under 1
* IO stalls around 1am on our SQL Server (Windows 2019) which sometimes impact the log files and corrupts it and blocks the backups....
* IO Stalls on other servers where they need an hour to just do a simple reboot whether windows or linux.
Am still looking on the hardware side of things but:
* ZFS which is underlying the VM disks reports no errors, same for disk SMARTs
So, where can I gather more information in PVE about stalls and performance statistics (What is the default location for those log files)
And does anyone ever had that issue and knows how they fixed it?
Thanks