Dear all,
We have a recurring issue where our VM become either very slow or unresponsive during backups. We have a cluster of 6 hypervisors (3 with SAS disks, 2 with SSDs, 1 with SATA disks, all servers have 3 identical disks on a LSI MegaRAID RAID5) backing up to a shared NFS mount (1 Gbps link). All hypervisors have the issue and since we created the cluster (several months ago).
The backup is configured as follow:
- Compression: LZO
- Mode: Snapshot
I agree a small freeze is required to snapshot, but this is not the case. The slowness/unresponsiveness lasts as long as the backup lasts.
We were able to get some logs from the VM during the issue, the following message is displayed (varying CPU ID/time): NMI watchdog: BUG: soft lockup - CPU#14 stuck for 22s
Any idea how to improve it?
Thanks!
We have a recurring issue where our VM become either very slow or unresponsive during backups. We have a cluster of 6 hypervisors (3 with SAS disks, 2 with SSDs, 1 with SATA disks, all servers have 3 identical disks on a LSI MegaRAID RAID5) backing up to a shared NFS mount (1 Gbps link). All hypervisors have the issue and since we created the cluster (several months ago).
The backup is configured as follow:
- Compression: LZO
- Mode: Snapshot
I agree a small freeze is required to snapshot, but this is not the case. The slowness/unresponsiveness lasts as long as the backup lasts.
We were able to get some logs from the VM during the issue, the following message is displayed (varying CPU ID/time): NMI watchdog: BUG: soft lockup - CPU#14 stuck for 22s
Any idea how to improve it?
Thanks!