Hello $all,
I am running a Proxmox 4.0 Server for a customer, with qcow2 files and zfs as storage.
ZFS is set up this way:
- 2x2 mirror
- 2 cache devices (SSD)
The server has:
- 32gig of ram
- 16gig are used for the VMs (5, all windows XP to 7)
- 2 Xeon CPUs
For backup I run a script which creates a zfs snapshot and then rdiffs the qcow2 files. This evening the server became close to unresponsive. Once I managed to login, I saw this:
Please note the CPU usage values. Also, in the web interface I see totally strange CPU usage values for the VMs. First I thought a fan has died and the CPUs are throtteling massively, but I checked the BMC as well as thermal throttle counters. The Xeons look fine and run with full power.
What is happening here?
Regards,
Andreas
I am running a Proxmox 4.0 Server for a customer, with qcow2 files and zfs as storage.
ZFS is set up this way:
- 2x2 mirror
- 2 cache devices (SSD)
The server has:
- 32gig of ram
- 16gig are used for the VMs (5, all windows XP to 7)
- 2 Xeon CPUs
For backup I run a script which creates a zfs snapshot and then rdiffs the qcow2 files. This evening the server became close to unresponsive. Once I managed to login, I saw this:
Code:
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
6626 root 20 0 2526920 1.401g 2808 S 231.5 4.5 370:28.37 kvm
1104 root 1 -19 0 0 0 R 90.3 0.0 1221:11 z_wr_iss
1107 root 0 -20 0 0 0 R 90.3 0.0 67:06.84 z_wr_int_1
1175 root 20 0 0 0 0 R 90.3 0.0 1105:08 txg_sync
12554 root 20 0 18996 10184 1388 R 90.3 0.0 138:10.60 rdiff
13574 root 20 0 5373656 3.733g 2304 S 90.3 11.9 6339:06 kvm
31375 root 1 -19 0 0 0 R 90.3 0.0 0:26.16 z_wr_iss
31430 root 1 -19 0 0 0 R 90.3 0.0 0:20.30 z_wr_iss
31462 root 1 -19 0 0 0 R 90.3 0.0 0:06.71 z_wr_iss
31463 root 1 -19 0 0 0 R 90.3 0.0 0:03.54 z_wr_iss
31464 root 1 -19 0 0 0 R 90.3 0.0 0:02.51 z_wr_iss
31470 root 1 -19 0 0 0 R 90.3 0.0 0:02.43 z_wr_iss
31471 root 1 -19 0 0 0 R 90.3 0.0 0:02.50 z_wr_iss
3252 root 20 0 1449640 870372 2248 S 79.1 2.6 405:42.81 kvm
31476 root 20 0 25864 3048 2392 R 11.3 0.0 0:00.02 top
Please note the CPU usage values. Also, in the web interface I see totally strange CPU usage values for the VMs. First I thought a fan has died and the CPUs are throtteling massively, but I checked the BMC as well as thermal throttle counters. The Xeons look fine and run with full power.
What is happening here?
Regards,
Andreas