Hi fellow Proxmoxers
two days ago, our nightly backup raised an error with one VM:
ERROR: Backup of VM 160 failed - VM 160 qmp command 'query-backup' failed - got wrong command id '12299:46449189' (expected 27831:3521)
We manually tried to backup the VM and it failed. When we set the backup mode from snapshot to stop, it worked fine.
Then, in the night from yesterday to today, the same errors occured on multiple VMs:
Clues:
- There's a QEMU guest agent installed on all the above suspects/VMs (Windows & Mac).
- All are on the same host. The other hosts are not affected (yet).
- The same VM from day 1 had the same error on day 2.
- If we manually backup in snapshot mode, the same error occures. But if we manually backup in stop mode, no errors occure.
- OS independent: some are Windows Servers, some Linux.
- It seems to spread, day 1 one VM was affected, day 2 five VMs are affected.
Just rebooting is not an option as there are a couple dozens of VMs running on the host.
And the pve versions:
Any help or hints from the hive is appreciated
Best,
Andy
two days ago, our nightly backup raised an error with one VM:
ERROR: Backup of VM 160 failed - VM 160 qmp command 'query-backup' failed - got wrong command id '12299:46449189' (expected 27831:3521)
160: 2021-04-19 23:51:05 INFO: Starting Backup of VM 160 (qemu)
160: 2021-04-19 23:51:05 INFO: status = running
160: 2021-04-19 23:51:05 INFO: VM Name: XXX
160: 2021-04-19 23:51:05 INFO: include disk 'ide0' 'ironVM:160/vm-160-disk-0.qcow2' 1000G
160: 2021-04-19 23:51:05 INFO: exclude disk 'ide1' 'ironNFS:160/vm-160-disk-1.qcow2' (backup=no)
160: 2021-04-19 23:51:05 INFO: backup mode: snapshot
160: 2021-04-19 23:51:05 INFO: ionice priority: 7
160: 2021-04-19 23:51:05 INFO: skip unused drive 'ironNFS:160/vm-160-disk-0.qcow2' (not included into backup)
160: 2021-04-19 23:51:05 INFO: creating vzdump archive '/mnt/pve/ironNFS/dump/vzdump-qemu-160-2021_04_19-23_51_04.vma.zst'
160: 2021-04-19 23:51:05 INFO: issuing guest-agent 'fs-freeze' command
160: 2021-04-19 23:51:06 INFO: issuing guest-agent 'fs-thaw' command
160: 2021-04-19 23:51:06 INFO: started backup task 'd2a5692c-f11e-4227-939f-6688a456cbf5'
160: 2021-04-19 23:51:06 INFO: resuming VM again
160: 2021-04-19 23:51:09 INFO: 0% (634.4 MiB of 1000.0 GiB) in 3s, read: 211.5 MiB/s, write: 167.7 MiB/s
160: 2021-04-19 23:52:20 INFO: 1% (10.1 GiB of 1000.0 GiB) in 1m 14s, read: 136.8 MiB/s, write: 134.5 MiB/s
160: 2021-04-19 23:53:38 INFO: 2% (20.0 GiB of 1000.0 GiB) in 2m 32s, read: 130.1 MiB/s, write: 128.5 MiB/s
160: 2021-04-19 23:54:53 INFO: 3% (30.5 GiB of 1000.0 GiB) in 3m 47s, read: 143.1 MiB/s, write: 137.2 MiB/s
160: 2021-04-19 23:56:02 INFO: 4% (40.1 GiB of 1000.0 GiB) in 4m 56s, read: 142.5 MiB/s, write: 140.9 MiB/s
160: 2021-04-19 23:57:20 INFO: 5% (50.1 GiB of 1000.0 GiB) in 6m 14s, read: 131.5 MiB/s, write: 129.9 MiB/s
160: 2021-04-19 23:58:43 INFO: 6% (60.1 GiB of 1000.0 GiB) in 7m 37s, read: 123.1 MiB/s, write: 121.7 MiB/s
160: 2021-04-19 23:59:55 INFO: 7% (70.1 GiB of 1000.0 GiB) in 8m 49s, read: 143.0 MiB/s, write: 141.0 MiB/s
160: 2021-04-20 00:01:08 INFO: 8% (80.1 GiB of 1000.0 GiB) in 10m 2s, read: 139.4 MiB/s, write: 137.7 MiB/s
160: 2021-04-20 00:02:19 INFO: 9% (90.0 GiB of 1000.0 GiB) in 11m 13s, read: 143.0 MiB/s, write: 141.3 MiB/s
160: 2021-04-20 00:03:34 INFO: 10% (100.1 GiB of 1000.0 GiB) in 12m 28s, read: 138.2 MiB/s, write: 134.5 MiB/s
160: 2021-04-20 00:04:50 INFO: 11% (110.0 GiB of 1000.0 GiB) in 13m 44s, read: 133.6 MiB/s, write: 131.9 MiB/s
160: 2021-04-20 00:06:00 INFO: 12% (120.0 GiB of 1000.0 GiB) in 14m 54s, read: 146.3 MiB/s, write: 143.7 MiB/s
160: 2021-04-20 00:07:10 INFO: 13% (130.0 GiB of 1000.0 GiB) in 16m 4s, read: 145.8 MiB/s, write: 144.5 MiB/s
160: 2021-04-20 00:07:15 INFO: 14% (140.1 GiB of 1000.0 GiB) in 16m 9s, read: 2.0 GiB/s, write: 108.9 MiB/s
160: 2021-04-20 00:07:18 INFO: 15% (157.2 GiB of 1000.0 GiB) in 16m 12s, read: 5.7 GiB/s, write: 21.3 KiB/s
160: 2021-04-20 00:07:35 INFO: 16% (160.1 GiB of 1000.0 GiB) in 16m 29s, read: 172.0 MiB/s, write: 122.3 MiB/s
160: 2021-04-20 00:08:48 INFO: 17% (170.1 GiB of 1000.0 GiB) in 17m 42s, read: 140.2 MiB/s, write: 138.9 MiB/s
160: 2021-04-20 00:10:06 INFO: 18% (180.1 GiB of 1000.0 GiB) in 19m, read: 131.6 MiB/s, write: 130.5 MiB/s
160: 2021-04-20 00:11:17 INFO: 19% (190.0 GiB of 1000.0 GiB) in 20m 11s, read: 143.2 MiB/s, write: 136.8 MiB/s
160: 2021-04-20 00:12:37 INFO: 20% (200.1 GiB of 1000.0 GiB) in 21m 31s, read: 129.1 MiB/s, write: 127.2 MiB/s
160: 2021-04-20 00:13:54 INFO: 21% (210.1 GiB of 1000.0 GiB) in 22m 48s, read: 132.5 MiB/s, write: 131.4 MiB/s
160: 2021-04-20 00:15:13 INFO: 22% (220.0 GiB of 1000.0 GiB) in 24m 7s, read: 128.8 MiB/s, write: 127.1 MiB/s
160: 2021-04-20 00:16:22 INFO: 23% (230.1 GiB of 1000.0 GiB) in 25m 16s, read: 150.1 MiB/s, write: 149.6 MiB/s
160: 2021-04-20 00:17:43 INFO: 24% (240.2 GiB of 1000.0 GiB) in 26m 37s, read: 127.5 MiB/s, write: 125.8 MiB/s
160: 2021-04-20 00:18:23 ERROR: VM 160 qmp command 'query-backup' failed - got wrong command id '12299:46449189' (expected 27831:3521)
160: 2021-04-20 00:18:23 INFO: aborting backup job
160: 2021-04-20 00:18:28 ERROR: VM 160 qmp command 'backup-cancel' failed - client closed connection
160: 2021-04-20 00:18:38 ERROR: Backup of VM 160 failed - VM 160 qmp command 'query-backup' failed - got wrong command id '12299:46449189' (expected 27831:3521)
160: 2021-04-19 23:51:05 INFO: status = running
160: 2021-04-19 23:51:05 INFO: VM Name: XXX
160: 2021-04-19 23:51:05 INFO: include disk 'ide0' 'ironVM:160/vm-160-disk-0.qcow2' 1000G
160: 2021-04-19 23:51:05 INFO: exclude disk 'ide1' 'ironNFS:160/vm-160-disk-1.qcow2' (backup=no)
160: 2021-04-19 23:51:05 INFO: backup mode: snapshot
160: 2021-04-19 23:51:05 INFO: ionice priority: 7
160: 2021-04-19 23:51:05 INFO: skip unused drive 'ironNFS:160/vm-160-disk-0.qcow2' (not included into backup)
160: 2021-04-19 23:51:05 INFO: creating vzdump archive '/mnt/pve/ironNFS/dump/vzdump-qemu-160-2021_04_19-23_51_04.vma.zst'
160: 2021-04-19 23:51:05 INFO: issuing guest-agent 'fs-freeze' command
160: 2021-04-19 23:51:06 INFO: issuing guest-agent 'fs-thaw' command
160: 2021-04-19 23:51:06 INFO: started backup task 'd2a5692c-f11e-4227-939f-6688a456cbf5'
160: 2021-04-19 23:51:06 INFO: resuming VM again
160: 2021-04-19 23:51:09 INFO: 0% (634.4 MiB of 1000.0 GiB) in 3s, read: 211.5 MiB/s, write: 167.7 MiB/s
160: 2021-04-19 23:52:20 INFO: 1% (10.1 GiB of 1000.0 GiB) in 1m 14s, read: 136.8 MiB/s, write: 134.5 MiB/s
160: 2021-04-19 23:53:38 INFO: 2% (20.0 GiB of 1000.0 GiB) in 2m 32s, read: 130.1 MiB/s, write: 128.5 MiB/s
160: 2021-04-19 23:54:53 INFO: 3% (30.5 GiB of 1000.0 GiB) in 3m 47s, read: 143.1 MiB/s, write: 137.2 MiB/s
160: 2021-04-19 23:56:02 INFO: 4% (40.1 GiB of 1000.0 GiB) in 4m 56s, read: 142.5 MiB/s, write: 140.9 MiB/s
160: 2021-04-19 23:57:20 INFO: 5% (50.1 GiB of 1000.0 GiB) in 6m 14s, read: 131.5 MiB/s, write: 129.9 MiB/s
160: 2021-04-19 23:58:43 INFO: 6% (60.1 GiB of 1000.0 GiB) in 7m 37s, read: 123.1 MiB/s, write: 121.7 MiB/s
160: 2021-04-19 23:59:55 INFO: 7% (70.1 GiB of 1000.0 GiB) in 8m 49s, read: 143.0 MiB/s, write: 141.0 MiB/s
160: 2021-04-20 00:01:08 INFO: 8% (80.1 GiB of 1000.0 GiB) in 10m 2s, read: 139.4 MiB/s, write: 137.7 MiB/s
160: 2021-04-20 00:02:19 INFO: 9% (90.0 GiB of 1000.0 GiB) in 11m 13s, read: 143.0 MiB/s, write: 141.3 MiB/s
160: 2021-04-20 00:03:34 INFO: 10% (100.1 GiB of 1000.0 GiB) in 12m 28s, read: 138.2 MiB/s, write: 134.5 MiB/s
160: 2021-04-20 00:04:50 INFO: 11% (110.0 GiB of 1000.0 GiB) in 13m 44s, read: 133.6 MiB/s, write: 131.9 MiB/s
160: 2021-04-20 00:06:00 INFO: 12% (120.0 GiB of 1000.0 GiB) in 14m 54s, read: 146.3 MiB/s, write: 143.7 MiB/s
160: 2021-04-20 00:07:10 INFO: 13% (130.0 GiB of 1000.0 GiB) in 16m 4s, read: 145.8 MiB/s, write: 144.5 MiB/s
160: 2021-04-20 00:07:15 INFO: 14% (140.1 GiB of 1000.0 GiB) in 16m 9s, read: 2.0 GiB/s, write: 108.9 MiB/s
160: 2021-04-20 00:07:18 INFO: 15% (157.2 GiB of 1000.0 GiB) in 16m 12s, read: 5.7 GiB/s, write: 21.3 KiB/s
160: 2021-04-20 00:07:35 INFO: 16% (160.1 GiB of 1000.0 GiB) in 16m 29s, read: 172.0 MiB/s, write: 122.3 MiB/s
160: 2021-04-20 00:08:48 INFO: 17% (170.1 GiB of 1000.0 GiB) in 17m 42s, read: 140.2 MiB/s, write: 138.9 MiB/s
160: 2021-04-20 00:10:06 INFO: 18% (180.1 GiB of 1000.0 GiB) in 19m, read: 131.6 MiB/s, write: 130.5 MiB/s
160: 2021-04-20 00:11:17 INFO: 19% (190.0 GiB of 1000.0 GiB) in 20m 11s, read: 143.2 MiB/s, write: 136.8 MiB/s
160: 2021-04-20 00:12:37 INFO: 20% (200.1 GiB of 1000.0 GiB) in 21m 31s, read: 129.1 MiB/s, write: 127.2 MiB/s
160: 2021-04-20 00:13:54 INFO: 21% (210.1 GiB of 1000.0 GiB) in 22m 48s, read: 132.5 MiB/s, write: 131.4 MiB/s
160: 2021-04-20 00:15:13 INFO: 22% (220.0 GiB of 1000.0 GiB) in 24m 7s, read: 128.8 MiB/s, write: 127.1 MiB/s
160: 2021-04-20 00:16:22 INFO: 23% (230.1 GiB of 1000.0 GiB) in 25m 16s, read: 150.1 MiB/s, write: 149.6 MiB/s
160: 2021-04-20 00:17:43 INFO: 24% (240.2 GiB of 1000.0 GiB) in 26m 37s, read: 127.5 MiB/s, write: 125.8 MiB/s
160: 2021-04-20 00:18:23 ERROR: VM 160 qmp command 'query-backup' failed - got wrong command id '12299:46449189' (expected 27831:3521)
160: 2021-04-20 00:18:23 INFO: aborting backup job
160: 2021-04-20 00:18:28 ERROR: VM 160 qmp command 'backup-cancel' failed - client closed connection
160: 2021-04-20 00:18:38 ERROR: Backup of VM 160 failed - VM 160 qmp command 'query-backup' failed - got wrong command id '12299:46449189' (expected 27831:3521)
We manually tried to backup the VM and it failed. When we set the backup mode from snapshot to stop, it worked fine.
Then, in the night from yesterday to today, the same errors occured on multiple VMs:
168: 2021-04-21 21:10:37 INFO: Starting Backup of VM 168 (qemu)
168: 2021-04-21 21:10:37 INFO: status = running
168: 2021-04-21 21:10:37 INFO: VM Name: XXX
168: 2021-04-21 21:10:37 INFO: include disk 'scsi0' 'ironVM:168/vm-168-disk-0.qcow2' 32G
168: 2021-04-21 21:10:37 INFO: exclude disk 'scsi1' 'ironNFS:168/vm-168-disk-1.qcow2' (backup=no)
168: 2021-04-21 21:10:37 INFO: backup mode: snapshot
168: 2021-04-21 21:10:37 INFO: ionice priority: 7
168: 2021-04-21 21:10:37 INFO: skip unused drive 'ironNFS:168/vm-168-disk-0.qcow2' (not included into backup)
168: 2021-04-21 21:10:37 INFO: creating vzdump archive '/mnt/pve/ironNFS/dump/vzdump-qemu-168-2021_04_21-21_10_37.vma.zst'
168: 2021-04-21 21:10:37 INFO: issuing guest-agent 'fs-freeze' command
168: 2021-04-21 21:10:37 INFO: issuing guest-agent 'fs-thaw' command
168: 2021-04-21 21:10:38 INFO: started backup task '3c48b6d7-1488-4c40-848c-4d66eb977e30'
168: 2021-04-21 21:10:38 INFO: resuming VM again
168: 2021-04-21 21:10:41 INFO: 1% (456.8 MiB of 32.0 GiB) in 3s, read: 152.2 MiB/s, write: 136.5 MiB/s
168: 2021-04-21 21:10:44 INFO: 2% (866.4 MiB of 32.0 GiB) in 6s, read: 136.5 MiB/s, write: 133.2 MiB/s
168: 2021-04-21 21:10:47 INFO: 3% (1.2 GiB of 32.0 GiB) in 9s, read: 120.8 MiB/s, write: 117.4 MiB/s
168: 2021-04-21 21:10:50 INFO: 5% (1.7 GiB of 32.0 GiB) in 12s, read: 161.9 MiB/s, write: 161.3 MiB/s
168: 2021-04-21 21:10:53 INFO: 6% (2.1 GiB of 32.0 GiB) in 15s, read: 161.9 MiB/s, write: 144.0 MiB/s
168: 2021-04-21 21:10:56 INFO: 8% (2.8 GiB of 32.0 GiB) in 18s, read: 219.9 MiB/s, write: 214.5 MiB/s
168: 2021-04-21 21:10:59 INFO: 10% (3.3 GiB of 32.0 GiB) in 21s, read: 187.3 MiB/s, write: 185.8 MiB/s
168: 2021-04-21 21:11:02 INFO: 12% (3.9 GiB of 32.0 GiB) in 24s, read: 180.0 MiB/s, write: 178.5 MiB/s
168: 2021-04-21 21:11:06 INFO: 13% (4.4 GiB of 32.0 GiB) in 28s, read: 125.1 MiB/s, write: 117.7 MiB/s
168: 2021-04-21 21:11:09 INFO: 15% (4.9 GiB of 32.0 GiB) in 31s, read: 189.7 MiB/s, write: 189.5 MiB/s
168: 2021-04-21 21:11:12 INFO: 17% (5.5 GiB of 32.0 GiB) in 34s, read: 190.9 MiB/s, write: 190.9 MiB/s
168: 2021-04-21 21:11:15 INFO: 18% (6.0 GiB of 32.0 GiB) in 37s, read: 193.3 MiB/s, write: 184.4 MiB/s
168: 2021-04-21 21:11:18 INFO: 20% (6.4 GiB of 32.0 GiB) in 40s, read: 139.0 MiB/s, write: 136.5 MiB/s
168: 2021-04-21 21:11:21 INFO: 21% (6.9 GiB of 32.0 GiB) in 43s, read: 142.6 MiB/s, write: 136.4 MiB/s
168: 2021-04-21 21:11:26 INFO: 23% (7.6 GiB of 32.0 GiB) in 48s, read: 142.8 MiB/s, write: 141.0 MiB/s
168: 2021-04-21 21:11:29 INFO: 24% (8.0 GiB of 32.0 GiB) in 51s, read: 147.0 MiB/s, write: 131.0 MiB/s
168: 2021-04-21 21:11:32 INFO: 26% (8.4 GiB of 32.0 GiB) in 54s, read: 138.1 MiB/s, write: 125.6 MiB/s
168: 2021-04-21 21:11:36 INFO: 28% (9.0 GiB of 32.0 GiB) in 58s, read: 145.0 MiB/s, write: 131.6 MiB/s
168: 2021-04-21 21:11:39 INFO: 29% (9.3 GiB of 32.0 GiB) in 1m 1s, read: 128.1 MiB/s, write: 118.1 MiB/s
168: 2021-04-21 21:11:42 INFO: 30% (9.7 GiB of 32.0 GiB) in 1m 4s, read: 122.0 MiB/s, write: 120.5 MiB/s
168: 2021-04-21 21:11:46 INFO: 32% (10.2 GiB of 32.0 GiB) in 1m 8s, read: 139.6 MiB/s, write: 114.4 MiB/s
168: 2021-04-21 21:11:49 INFO: 33% (10.6 GiB of 32.0 GiB) in 1m 11s, read: 131.7 MiB/s, write: 128.1 MiB/s
168: 2021-04-21 21:11:52 INFO: 34% (11.0 GiB of 32.0 GiB) in 1m 14s, read: 114.8 MiB/s, write: 109.9 MiB/s
168: 2021-04-21 21:11:56 INFO: 35% (11.5 GiB of 32.0 GiB) in 1m 18s, read: 135.0 MiB/s, write: 134.4 MiB/s
168: 2021-04-21 21:11:59 INFO: 36% (11.8 GiB of 32.0 GiB) in 1m 21s, read: 113.6 MiB/s, write: 113.4 MiB/s
168: 2021-04-21 21:12:02 INFO: 38% (12.2 GiB of 32.0 GiB) in 1m 24s, read: 129.3 MiB/s, write: 118.2 MiB/s
168: 2021-04-21 21:12:06 INFO: 39% (12.7 GiB of 32.0 GiB) in 1m 28s, read: 128.7 MiB/s, write: 128.4 MiB/s
168: 2021-04-21 21:12:09 INFO: 40% (13.1 GiB of 32.0 GiB) in 1m 31s, read: 124.5 MiB/s, write: 123.9 MiB/s
168: 2021-04-21 21:12:12 INFO: 41% (13.4 GiB of 32.0 GiB) in 1m 34s, read: 116.0 MiB/s, write: 115.6 MiB/s
168: 2021-04-21 21:12:16 INFO: 43% (14.0 GiB of 32.0 GiB) in 1m 38s, read: 145.9 MiB/s, write: 143.2 MiB/s
168: 2021-04-21 21:12:19 INFO: 45% (14.5 GiB of 32.0 GiB) in 1m 41s, read: 172.8 MiB/s, write: 164.2 MiB/s
168: 2021-04-21 21:12:22 INFO: 46% (14.9 GiB of 32.0 GiB) in 1m 44s, read: 125.7 MiB/s, write: 125.5 MiB/s
168: 2021-04-21 21:12:26 ERROR: VM 168 qmp command 'query-backup' failed - got wrong command id '12299:49138754' (expected 73836:1299)
168: 2021-04-21 21:12:26 INFO: aborting backup job
168: 2021-04-21 21:12:27 ERROR: Backup of VM 168 failed - VM 168 qmp command 'query-backup' failed - got wrong command id '12299:49138754' (expected 73836:1299)
168: 2021-04-21 21:10:37 INFO: status = running
168: 2021-04-21 21:10:37 INFO: VM Name: XXX
168: 2021-04-21 21:10:37 INFO: include disk 'scsi0' 'ironVM:168/vm-168-disk-0.qcow2' 32G
168: 2021-04-21 21:10:37 INFO: exclude disk 'scsi1' 'ironNFS:168/vm-168-disk-1.qcow2' (backup=no)
168: 2021-04-21 21:10:37 INFO: backup mode: snapshot
168: 2021-04-21 21:10:37 INFO: ionice priority: 7
168: 2021-04-21 21:10:37 INFO: skip unused drive 'ironNFS:168/vm-168-disk-0.qcow2' (not included into backup)
168: 2021-04-21 21:10:37 INFO: creating vzdump archive '/mnt/pve/ironNFS/dump/vzdump-qemu-168-2021_04_21-21_10_37.vma.zst'
168: 2021-04-21 21:10:37 INFO: issuing guest-agent 'fs-freeze' command
168: 2021-04-21 21:10:37 INFO: issuing guest-agent 'fs-thaw' command
168: 2021-04-21 21:10:38 INFO: started backup task '3c48b6d7-1488-4c40-848c-4d66eb977e30'
168: 2021-04-21 21:10:38 INFO: resuming VM again
168: 2021-04-21 21:10:41 INFO: 1% (456.8 MiB of 32.0 GiB) in 3s, read: 152.2 MiB/s, write: 136.5 MiB/s
168: 2021-04-21 21:10:44 INFO: 2% (866.4 MiB of 32.0 GiB) in 6s, read: 136.5 MiB/s, write: 133.2 MiB/s
168: 2021-04-21 21:10:47 INFO: 3% (1.2 GiB of 32.0 GiB) in 9s, read: 120.8 MiB/s, write: 117.4 MiB/s
168: 2021-04-21 21:10:50 INFO: 5% (1.7 GiB of 32.0 GiB) in 12s, read: 161.9 MiB/s, write: 161.3 MiB/s
168: 2021-04-21 21:10:53 INFO: 6% (2.1 GiB of 32.0 GiB) in 15s, read: 161.9 MiB/s, write: 144.0 MiB/s
168: 2021-04-21 21:10:56 INFO: 8% (2.8 GiB of 32.0 GiB) in 18s, read: 219.9 MiB/s, write: 214.5 MiB/s
168: 2021-04-21 21:10:59 INFO: 10% (3.3 GiB of 32.0 GiB) in 21s, read: 187.3 MiB/s, write: 185.8 MiB/s
168: 2021-04-21 21:11:02 INFO: 12% (3.9 GiB of 32.0 GiB) in 24s, read: 180.0 MiB/s, write: 178.5 MiB/s
168: 2021-04-21 21:11:06 INFO: 13% (4.4 GiB of 32.0 GiB) in 28s, read: 125.1 MiB/s, write: 117.7 MiB/s
168: 2021-04-21 21:11:09 INFO: 15% (4.9 GiB of 32.0 GiB) in 31s, read: 189.7 MiB/s, write: 189.5 MiB/s
168: 2021-04-21 21:11:12 INFO: 17% (5.5 GiB of 32.0 GiB) in 34s, read: 190.9 MiB/s, write: 190.9 MiB/s
168: 2021-04-21 21:11:15 INFO: 18% (6.0 GiB of 32.0 GiB) in 37s, read: 193.3 MiB/s, write: 184.4 MiB/s
168: 2021-04-21 21:11:18 INFO: 20% (6.4 GiB of 32.0 GiB) in 40s, read: 139.0 MiB/s, write: 136.5 MiB/s
168: 2021-04-21 21:11:21 INFO: 21% (6.9 GiB of 32.0 GiB) in 43s, read: 142.6 MiB/s, write: 136.4 MiB/s
168: 2021-04-21 21:11:26 INFO: 23% (7.6 GiB of 32.0 GiB) in 48s, read: 142.8 MiB/s, write: 141.0 MiB/s
168: 2021-04-21 21:11:29 INFO: 24% (8.0 GiB of 32.0 GiB) in 51s, read: 147.0 MiB/s, write: 131.0 MiB/s
168: 2021-04-21 21:11:32 INFO: 26% (8.4 GiB of 32.0 GiB) in 54s, read: 138.1 MiB/s, write: 125.6 MiB/s
168: 2021-04-21 21:11:36 INFO: 28% (9.0 GiB of 32.0 GiB) in 58s, read: 145.0 MiB/s, write: 131.6 MiB/s
168: 2021-04-21 21:11:39 INFO: 29% (9.3 GiB of 32.0 GiB) in 1m 1s, read: 128.1 MiB/s, write: 118.1 MiB/s
168: 2021-04-21 21:11:42 INFO: 30% (9.7 GiB of 32.0 GiB) in 1m 4s, read: 122.0 MiB/s, write: 120.5 MiB/s
168: 2021-04-21 21:11:46 INFO: 32% (10.2 GiB of 32.0 GiB) in 1m 8s, read: 139.6 MiB/s, write: 114.4 MiB/s
168: 2021-04-21 21:11:49 INFO: 33% (10.6 GiB of 32.0 GiB) in 1m 11s, read: 131.7 MiB/s, write: 128.1 MiB/s
168: 2021-04-21 21:11:52 INFO: 34% (11.0 GiB of 32.0 GiB) in 1m 14s, read: 114.8 MiB/s, write: 109.9 MiB/s
168: 2021-04-21 21:11:56 INFO: 35% (11.5 GiB of 32.0 GiB) in 1m 18s, read: 135.0 MiB/s, write: 134.4 MiB/s
168: 2021-04-21 21:11:59 INFO: 36% (11.8 GiB of 32.0 GiB) in 1m 21s, read: 113.6 MiB/s, write: 113.4 MiB/s
168: 2021-04-21 21:12:02 INFO: 38% (12.2 GiB of 32.0 GiB) in 1m 24s, read: 129.3 MiB/s, write: 118.2 MiB/s
168: 2021-04-21 21:12:06 INFO: 39% (12.7 GiB of 32.0 GiB) in 1m 28s, read: 128.7 MiB/s, write: 128.4 MiB/s
168: 2021-04-21 21:12:09 INFO: 40% (13.1 GiB of 32.0 GiB) in 1m 31s, read: 124.5 MiB/s, write: 123.9 MiB/s
168: 2021-04-21 21:12:12 INFO: 41% (13.4 GiB of 32.0 GiB) in 1m 34s, read: 116.0 MiB/s, write: 115.6 MiB/s
168: 2021-04-21 21:12:16 INFO: 43% (14.0 GiB of 32.0 GiB) in 1m 38s, read: 145.9 MiB/s, write: 143.2 MiB/s
168: 2021-04-21 21:12:19 INFO: 45% (14.5 GiB of 32.0 GiB) in 1m 41s, read: 172.8 MiB/s, write: 164.2 MiB/s
168: 2021-04-21 21:12:22 INFO: 46% (14.9 GiB of 32.0 GiB) in 1m 44s, read: 125.7 MiB/s, write: 125.5 MiB/s
168: 2021-04-21 21:12:26 ERROR: VM 168 qmp command 'query-backup' failed - got wrong command id '12299:49138754' (expected 73836:1299)
168: 2021-04-21 21:12:26 INFO: aborting backup job
168: 2021-04-21 21:12:27 ERROR: Backup of VM 168 failed - VM 168 qmp command 'query-backup' failed - got wrong command id '12299:49138754' (expected 73836:1299)
Clues:
- There's a QEMU guest agent installed on all the above suspects/VMs (Windows & Mac).
- All are on the same host. The other hosts are not affected (yet).
- The same VM from day 1 had the same error on day 2.
- If we manually backup in snapshot mode, the same error occures. But if we manually backup in stop mode, no errors occure.
- OS independent: some are Windows Servers, some Linux.
- It seems to spread, day 1 one VM was affected, day 2 five VMs are affected.
Just rebooting is not an option as there are a couple dozens of VMs running on the host.
And the pve versions:
Proxmox
Virtual Environment 6.3-6
Node 'XY'
CPU usage
9.54% of 96 CPU(s)
IO delay
0.01%
Load average
12.03,12.39,13.20
RAM usage
79.50% (1.17 TiB of 1.48 TiB)
KSM sharing
61.61 GiB
HD space(root)
3.26% (7.01 GiB of 215.05 GiB)
SWAP usage
N/A
CPU(s)
96 x Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz (2 Sockets)
Kernel Version
Linux 5.4.103-1-pve #1 SMP PVE 5.4.103-1 (Sun, 07 Mar 2021 15:55:09 +0100)
PVE Manager Version
pve-manager/6.3-6/2184247e
Logs
()
proxmox-ve: 6.3-1 (running kernel: 5.4.103-1-pve)
pve-manager: 6.3-6 (running version: 6.3-6/2184247e)
pve-kernel-5.4: 6.3-7
pve-kernel-helper: 6.3-7
pve-kernel-5.3: 6.1-6
pve-kernel-5.4.103-1-pve: 5.4.103-1
pve-kernel-5.4.78-2-pve: 5.4.78-2
pve-kernel-5.4.65-1-pve: 5.4.65-1
pve-kernel-5.4.60-1-pve: 5.4.60-2
pve-kernel-5.3.18-3-pve: 5.3.18-3
pve-kernel-5.3.10-1-pve: 5.3.10-1
ceph-fuse: 12.2.11+dfsg1-2.1+b1
corosync: 3.1.0-pve1
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: not correctly installed
ifupdown2: 3.0.0-1+pve3
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.20-pve1
libproxmox-acme-perl: 1.0.7
libproxmox-backup-qemu0: 1.0.3-1
libpve-access-control: 6.1-3
libpve-apiclient-perl: 3.1-3
libpve-common-perl: 6.3-5
libpve-guest-common-perl: 3.1-5
libpve-http-server-perl: 3.1-1
libpve-storage-perl: 6.3-7
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 4.0.6-2
lxcfs: 4.0.6-pve1
novnc-pve: 1.1.0-1
proxmox-backup-client: 1.0.10-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.4-6
pve-cluster: 6.2-1
pve-container: 3.3-4
pve-docs: 6.3-1
pve-edk2-firmware: 2.20200531-1
pve-firewall: 4.1-3
pve-firmware: 3.2-2
pve-ha-manager: 3.1-1
pve-i18n: 2.2-2
pve-qemu-kvm: 5.2.0-4
pve-xtermjs: 4.7.0-3
qemu-server: 6.3-8
smartmontools: 7.2-pve2
spiceterm: 3.1-1
vncterm: 1.6-2
zfsutils-linux: 2.0.3-pve2
Virtual Environment 6.3-6
Node 'XY'
CPU usage
9.54% of 96 CPU(s)
IO delay
0.01%
Load average
12.03,12.39,13.20
RAM usage
79.50% (1.17 TiB of 1.48 TiB)
KSM sharing
61.61 GiB
HD space(root)
3.26% (7.01 GiB of 215.05 GiB)
SWAP usage
N/A
CPU(s)
96 x Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz (2 Sockets)
Kernel Version
Linux 5.4.103-1-pve #1 SMP PVE 5.4.103-1 (Sun, 07 Mar 2021 15:55:09 +0100)
PVE Manager Version
pve-manager/6.3-6/2184247e
Logs
()
proxmox-ve: 6.3-1 (running kernel: 5.4.103-1-pve)
pve-manager: 6.3-6 (running version: 6.3-6/2184247e)
pve-kernel-5.4: 6.3-7
pve-kernel-helper: 6.3-7
pve-kernel-5.3: 6.1-6
pve-kernel-5.4.103-1-pve: 5.4.103-1
pve-kernel-5.4.78-2-pve: 5.4.78-2
pve-kernel-5.4.65-1-pve: 5.4.65-1
pve-kernel-5.4.60-1-pve: 5.4.60-2
pve-kernel-5.3.18-3-pve: 5.3.18-3
pve-kernel-5.3.10-1-pve: 5.3.10-1
ceph-fuse: 12.2.11+dfsg1-2.1+b1
corosync: 3.1.0-pve1
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: not correctly installed
ifupdown2: 3.0.0-1+pve3
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.20-pve1
libproxmox-acme-perl: 1.0.7
libproxmox-backup-qemu0: 1.0.3-1
libpve-access-control: 6.1-3
libpve-apiclient-perl: 3.1-3
libpve-common-perl: 6.3-5
libpve-guest-common-perl: 3.1-5
libpve-http-server-perl: 3.1-1
libpve-storage-perl: 6.3-7
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 4.0.6-2
lxcfs: 4.0.6-pve1
novnc-pve: 1.1.0-1
proxmox-backup-client: 1.0.10-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.4-6
pve-cluster: 6.2-1
pve-container: 3.3-4
pve-docs: 6.3-1
pve-edk2-firmware: 2.20200531-1
pve-firewall: 4.1-3
pve-firmware: 3.2-2
pve-ha-manager: 3.1-1
pve-i18n: 2.2-2
pve-qemu-kvm: 5.2.0-4
pve-xtermjs: 4.7.0-3
qemu-server: 6.3-8
smartmontools: 7.2-pve2
spiceterm: 3.1-1
vncterm: 1.6-2
zfsutils-linux: 2.0.3-pve2
Any help or hints from the hive is appreciated
Best,
Andy