Hi,
My physical CPUs are 2x Intel(R) Xeon(R) Silver 4316 CPU @ 2.30GHz
I don't have latest microcode manually installed
I tried with my prod VM after a shutdown x86-64-v2-AES , after 2 snapshots with ram => same problem VM begun to stall dmesg/journalctl warning
oct. 23 15:33:10 xxxxx kernel: clocksource: Long readout interval, skipping watchdog check: cs_nsec: 1000034688 wd_nsec: 1000034116
Then i halt change to host cpu and boot because my VM is in production.
I changed on my second proxmox server (cluster) same hardware configuration and done some more tests on another debian12 VM (2 Cpus 1G RAM)
Done 2 snapshot without issue, then i decide to simulate the CPU and IOs charge on this VM i used a stress and dd like this in a screen
simple-stress.sh
Bash:
#!/bin/bash
while true; do
stress -c 1 --timeout 1
sleep 0.5
done
and a dd randowm write in loop
Bash:
while [ 1 ]; do date ; dd if=/dev/random of=/tmp/FILE-BIG-1G bs=1M count=1000 ;done
DD result is OK
1048576000 octets (1,0 GB, 1000 MiB) copiés, 3,3899 s, 309 MB/s
After that i use a top and btop to watch the process
Here are the results after a second run (the first one was near same results) where i changed the guest cpu from kvm64 to hosts
first snapshot under stress => some failure
saving VM state and RAM using storage 'vms'
4.02 MiB in 0s
1.12 GiB in 1s
1.49 GiB in 2s
1.80 GiB in 3s
2.05 GiB in 4s
2.36 GiB in 5s
snapshot create failed: starting cleanup
TASK ERROR: unable to save VM state and RAM - qemu_savevm_state_complete_precopy error -5
dd not so good after that
1048576000 octets (1,0 GB, 1000 MiB) copiés, 4,84992 s, 216 MB/s
Next snapshot is OK but dd perf is down
saving VM state and RAM using storage 'vms'
1.51 MiB in 0s
882.54 MiB in 1s
1008.48 MiB in 2s
1.22 GiB in 3s
1.39 GiB in 5s
1.56 GiB in 6s
1.69 GiB in 7s
1.84 GiB in 8s
1.95 GiB in 9s
2.10 GiB in 10s
2.18 GiB in 11s
2.19 GiB in 12s
completed saving the VM state in 13s, saved 2.44 GiB
snapshotting 'drive-scsi0' (vms:vm-111-disk-0)
snapshotting 'drive-efidisk0' (vms:vm-111-disk-1)
TASK OK
dd =>
1048576000 octets (1,0 GB, 1000 MiB) copiés, 11,4522 s, 91,6 MB/s
I continue snapshots untill stalling/saturation , each time dd is down ..
1048576000 octets (1,0 GB, 1000 MiB) copiés, 17,3274 s, 60,5 MB/s
1048576000 octets (1,0 GB, 1000 MiB) copiés, 35,1212 s, 29,9 MB/s
1048576000 octets (1,0 GB, 1000 MiB) copiés, 253,325 s, 4,1 MB/s
Message dmesg/journalctl
oct. 23 17:32:15 debian12 kernel: clocksource: Long readout interval, skipping watchdog check: cs_nsec: 1076691946 wd_nsec: 1076691530
Time to reboot VM ..
PS: sorry for my English it's not my native language