Hi team,
Former ESXi user here evaluating Proxmox on a single host prior to testing clusters and considering it for our datacenter environments as a vCenter replacement.
I'm used to a snapshot being an instant process with no visible VM pause unless you're also capturing the memory, which I am not.
I last used Proxmox many years ago and am somewhat familiar with its UI.
I have a Windows Server 2022 VM with the config below:
agent: 1,freeze-fs-on-backup=0
balloon: 0
bios: ovmf
boot: order=scsi0;net0;scsi1;ide2
cores: 8
cpu: x86-64-v2-AES
efidisk0: SSD_VOL1:100/vm-100-disk-0.qcow2,efitype=4m,pre-enrolled-keys=1,size=528K
ide2: SAS_VOL1:iso/virtio-win-0.1.248.iso,media=cdrom,size=715188K
lock: snapshot
machine: pc-i440fx-8.1
memory: 16384
meta: creation-qemu=8.1.5,ctime=1715560288
name: Redacted
net0: virtio=Redacted,bridge=vmbr0,tag=2
numa: 0
ostype: win11
scsi0: SSD_VOL1:100/vm-100-disk-1.qcow2,backup=0,iothread=1,size=80G
scsi1: SAS_VOL1:100/vm-100-disk-0.qcow2,backup=0,iothread=1,size=6T
scsihw: virtio-scsi-single
smbios1: uuid=fbb8944e-1cca-4e67-ac06-bbed39d2e377,manufacturer=SFA=,product=UHJvTGlhbnQgTUwzNTAgR2VuMTE=,family=SFA=,base64=1
sockets: 1
vmgenid: f5b375f6-f477-47b6-b6c6-320503848c1b

[preupdates]
agent: 1,freeze-fs-on-backup=0
balloon: 0
bios: ovmf
boot: order=scsi0;net0;scsi1;ide2
cores: 8
cpu: x86-64-v2-AES
efidisk0: SSD_VOL1:100/vm-100-disk-0.qcow2,efitype=4m,pre-enrolled-keys=1,size=528K
ide2: SAS_VOL1:iso/virtio-win-0.1.248.iso,media=cdrom,size=715188K
machine: pc-i440fx-8.1
memory: 16384
meta: creation-qemu=8.1.5,ctime=1715560288
name: Redacted
net0: virtio=BC:24:11:53:CD:86,bridge=vmbr0,tag=2
numa: 0
ostype: win11
scsi0: SSD_VOL1:100/vm-100-disk-1.qcow2,backup=0,iothread=1,size=80G
scsi1: SAS_VOL1:100/vm-100-disk-0.qcow2,backup=0,iothread=1,size=6T
scsihw: virtio-scsi-single
smbios1: uuid=fbb8944e-1cca-4e67-ac06-bbed39d2e377,manufacturer=SFA=,product=UHJvTGlhbnQgTUwzNTAgR2VuMTE=,family=SFA=,base64=1
snapstate: prepare
snaptime: 1715657000
sockets: 1
vmgenid: f5b375f6-f477-47b6-b6c6-320503848c1b

I have the virtio PCIe driver and the QEMU Guest Agent running correctly,
but the snapshot seems to run quickly through the 80 GB disk on the SSD volume and then times out on the 6 TB disk on the SAS volume.
The error I get after a hung VM console and 20 minutes of waiting is:
snapshotting 'drive-scsi0' (SSD_VOL1:100/vm-100-disk-1.qcow2)
snapshotting 'drive-scsi1' (SAS_VOL1:100/vm-100-disk-0.qcow2)
VM 100 qmp command 'savevm-end' failed - unable to connect to VM 100 qmp socket - timeout after 5989 retries
guest-fsfreeze-thaw problems - VM 100 qmp command 'guest-fsfreeze-thaw' failed - got timeout
snapshot create failed: starting cleanup
TASK ERROR: VM 100 qmp command 'blockdev-snapshot-internal-sync' failed - got timeout
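For anyone comparing their own task logs against this one, here is a minimal sketch, assuming the task log has been saved as plain text, of how to pick out which drive was being snapshotted when the first failure was reported. The function name `drive_before_first_failure` is hypothetical and not part of Proxmox; the sample text is just the log above.

```python
import re

# Matches lines such as:
#   snapshotting 'drive-scsi1' (SAS_VOL1:100/vm-100-disk-0.qcow2)
SNAPSHOT_LINE = re.compile(r"^snapshotting '(?P<drive>[^']+)' \((?P<volume>[^)]+)\)$")


def drive_before_first_failure(log_text):
    """Return (drive, volume) from the last 'snapshotting' line seen before
    the first line containing 'failed'. Returns None if the log has no
    failure line, or no 'snapshotting' line before it.

    Hypothetical helper for reading a saved task log; not a Proxmox API.
    """
    last_seen = None
    for line in log_text.splitlines():
        line = line.strip()
        match = SNAPSHOT_LINE.match(line)
        if match:
            last_seen = (match.group("drive"), match.group("volume"))
        elif "failed" in line:
            return last_seen
    return None


# The task log from this post, copied verbatim.
TASK_LOG = """\
snapshotting 'drive-scsi0' (SSD_VOL1:100/vm-100-disk-1.qcow2)
snapshotting 'drive-scsi1' (SAS_VOL1:100/vm-100-disk-0.qcow2)
VM 100 qmp command 'savevm-end' failed - unable to connect to VM 100 qmp socket - timeout after 5989 retries
guest-fsfreeze-thaw problems - VM 100 qmp command 'guest-fsfreeze-thaw' failed - got timeout
snapshot create failed: starting cleanup
TASK ERROR: VM 100 qmp command 'blockdev-snapshot-internal-sync' failed - got timeout
"""

if __name__ == "__main__":
    print(drive_before_first_failure(TASK_LOG))
```

On this log it points at drive-scsi1, the 6 TB qcow2 image on SAS_VOL1, which matches what I see in the UI. It only reads the log text and says nothing about why the QMP command timed out.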
Server CPU is elevated during the snapshot attempt.
As you can see from the VM config, I turned off freeze/thaw (freeze-fs-on-backup=0) in a failed attempt to mitigate this.
The host is a brand new HPE server with a RAID1 SSD array and a RAID6 SAS array, each with an ext4 partition added as "Directory" storage, and both otherwise perform well under simulated load.
When I detach the larger disk, the snapshot completes, albeit still locking the guest OS while it runs.
Please let me know if you have any thoughts on
a) Why it's locking the guest OS
and
b) How we can get snapshots working on the larger volume?
Thanks in advance