Hi team,
Former ESXi user here evaluating Proxmox on a single host prior to testing clusters and considering it for our datacenter environments as a vCenter replacement.
I'm used to a Snapshot being an instant process with no VM pause witnessed unless you're grabbing the memory also, which I am not.
I last used Proxmox many years ago and am somewhat familiar with it's UI
I have a Win 2022 Server with the below config
I have the virtio PCIe driver and the QEMU Guest Agent running correctly,
But the Snapshot seems to run quickly through my SSD drive 80GB Disk and times out on the SAS Drive (6TB)
The error I get after a hung VM console and 20 minutes of waiting is
snapshotting 'drive-scsi0' (SSD_VOL1:100/vm-100-disk-1.qcow2)
snapshotting 'drive-scsi1' (SAS_VOL1:100/vm-100-disk-0.qcow2)
VM 100 qmp command 'savevm-end' failed - unable to connect to VM 100 qmp socket - timeout after 5989 retries
guest-fsfreeze-thaw problems - VM 100 qmp command 'guest-fsfreeze-thaw' failed - got timeout
snapshot create failed: starting cleanup
TASK ERROR: VM 100 qmp command 'blockdev-snapshot-internal-sync' failed - got timeout
Server CPU is elevated during the snapshot attempt.
As you can see from the vm.conf I turned off freeze/thaw in a failed attempt to mitigate this.
The host is a brand new HPE server with a RAID1 SSD array and a RAID6 SAS array with EXT4 partitions as "Directories" that otherwise perform well under simulated load.
When I detect the larger disk, the snapshot completes, albeit still locking the guest OS while it runs.
Please let me know if you have any thoughts on
a) Why it's locking the guest OS
and
b) How we can get snapshots working on the larger volume?
Thanks in advance
Former ESXi user here evaluating Proxmox on a single host prior to testing clusters and considering it for our datacenter environments as a vCenter replacement.
I'm used to a Snapshot being an instant process with no VM pause witnessed unless you're grabbing the memory also, which I am not.
I last used Proxmox many years ago and am somewhat familiar with it's UI
I have a Win 2022 Server with the below config
agent: 1,freeze-fs-on-backup=0
balloon: 0
bios: ovmf
boot: order=scsi0;net0;scsi1;ide2
cores: 8
cpu: x86-64-v2-AES
efidisk0: SSD_VOL1:100/vm-100-disk-0.qcow2,efitype=4m,pre-enrolled-keys=1,size=528K
ide2: SAS_VOL1:iso/virtio-win-0.1.248.iso,media=cdrom,size=715188K
lock: snapshot
machine: pc-i440fx-8.1
memory: 16384
meta: creation-qemu=8.1.5,ctime=1715560288
name:Redacted
net0: virtio=Redacted,bridge=vmbr0,tag=2
numa: 0
ostype: win11
scsi0: SSD_VOL1:100/vm-100-disk-1.qcow2,backup=0,iothread=1,size=80G
scsi1: SAS_VOL1:100/vm-100-disk-0.qcow2,backup=0,iothread=1,size=6T
scsihw: virtio-scsi-single
smbios1: uuid=fbb8944e-1cca-4e67-ac06-bbed39d2e377,manufacturer=SFA=,product=UHJvTGlhbnQgTUwzNTAgR2VuMTE=,family=SFA=,base64=1
sockets: 1
vmgenid: f5b375f6-f477-47b6-b6c6-320503848c1b
[preupdates]
agent: 1,freeze-fs-on-backup=0
balloon: 0
bios: ovmf
boot: order=scsi0;net0;scsi1;ide2
cores: 8
cpu: x86-64-v2-AES
efidisk0: SSD_VOL1:100/vm-100-disk-0.qcow2,efitype=4m,pre-enrolled-keys=1,size=528K
ide2: SAS_VOL1:iso/virtio-win-0.1.248.iso,media=cdrom,size=715188K
machine: pc-i440fx-8.1
memory: 16384
meta: creation-qemu=8.1.5,ctime=1715560288
name: Redacted
net0: virtio=BC:24:11:53:CD:86,bridge=vmbr0,tag=2
numa: 0
ostype: win11
scsi0: SSD_VOL1:100/vm-100-disk-1.qcow2,backup=0,iothread=1,size=80G
scsi1: SAS_VOL1:100/vm-100-disk-0.qcow2,backup=0,iothread=1,size=6T
scsihw: virtio-scsi-single
smbios1: uuid=fbb8944e-1cca-4e67-ac06-bbed39d2e377,manufacturer=SFA=,product=UHJvTGlhbnQgTUwzNTAgR2VuMTE=,family=SFA=,base64=1
snapstate: prepare
snaptime: 1715657000
sockets: 1
vmgenid: f5b375f6-f477-47b6-b6c6-320503848c1b
I have the virtio PCIe driver and the QEMU Guest Agent running correctly,
But the Snapshot seems to run quickly through my SSD drive 80GB Disk and times out on the SAS Drive (6TB)
The error I get after a hung VM console and 20 minutes of waiting is
snapshotting 'drive-scsi0' (SSD_VOL1:100/vm-100-disk-1.qcow2)
snapshotting 'drive-scsi1' (SAS_VOL1:100/vm-100-disk-0.qcow2)
VM 100 qmp command 'savevm-end' failed - unable to connect to VM 100 qmp socket - timeout after 5989 retries
guest-fsfreeze-thaw problems - VM 100 qmp command 'guest-fsfreeze-thaw' failed - got timeout
snapshot create failed: starting cleanup
TASK ERROR: VM 100 qmp command 'blockdev-snapshot-internal-sync' failed - got timeout
Server CPU is elevated during the snapshot attempt.
As you can see from the vm.conf I turned off freeze/thaw in a failed attempt to mitigate this.
The host is a brand new HPE server with a RAID1 SSD array and a RAID6 SAS array with EXT4 partitions as "Directories" that otherwise perform well under simulated load.
When I detect the larger disk, the snapshot completes, albeit still locking the guest OS while it runs.
Please let me know if you have any thoughts on
a) Why it's locking the guest OS
and
b) How we can get snapshots working on the larger volume?
Thanks in advance