Hello,
i have an Issue where the creation of a snapshot failed, leading to a corrupting databases on this machine.
We are currently recoverying the database. We also want to know why the snapshot failed, because its the 2nd time within several weeks, that we have corrupted machines in two different environments. Basically we cant trust the snapshot function.
Here the current Cluster:
-------------------------------------------------------------------------
proxmox-ve: 5.2-2 (running kernel: 4.15.18-1-pve)
pve-manager: 5.2-6 (running version: 5.2-6/bcd5f008)
pve-kernel-4.15: 5.2-4
pve-kernel-4.15.18-1-pve: 4.15.18-17
pve-kernel-4.13.8-2-pve: 4.13.8-28
pve-kernel-4.10.17-3-pve: 4.10.17-23
pve-kernel-4.4.83-1-pve: 4.4.83-96
pve-kernel-4.2.8-1-pve: 4.2.8-41
pve-kernel-3.10.0-10-pve: 3.10.0-34
corosync: 2.4.2-pve5
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.0-8
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-37
libpve-guest-common-perl: 2.0-17
libpve-http-server-perl: 2.0-9
libpve-storage-perl: 5.0-24
libqb0: 1.0.1-1
lvm2: 2.02.168-pve6
lxc-pve: 3.0.0-3
lxcfs: 3.0.0-1
novnc-pve: 1.0.0-2
proxmox-widget-toolkit: 1.0-19
pve-cluster: 5.0-29
pve-container: 2.0-24
pve-docs: 5.2-5
pve-firewall: 3.0-13
pve-firmware: 2.0-5
pve-ha-manager: 2.0-5
pve-i18n: 1.0-6
pve-libspice-server1: 0.12.8-3
pve-qemu-kvm: 2.11.2-1
pve-xtermjs: 1.0-5
qemu-server: 5.0-30
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
---------
An here is the log from the failed task:
Formatting '/var/lib/vz/images/106/vm-106-state-RO_user_for_BMD.raw', fmt=raw size=17704157184
VM 106 not running
snapshot create failed: starting cleanup
TASK ERROR: VM 106 qmp command 'snapshot-drive' failed - client closed connection
Does anyone has an idea how to fix/avoid the problem in the future?
i have an Issue where the creation of a snapshot failed, leading to a corrupting databases on this machine.
We are currently recoverying the database. We also want to know why the snapshot failed, because its the 2nd time within several weeks, that we have corrupted machines in two different environments. Basically we cant trust the snapshot function.
Here the current Cluster:
-------------------------------------------------------------------------
proxmox-ve: 5.2-2 (running kernel: 4.15.18-1-pve)
pve-manager: 5.2-6 (running version: 5.2-6/bcd5f008)
pve-kernel-4.15: 5.2-4
pve-kernel-4.15.18-1-pve: 4.15.18-17
pve-kernel-4.13.8-2-pve: 4.13.8-28
pve-kernel-4.10.17-3-pve: 4.10.17-23
pve-kernel-4.4.83-1-pve: 4.4.83-96
pve-kernel-4.2.8-1-pve: 4.2.8-41
pve-kernel-3.10.0-10-pve: 3.10.0-34
corosync: 2.4.2-pve5
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.0-8
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-37
libpve-guest-common-perl: 2.0-17
libpve-http-server-perl: 2.0-9
libpve-storage-perl: 5.0-24
libqb0: 1.0.1-1
lvm2: 2.02.168-pve6
lxc-pve: 3.0.0-3
lxcfs: 3.0.0-1
novnc-pve: 1.0.0-2
proxmox-widget-toolkit: 1.0-19
pve-cluster: 5.0-29
pve-container: 2.0-24
pve-docs: 5.2-5
pve-firewall: 3.0-13
pve-firmware: 2.0-5
pve-ha-manager: 2.0-5
pve-i18n: 1.0-6
pve-libspice-server1: 0.12.8-3
pve-qemu-kvm: 2.11.2-1
pve-xtermjs: 1.0-5
qemu-server: 5.0-30
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
---------
An here is the log from the failed task:
Formatting '/var/lib/vz/images/106/vm-106-state-RO_user_for_BMD.raw', fmt=raw size=17704157184
VM 106 not running
snapshot create failed: starting cleanup
TASK ERROR: VM 106 qmp command 'snapshot-drive' failed - client closed connection
Does anyone has an idea how to fix/avoid the problem in the future?