Hi all, we're having trouble backing up Containers on Ceph storage.
Workflow as follows:
The only way of fixing it (so far) is to:
"can't unmap rbd device /dev/rbd/ha-container-32-pool/vm-538-disk-0@vzdump: rbd: sysfs write failed"
We've seen this before when running a regular VM on KRBD storage, the disk isn't released on migration or backup, but I thought Containers needed KRBD to work properly?. To me it looks like KRBD isn't unmapping the image during the backup operation.
Has anyone seen this before? any ideas?
Thanks,
Stuart.
Workflow as follows:
- Container (LXC) on Proxmox 5.3-5 cluster
- Dedicated Ceph 3/2 replicated pool for storage, with KRBD enabled
- The RBD running the Root Disk is mounted on the host as /dev/rbd3
- Backup container via Proxmox GUI or scheduled job:
- Storage: NFS server
- Mode: Snapshot
- Compression: LZO
The only way of fixing it (so far) is to:
- Shut down the Container internally via the console (not the GUI)
- run "rbd showmapped | grep 538" on the host to get the mapped disks
- run "rbd unmap -o force /dev/rbd3" and "rbd unmap -o force /dev/rbd4" against the mounted disks to unmount them
- run "pct unlock 538" to relase the snapshot-delete lock
- Delete the snapshot via the Proxmox GUI
- Restart the Container
"can't unmap rbd device /dev/rbd/ha-container-32-pool/vm-538-disk-0@vzdump: rbd: sysfs write failed"
We've seen this before when running a regular VM on KRBD storage, the disk isn't released on migration or backup, but I thought Containers needed KRBD to work properly?. To me it looks like KRBD isn't unmapping the image during the backup operation.
Has anyone seen this before? any ideas?
Thanks,
Stuart.
Code:
Virtual Environment 5.3-5
Container 538 (ct-XXXXX.co.uk) on node 'vms603'
Logs
()
INFO: starting new backup job: vzdump 538 --remove 0 --mode snapshot --compress lzo --mailto XXXXX.XXXXX@hahosting.com --storage XXXXX.XX.hahosting.net --node vms603
INFO: Starting Backup of VM 538 (lxc)
INFO: status = running
INFO: CT Name: ct-XXXXX.co.uk
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: create storage snapshot 'vzdump'
2019-03-14 13:39:32.159154 7fa9119f3100 -1 did not load config file, using default settings.
/dev/rbd4
2019-03-14 13:39:33.318460 7f0cb0552100 -1 did not load config file, using default settings.
INFO: creating archive '/mnt/pve/XXXXX.XX.hahosting.net/dump/vzdump-lxc-538-2019_03_14-13_39_30.tar.lzo'
INFO: Total bytes written: 991344640 (946MiB, 18MiB/s)
INFO: archive file size: 466MB
INFO: remove vzdump snapshot
2019-03-14 13:40:30.449460 7fe042842100 -1 did not load config file, using default settings.
rbd: sysfs write failed
can't unmap rbd device /dev/rbd/ha-container-32-pool/vm-538-disk-0@vzdump: rbd: sysfs write failed
INFO: Finished Backup of VM 538 (00:01:00)
INFO: Backup job finished successfully
TASK OK