D
deejross
Guest
My node has an uptime of 28 days now and I just noticed that all of my container backups have been failing for the last couple of days. I am running version 2.0-30 and this is what my backup log says:
I Googled a few of those lines, but the "maximum number of semaphores reached" INFO line is what lead me to a temporary solution. I increased the number of semaphores from 128 to 256, and was able to manually run
From my Google searching, it appears there may be an issue with lvcreate/lvremove that doesn't release the arrays it uses. Not that I understand all of this, I am just repeating it.
Does anyone know of a way fix this issue? I had thought about running a cron script that kills semaphores that start with 0x0D4D, but I have no idea what affect that will have on the system. Any ideas? Thanks.
Code:
[COLOR=#000000][FONT=tahoma]INFO: starting new backup job: vzdump --quiet 1 --mode snapshot --compress gzip --maxfiles 2 --storage mac --node cloud --all 1[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: filesystem type on dumpdir is 'fuse.sshfs' -using /var/tmp/vzdumptmp740285 for temporary files[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Starting Backup of VM 100 (openvz)[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: CTID 100 exist mounted running[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: status = running[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: backup mode: snapshot[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: ionice priority: 7[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: trying to remove stale snapshot '/dev/storage-group/vzsnap-cloud-0'[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: umount: /mnt/vzsnap0: not mounted[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'umount /mnt/vzsnap0' failed: exit code 1[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Limit for the maximum number of semaphores reached. You can check and set the limits in /proc/sys/kernel/sem.[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Unable to deactivate storage--group-vzsnap--cloud--0 (253:4)[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Unable to deactivate logical volume "vzsnap-cloud-0"[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'lvremove -f /dev/storage-group/vzsnap-cloud-0' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: creating lvm snapshot of /dev/mapper/storage--group-storage ('/dev/storage-group/vzsnap-cloud-0')[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Logical volume "vzsnap-cloud-0" already exists in volume group "storage-group"[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 8 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 16 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 32 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'lvremove -f /dev/storage-group/vzsnap-cloud-0' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: Backup of VM 100 failed - command 'lvcreate --size 1024M --snapshot --name vzsnap-cloud-0 /dev/storage-group/storage' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: filesystem type on dumpdir is 'fuse.sshfs' -using /var/tmp/vzdumptmp740285 for temporary files[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Starting Backup of VM 101 (openvz)[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: CTID 101 exist mounted running[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: status = running[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: backup mode: snapshot[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: ionice priority: 7[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: trying to remove stale snapshot '/dev/storage-group/vzsnap-cloud-0'[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: umount: /mnt/vzsnap0: not mounted[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'umount /mnt/vzsnap0' failed: exit code 1[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Limit for the maximum number of semaphores reached. You can check and set the limits in /proc/sys/kernel/sem.[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Unable to deactivate storage--group-vzsnap--cloud--0 (253:4)[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Unable to deactivate logical volume "vzsnap-cloud-0"[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'lvremove -f /dev/storage-group/vzsnap-cloud-0' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: creating lvm snapshot of /dev/mapper/storage--group-storage ('/dev/storage-group/vzsnap-cloud-0')[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Logical volume "vzsnap-cloud-0" already exists in volume group "storage-group"[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 8 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 16 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 32 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'lvremove -f /dev/storage-group/vzsnap-cloud-0' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: Backup of VM 101 failed - command 'lvcreate --size 1024M --snapshot --name vzsnap-cloud-0 /dev/storage-group/storage' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: filesystem type on dumpdir is 'fuse.sshfs' -using /var/tmp/vzdumptmp740285 for temporary files[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Starting Backup of VM 102 (openvz)[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: CTID 102 exist mounted running[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: status = running[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: backup mode: snapshot[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: ionice priority: 7[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: trying to remove stale snapshot '/dev/storage-group/vzsnap-cloud-0'[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: umount: /mnt/vzsnap0: not mounted[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'umount /mnt/vzsnap0' failed: exit code 1[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Limit for the maximum number of semaphores reached. You can check and set the limits in /proc/sys/kernel/sem.[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Unable to deactivate storage--group-vzsnap--cloud--0 (253:4)[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Unable to deactivate logical volume "vzsnap-cloud-0"[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'lvremove -f /dev/storage-group/vzsnap-cloud-0' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: creating lvm snapshot of /dev/mapper/storage--group-storage ('/dev/storage-group/vzsnap-cloud-0')[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Logical volume "vzsnap-cloud-0" already exists in volume group "storage-group"[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 8 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 16 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: lvremove failed - trying again in 32 seconds[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: command 'lvremove -f /dev/storage-group/vzsnap-cloud-0' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]ERROR: Backup of VM 102 failed - command 'lvcreate --size 1024M --snapshot --name vzsnap-cloud-0 /dev/storage-group/storage' failed: exit code 5[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]INFO: Backup job finished with errors[/FONT][/COLOR]
[COLOR=#000000][FONT=tahoma]TASK ERROR: job errors[/FONT][/COLOR]
I Googled a few of those lines, but the "maximum number of semaphores reached" INFO line is what lead me to a temporary solution. I increased the number of semaphores from 128 to 256, and was able to manually run
Code:
lvremove -f /dev/storage-group/vzsnap-cloud-0
Does anyone know of a way fix this issue? I had thought about running a cron script that kills semaphores that start with 0x0D4D, but I have no idea what affect that will have on the system. Any ideas? Thanks.