Backup errors

bigfishinnet

I am trying to back up a Nagios installation on my Proxmox 1.9 host. I think the clue is in this line:

101: Nov 14 04:04:30 INFO: rsync warning: some files vanished before they could be transferred (code 24) at main.c(1058) [sender=3.0.3]

Can anyone provide any pointers?

Thanks

S

Detailed backup logs:


vzdump --quiet --suspend --compress --storage VirtualDiscBackups --maxfiles 1 --mailto stephenw@jesuswhydidigoselfemployed.co.uk 101
101: Nov 14 04:00:01 INFO: Starting Backup of VM 101 (openvz)
101: Nov 14 04:00:01 INFO: CTID 101 exist mounted running
101: Nov 14 04:00:01 INFO: status = CTID 101 exist mounted running
101: Nov 14 04:00:01 INFO: backup mode: suspend
101: Nov 14 04:00:01 INFO: ionice priority: 7
101: Nov 14 04:00:01 INFO: starting first sync /var/lib/vz/private/101/ to /var/lib/vz/vbackup/vzdump-openvz-101-2011_11_14-04_00_01.tmp
101: Nov 14 04:04:29 INFO: file has vanished: "/var/lib/vz/private/101/var/log/pnp4nagios/stats/22020723"
101: Nov 14 04:04:30 INFO: Number of files: 37286
101: Nov 14 04:04:30 INFO: Number of files transferred: 31372
101: Nov 14 04:04:30 INFO: Total file size: 1913510707 bytes
101: Nov 14 04:04:30 INFO: Total transferred file size: 1910080938 bytes
101: Nov 14 04:04:30 INFO: Literal data: 1910081045 bytes
101: Nov 14 04:04:30 INFO: Matched data: 0 bytes
101: Nov 14 04:04:30 INFO: File list size: 829683
101: Nov 14 04:04:30 INFO: File list generation time: 0.001 seconds
101: Nov 14 04:04:30 INFO: File list transfer time: 0.000 seconds
101: Nov 14 04:04:30 INFO: Total bytes sent: 1912508040
101: Nov 14 04:04:30 INFO: Total bytes received: 633010
101: Nov 14 04:04:30 INFO: sent 1912508040 bytes received 633010 bytes 7098853.62 bytes/sec
101: Nov 14 04:04:30 INFO: total size is 1913510707 speedup is 1.00
101: Nov 14 04:04:30 INFO: rsync warning: some files vanished before they could be transferred (code 24) at main.c(1058) [sender=3.0.3]
101: Nov 14 04:04:30 INFO: first sync finished (269 seconds)
101: Nov 14 04:04:30 INFO: suspend vm
101: Nov 14 04:04:30 INFO: Setting up checkpoint...
101: Nov 14 04:04:30 INFO: suspend...
101: Nov 14 04:04:30 INFO: Can not suspend container: Device or resource busy
101: Nov 14 04:04:30 INFO: Error: task 3511/778(ntpd) uses posix timers
101: Nov 14 04:04:30 INFO: Checkpointing failed
101: Nov 14 04:04:34 ERROR: Backup of VM 101 failed - command 'vzctl --skiplock chkpnt 101 --suspend' failed with exit code 16
 
101: Nov 14 04:04:30 INFO: rsync warning: some files vanished before they could be transferred (code 24) at main.c(1058)

This is not the problem and is quite normal - you can ignore that info.

INFO: Can not suspend container: Device or resource busy

This is the real problem. Maybe you have an open console ('vzctl enter') while trying to backup?
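
To illustrate the distinction made above, here is a minimal Python sketch that sorts vzdump log lines into ones that can be ignored and ones that actually abort the backup. The function name classify and both pattern lists are my own assumptions, built only from the log lines quoted in this thread; they are not part of vzdump and are certainly not exhaustive.

```python
import re

# Patterns copied from the log lines in this thread (assumed, not exhaustive).
HARMLESS = [
    re.compile(r"file has vanished"),
    re.compile(r"rsync warning: some files vanished .*\(code 24\)"),
]
FATAL = [
    re.compile(r"Can not suspend container"),
    re.compile(r"Checkpointing failed"),
    re.compile(r"ERROR: Backup of VM \d+ failed"),
]


def classify(line):
    """Return 'fatal', 'harmless' or 'other' for a single vzdump log line."""
    # Check fatal patterns first so a real failure is never masked.
    if any(p.search(line) for p in FATAL):
        return "fatal"
    if any(p.search(line) for p in HARMLESS):
        return "harmless"
    return "other"
```

Running each line of the nightly mail through classify and looking only at the 'fatal' ones should point straight at the suspend failure rather than the rsync warning.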
 
This is not the problem and is quite normal - you can ignore that info.

OK.

This is the real problem. Maybe you have an open console ('vzctl enter') while trying to backup?

Hmm, I'm not sure what you mean by this (I'm a novice). No one is using the server while the backup takes place. The server has a number of OpenVPN connections to other Nagios boxes, as it is a distributed monitoring system, so the central server is continuously communicating with the other servers.

So is something, like the VPNs, holding something open so that the backup fails? Can I suspend the VM when the backup starts and then re-enable it once it finishes?

Thanks Dietmar

Stephen
 
Please post the output of 'pveversion -v'
 
Well, 'suspend' is the thing that does not work. But you can use 'stop' or 'snapshot' mode instead.
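
As a sketch of what switching modes could look like, the Python helper below builds the same vzdump command line that appears in the backup logs in this thread, with only the mode flag changed. The helper name vzdump_command is hypothetical, and I am assuming the stop-mode flag is spelled --stop by analogy with the --suspend and --snapshot flags visible in the logs; please check 'man vzdump' on your host before relying on it.

```python
VALID_MODES = ("stop", "suspend", "snapshot")


def vzdump_command(vmid, mode="stop", storage="VirtualDiscBackups", maxfiles=1):
    """Build a vzdump argument list like the one in the logs, with a chosen mode.

    Assumption: each mode is selected with a flag named after it
    (--stop, --suspend, --snapshot).
    """
    if mode not in VALID_MODES:
        raise ValueError("unknown backup mode: %r" % (mode,))
    return [
        "vzdump", "--quiet", "--" + mode, "--compress",
        "--storage", storage,
        "--maxfiles", str(maxfiles),
        str(vmid),
    ]


# Print the command instead of running it, so it can be reviewed first.
print(" ".join(vzdump_command(101)))
```

The list form can be handed to subprocess.call unchanged once the printed command looks right.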

OK, I will try stop mode, as neither suspend nor snapshot is working. Mind you, stop mode might cause a few more alerts! Maybe I can schedule downtime for the main server? Will let you know.

Thanks

S
 
Please post the output of 'pveversion -v'

pve-manager: 1.9-26 (pve-manager/1.9/6567)
running kernel: 2.6.32-4-pve
proxmox-ve-2.6.32: 1.8-33
pve-kernel-2.6.32-4-pve: 2.6.32-33
qemu-server: 1.1-32
pve-firmware: 1.0-14
libpve-storage-perl: 1.0-19
vncterm: 0.9-2
vzctl: 3.0.29-3pve1
vzdump: 1.2-16
vzprocps: 2.0.11-2
vzquota: 3.0.11-1
pve-qemu-kvm: 0.15.0-1
ksm-control-daemon: 1.0-6

Did stop mode work?
 

OK, I have upgraded to the latest stable kernel. I will revisit the backups.

pve-manager: 1.9-26 (pve-manager/1.9/6567)
running kernel: 2.6.32-6-pve
proxmox-ve-2.6.32: 1.9-50
pve-kernel-2.6.32-4-pve: 2.6.32-33
pve-kernel-2.6.32-6-pve: 2.6.32-50
qemu-server: 1.1-32
pve-firmware: 1.0-14
libpve-storage-perl: 1.0-19
vncterm: 0.9-2
vzctl: 3.0.29-3pve1
vzdump: 1.2-16
vzprocps: 2.0.11-2
vzquota: 3.0.11-1
pve-qemu-kvm: 0.15.0-1
ksm-control-daemon: 1.0-6

Thanks

Stephen
 
Hi forum - I am still getting errors even though I have the latest kernel, etc. Please refer to the logs below. Can anyone provide any pointers? Can I exclude a directory inside the vz machine from the backup? It seems the stuff in spool is causing an issue.

Stephen.

vzdump --quiet --snapshot --compress --storage VirtualDiscBackups --maxfiles 1 --mailto stephenw@doesgodexist.com 101
101: Nov 25 04:00:01 INFO: Starting Backup of VM 101 (openvz)
101: Nov 25 04:00:01 INFO: CTID 101 exist mounted running
101: Nov 25 04:00:01 INFO: status = CTID 101 exist mounted running
101: Nov 25 04:00:01 INFO: mode failure - unable to dump into snapshot (use option --dumpdir)
101: Nov 25 04:00:01 INFO: trying 'suspend' mode instead
101: Nov 25 04:00:01 INFO: backup mode: suspend
101: Nov 25 04:00:01 INFO: ionice priority: 7
101: Nov 25 04:00:01 INFO: starting first sync /var/lib/vz/private/101/ to /var/lib/vz/vbackup/vzdump-openvz-101-2011_11_25-04_00_01.tmp
101: Nov 25 04:03:35 INFO: file has vanished: "/var/lib/vz/private/101/var/lib/nagios3/spool/checkresults/c4c37US"
101: Nov 25 04:03:35 INFO: file has vanished: "/var/lib/vz/private/101/var/lib/nagios3/spool/checkresults/c4c37US.ok"
101: Nov 25 04:04:17 INFO: file has vanished: "/var/lib/vz/private/101/var/log/pnp4nagios/stats/22036563"
101: Nov 25 04:04:17 INFO: Number of files: 37308
101: Nov 25 04:04:17 INFO: Number of files transferred: 31394
101: Nov 25 04:04:17 INFO: Total file size: 1918569762 bytes
101: Nov 25 04:04:17 INFO: Total transferred file size: 1915139993 bytes
101: Nov 25 04:04:17 INFO: Literal data: 1915140423 bytes
101: Nov 25 04:04:17 INFO: Matched data: 0 bytes
101: Nov 25 04:04:17 INFO: File list size: 830489
101: Nov 25 04:04:17 INFO: File list generation time: 0.001 seconds
101: Nov 25 04:04:17 INFO: File list transfer time: 0.000 seconds
101: Nov 25 04:04:17 INFO: Total bytes sent: 1917569698
101: Nov 25 04:04:17 INFO: Total bytes received: 633430
101: Nov 25 04:04:17 INFO: sent 1917569698 bytes received 633430 bytes 7478374.77 bytes/sec
101: Nov 25 04:04:17 INFO: total size is 1918569762 speedup is 1.00
101: Nov 25 04:04:17 INFO: rsync warning: some files vanished before they could be transferred (code 24) at main.c(1058) [sender=3.0.3]
101: Nov 25 04:04:17 INFO: first sync finished (256 seconds)
101: Nov 25 04:04:17 INFO: suspend vm
101: Nov 25 04:04:17 INFO: Setting up checkpoint...
101: Nov 25 04:04:17 INFO: suspend...
101: Nov 25 04:04:17 INFO: Can not suspend container: Device or resource busy
101: Nov 25 04:04:17 INFO: Error: task 218352/764(ntpd) uses posix timers
101: Nov 25 04:04:17 INFO: Checkpointing failed
101: Nov 25 04:04:24 ERROR: Backup of VM 101 failed - command 'vzctl --skiplock chkpnt 101 --suspend' failed with exit code 16
 
Nov 25 04:04:17 INFO: Error: task 218352/764(ntpd) uses posix timers
101: Nov 25 04:04:17 INFO: Checkpointing failed

The problem seems to be ntpd - I would disable it inside the container (it runs on the host anyway).
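
If other processes turn out to block the checkpoint as well, something like the following Python sketch could pull their names out of the vzdump log. The function name blocking_processes is hypothetical, and the regular expression is based solely on the "uses posix timers" lines in the logs posted in this thread, so other vzctl versions may word the message differently.

```python
import re

# Matches lines such as "Error: task 218352/764(ntpd) uses posix timers".
TIMER_RE = re.compile(r"task (\d+)/(\d+)\(([^)]+)\) uses posix timers")


def blocking_processes(log_text):
    """Return the sorted, de-duplicated process names reported as using POSIX timers."""
    return sorted({m.group(3) for m in TIMER_RE.finditer(log_text)})


sample = (
    "101: Nov 25 04:04:17 INFO: Can not suspend container: Device or resource busy\n"
    "101: Nov 25 04:04:17 INFO: Error: task 218352/764(ntpd) uses posix timers\n"
    "101: Nov 25 04:04:17 INFO: Checkpointing failed\n"
)
print(blocking_processes(sample))
```

An empty list would suggest the suspend failure has some other cause than POSIX timers.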