Hello,
I set up a backup of a VM with stop mode. It used to work nicely before the day before yesterday.
Now, I get the following error :
INFO: starting new backup job: vzdump 100 --notes-template '{{guestname}}' --mailnotification failure --quiet 1 --mode stop --prune-backups 'keep-daily=10,keep-monthly=3,keep-weekly=4,keep-yearly=1' --mailto root@xxx.in,root@xxx.fr --node pve --storage PBS_H
So it looks like the VM didn't stop within the 149 seconds and thus, the backup failed.
However, the VM was indeed switched off when I wake up and if I look at /var/log/syslog in the VM, here is what I found :
So it looks like the VM stop at Sep 18 02:00:04 and wake up at Sep 18 07:35:11 when I indeed woke it up.
So from what I found in /var/log/syslog of the VM, the VM took about 4 seconds to switch off, not 149 seconds.
The question is then, why PVE thought the VM was not switched off after 149 seconds? And how can I solve this issue to get nightly working backup again?
I set up a backup of a VM with stop mode. It used to work nicely before the day before yesterday.
Now, I get the following error :
INFO: starting new backup job: vzdump 100 --notes-template '{{guestname}}' --mailnotification failure --quiet 1 --mode stop --prune-backups 'keep-daily=10,keep-monthly=3,keep-weekly=4,keep-yearly=1' --mailto root@xxx.in,root@xxx.fr --node pve --storage PBS_H
Code:
INFO: Starting Backup of VM 100 (qemu)
INFO: Backup started at 2024-09-19 02:00:03
INFO: status = running
INFO: backup mode: stop
INFO: ionice priority: 7
INFO: VM Name: xxx.in
INFO: include disk 'scsi0' 'local-lvm:vm-100-disk-0' 500G
INFO: include disk 'scsi1' 'local-lvm:vm-100-disk-1' 300G
INFO: stopping virtual guest
INFO: creating Proxmox Backup Server archive 'vm/100/2024-09-19T00:00:03Z'
INFO: enabling encryption
INFO: starting kvm to execute backup task
INFO: guest is online again after 149 seconds
ERROR: Backup of VM 100 failed - start failed: command '/usr/bin/kvm -id 100 -name 'xxx.in,debug-threads=on' -no-shutdown -chardev 'socket,id=qmp,path=/var/run/qemu-server/100.qmp,server=on,wait=off' -mon 'chardev=qmp,mode=control' -chardev 'socket,id=qmp-event,path=/var/run/qmeventd.sock,reconnect=5' -mon 'chardev=qmp-event,mode=control' -pidfile /var/run/qemu-server/100.pid -daemonize -smbios 'type=1,uuid=15ae1c18-8964-4824-9c26-5442f669b54e' -smp '3,sockets=1,cores=3,maxcpus=3' -nodefaults -boot 'menu=on,strict=on,reboot-timeout=1000,splash=/usr/share/qemu-server/bootsplash.jpg' -vnc 'unix:/var/run/qemu-server/100.vnc,password=on' -cpu qemu64,+aes,enforce,+kvm_pv_eoi,+kvm_pv_unhalt,+pni,+popcnt,+sse4.1,+sse4.2,+ssse3 -m 8192 -object 'iothread,id=iothread-virtioscsi0' -object 'iothread,id=iothread-virtioscsi1' -device 'pci-bridge,id=pci.1,chassis_nr=1,bus=pci.0,addr=0x1e' -device 'pci-bridge,id=pci.2,chassis_nr=2,bus=pci.0,addr=0x1f' -device 'pci-bridge,id=pci.3,chassis_nr=3,bus=pci.0,addr=0x5' -device 'vmgenid,guid=07fb610a-a824-4655-a698-ef1e404b80fe' -device 'piix3-usb-uhci,id=uhci,bus=pci.0,addr=0x1.0x2' -device 'qemu-xhci,p2=15,p3=15,id=xhci,bus=pci.1,addr=0x1b' -device 'usb-tablet,id=tablet,bus=uhci.0,port=1' -device 'usb-host,bus=xhci.0,port=1,vendorid=0x0cf3,productid=0x7015,id=usb0' -device 'VGA,id=vga,bus=pci.0,addr=0x2' -chardev 'socket,path=/var/run/qemu-server/100.qga,server=on,wait=off,id=qga0' -device 'virtio-serial,id=qga0,bus=pci.0,addr=0x8' -device 'virtserialport,chardev=qga0,name=org.qemu.guest_agent.0' -device 'virtio-balloon-pci,id=balloon0,bus=pci.0,addr=0x3,free-page-reporting=on' -iscsi 'initiator-name=iqn.1993-08.org.debian:01:e193297b7c1' -drive 'if=none,id=drive-ide2,media=cdrom,aio=io_uring' -device 'ide-cd,bus=ide.1,unit=0,drive=drive-ide2,id=ide2,bootindex=100' -device 'virtio-scsi-pci,id=virtioscsi0,bus=pci.3,addr=0x1,iothread=iothread-virtioscsi0' -drive 'file=/dev/pve/vm-100-disk-0,if=none,id=drive-scsi0,discard=on,format=raw,cache=none,aio=io_uring,detect-zeroes=unmap' -device 'scsi-hd,bus=virtioscsi0.0,channel=0,scsi-id=0,lun=0,drive=drive-scsi0,id=scsi0,rotation_rate=1,bootindex=101' -device 'virtio-scsi-pci,id=virtioscsi1,bus=pci.3,addr=0x2,iothread=iothread-virtioscsi1' -drive 'file=/dev/pve/vm-100-disk-1,if=none,id=drive-scsi1,discard=on,format=raw,cache=none,aio=io_uring,detect-zeroes=unmap' -device 'scsi-hd,bus=virtioscsi1.0,channel=0,scsi-id=0,lun=1,drive=drive-scsi1,id=scsi1,rotation_rate=1' -netdev 'type=tap,id=net0,ifname=tap100i0,script=/var/lib/qemu-server/pve-bridge,downscript=/var/lib/qemu-server/pve-bridgedown,vhost=on' -device 'virtio-net-pci,mac=BC:24:11:E0:2B:82,netdev=net0,bus=pci.0,addr=0x12,id=net0,rx_queue_size=1024,tx_queue_size=256,bootindex=102' -machine 'type=pc+pve0' -S' failed: got timeout
INFO: Failed at 2024-09-19 02:02:32
INFO: Backup job finished with errors
INFO: notified via target `<root@xxx.in,root@xxx.fr>`
TASK ERROR: job errors
So it looks like the VM didn't stop within the 149 seconds and thus, the backup failed.
However, the VM was indeed switched off when I wake up and if I look at /var/log/syslog in the VM, here is what I found :
Code:
Sep 18 02:00:01 xxx CRON[28796]: (nextcloud) CMD (/usr/bin/php8.3 --define apc.enable_cli=1 -f /var/www/nextcloud/cron.php)
Sep 18 02:00:04 xxx qemu-ga: info: guest-shutdown called, mode: (null)
Sep 18 02:00:04 xxx systemd[1]: unattended-upgrades.service: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Removed slice system-modprobe.slice.
Sep 18 02:00:04 xxx systemd[1]: Stopped target Graphical Interface.
Sep 18 02:00:04 xxx systemd[1]: Stopped target Multi-User System.
Sep 18 02:00:04 xxx systemd[1]: Stopped target Login Prompts.
Sep 18 02:00:04 xxx systemd[1]: Stopped target Timers.
Sep 18 02:00:04 xxx systemd[1]: anacron.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Trigger anacron every hour.
Sep 18 02:00:04 xxx systemd[1]: apt-daily-upgrade.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Daily apt upgrade and clean activities.
Sep 18 02:00:04 xxx systemd[1]: apt-daily.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Daily apt download activities.
Sep 18 02:00:04 xxx systemd[1]: e2scrub_all.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Periodic ext4 Online Metadata Check for All Filesystems.
Sep 18 02:00:04 xxx systemd[1]: fstrim.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Discard unused blocks once a week.
Sep 18 02:00:04 xxx systemd[1]: logrotate.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Daily rotation of log files.
Sep 18 02:00:04 xxx systemd[1]: man-db.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Daily man-db regeneration.
Sep 18 02:00:04 xxx systemd[1]: phpsessionclean.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Clean PHP session files every 30 mins.
Sep 18 02:00:04 xxx systemd[1]: systemd-tmpfiles-clean.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Clean PHP session files every 30 mins.
Sep 18 02:00:04 xxx systemd[1]: systemd-tmpfiles-clean.timer: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Daily Cleanup of Temporary Directories.
Sep 18 02:00:04 xxx systemd[1]: systemd-rfkill.socket: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Closed Load/Save RF Kill Switch Status /dev/rfkill Watch.
Sep 18 02:00:04 xxx systemd[1]: Unmounting /media/stockage...
Sep 18 02:00:04 xxx systemd[1]: Stopping Deferred execution scheduler...
Sep 18 02:00:04 xxx systemd[1]: Stopping coTURN STUN/TURN Server...
Sep 18 02:00:04 xxx systemd[1]: Stopping Regular background program processing daemon...
Sep 18 02:00:04 xxx systemd[1]: Stopping Dovecot IMAP/POP3 email server...
Sep 18 02:00:04 xxx systemd[1]: Stopping Fail2Ban Service...
Sep 18 02:00:04 xxx systemd[1]: Stopping Getty on tty1...
Sep 18 02:00:04 xxx systemd[1]: Stopping LSB: disk temperature monitoring daemon...
Sep 18 02:00:04 xxx systemd[1]: Stopping LSB: Advanced IEEE 802.11 management daemon...
Sep 18 02:00:04 xxx systemd[1]: lm-sensors.service: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Initialize hardware monitoring sensors.
Sep 18 02:00:04 xxx systemd[1]: Stopping MariaDB 10.5.23 database server...
Sep 18 02:00:04 xxx systemd[1]: Stopping Matrix Signal Bridge...
Sep 18 02:00:04 xxx systemd[1]: Stopping Matrix Telegram Bridge...
Sep 18 02:00:04 xxx mariadbd[977]: 2024-09-18 2:00:04 0 [Note] InnoDB: Starting shutdown...
Sep 18 02:00:04 xxx mariadbd[977]: 2024-09-18 2:00:04 0 [Note] InnoDB: Dumping buffer pool(s) to /var/lib/mysql/ib_buffer_pool
Sep 18 02:00:04 xxx mariadbd[977]: 2024-09-18 2:00:04 0 [Note] InnoDB: Buffer pool(s) dump completed at 240918 2:00:04
Sep 18 02:00:04 xxx ntpd[654]: ntpd exiting on signal 15 (Terminated)
Sep 18 02:00:04 xxx systemd[1]: Stopping Matrix Whatsapp Bridge...
Sep 18 02:00:04 xxx systemd[1]: Stopped OpenVPN service.
Sep 18 02:00:04 xxx systemd[1]: Stopping The PHP 7.4 FastCGI Process Manager...
Sep 18 02:00:04 xxx dovecot: master: Warning: Killed with signal 15 (by pid=28815 uid=0 code=kill)
Sep 18 02:00:04 xxx systemd[1]: Stopping The PHP 8.0 FastCGI Process Manager...
Sep 18 02:00:04 xxx systemd[1]: Stopping The PHP 8.2 FastCGI Process Manager...
Sep 18 02:00:04 xxx python3[578]: [2024-09-18 02:00:04,292] [#033[36mINFO#033[0m@#033[32mmau.init#033[0m] Everything stopped, shutting down
Sep 18 02:00:04 xxx systemd[1]: Stopping The PHP 8.3 FastCGI Process Manager...
Sep 18 02:00:04 xxx systemd[1]: postfix.service: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped Postfix Mail Transport Agent.
Sep 18 02:00:04 xxx systemd[1]: Stopping Postfix Mail Transport Agent (instance -)...
Sep 18 02:00:04 xxx systemd[1]: postgresql.service: Succeeded.
Sep 18 02:00:04 xxx systemd[1]: Stopped PostgreSQL RDBMS.
Sep 18 02:00:04 xxx systemd[1]: Stopping PostgreSQL Cluster 13-main...
Sep 18 02:00:04 xxx systemd[1]: Stopping PostSRSd Daemon...
Sep 18 07:35:11 xxx kernel: [ 0.000000] Linux version 5.10.0-32-amd64 (debian-kernel@lists.debian.org) (gcc-10 (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2) #1 SMP Debian 5.10.223-1 (2024->
Sep 18 07:35:11 xxx kernel: [ 0.000000] Command line: BOOT_IMAGE=/vmlinuz-5.10.0-32-amd64 root=UUID=de565b6a-9830-49c0-8c17-7c2cf81f747a ro quiet
Sep 18 07:35:11 xxx kernel: [ 0.000000] BIOS-provided physical RAM map:
So it looks like the VM stop at Sep 18 02:00:04 and wake up at Sep 18 07:35:11 when I indeed woke it up.
So from what I found in /var/log/syslog of the VM, the VM took about 4 seconds to switch off, not 149 seconds.
The question is then, why PVE thought the VM was not switched off after 149 seconds? And how can I solve this issue to get nightly working backup again?
Last edited: