unknown node status when vm backup job

dadadadadadadad

New Member
Mar 2, 2024
1
0
1
Help, has anyone encountered a similar problem, a cluster of 3 nodes, one random node freezes, with a VM backup task, restarting networking.service helps and the node comes to life, the VM comes to life after unlocking qm, but the network on the VM does not work if you migrate VM to another node, then the network works or remove and add a network card.
I don’t know if this is connected or not, one node changes the ssh key, you have to delete the key on two nodes and add it again
 

Attachments

  • photo_2024-01-05_13-27-08.jpg
    photo_2024-01-05_13-27-08.jpg
    11.7 KB · Views: 7
  • photo_2024-03-08_18-04-34.jpg
    photo_2024-03-08_18-04-34.jpg
    100.4 KB · Views: 7
We have seen a similar behavior on LXC backups.
On multiple clusters/environments (work, side-business and home-lab, local backup storage and remote PBS).
1723539804337.png
What I've seen this ocurrs while processing one container.


State:
  • The backup-job gets stuck.
    image9.png

  • The container that is currently processed by the backup-job dies.
  • All other containers and VMs are still working without any issues.
  • PVE cli tools like `pct` or `qm` are non-responsive
  • We found the only way to recover the node without a reboot is to unlock the LXC and kill it:
    1723539746603.png

Bash:
# try to get the LXC-ID
pct list
## if that does not work - you can find the ID via
ls -l /etc/pve/nodes/<NODE>/lxc/
grep hostname < /etc/pve/nodes/<NODE>/lxc/<ID>.conf


# check it is not locked
pct unlock <ID>

# try to stop the container
pct shutdown <ID>
## OR
pct stop <ID>

# if that does not work - get its process and kill it
ps aux | grep <ID> | grep lxc
# should be: '/usr/bin/lxc-start -F -n <ID>'
kill -9 <PID>

After that the node is back to being online and the PVE tools and services work as expected.
 
Last edited:

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!