Hi, just writing this in case it helps anyone else.
Suddenly all the lxc containers of a node in a cluster failed to start, the only clues were as follows
or
This happened with all the containers, but not with KVM servers.
The reason was a mistake (my mistake) when modifying /usr/share/lxc/config/common.conf.
I use ansible to set some node parameters and I had wrongly added the same line twice:
lxc.prlimit.nofile = 65536
lxc.prlimit.nofile = 1024
once i remove the duplicated row, all containers were back to normal.
Suddenly all the lxc containers of a node in a cluster failed to start, the only clues were as follows
Code:
root@proxmox-2:/var/log# pct start 129
setup_resource_limits: 2517 Unknown resource
lxc_spawn: 1813 Failed to setup resource limits
__lxc_start: 1999 Failed to spawn container "129"
startup for container '129' failed
or
Code:
root@proxmox-2:~# lxc-start -n 129 -F -l DEBUG -o /tmp/lxc-129.log
Segmentation fault
This happened with all the containers, but not with KVM servers.
The reason was a mistake (my mistake) when modifying /usr/share/lxc/config/common.conf.
I use ansible to set some node parameters and I had wrongly added the same line twice:
lxc.prlimit.nofile = 65536
lxc.prlimit.nofile = 1024
once i remove the duplicated row, all containers were back to normal.