[SOLVED] Proxmox 5.1.46 LXC cluster error Job for pve-container@101.service failed

Thanks for your testing, Vasu. I'm patiently waiting for this to move to stable, as I am running a mixed system with qemu and lxc. Good to know that the solution is around the corner!
 
Actually, kernel 4.13 has 3 issues:

1. LXC starting issues.
2. kworker using 100% CPU
3. Extra IP in LXC is gone after restart.

Kernel 4.15 solved some of these issues.

For example, issue 1 is solved with kernel 4.15.

I am still testing issue 2 with kernel 4.15 on 25 live nodes; no issues so far.

Issue 3 is still present in kernel 4.15.
 
Could you explain issue 3? I have multiple containers running multiple IPs, though they are on different configured interfaces (host has multiple VLANs configured and shares them as separate network cards to the containers).

Are you assigning multiple IPs from the same network interface / subnet to a container?
 
I am using this method now and it works fine.

Steps for adding an extra IP in a different subnet to an LXC container.

If the extra IP is in a different subnet than the LXC container's main IP:

First add interface eth0 via the GUI on the node for the guest.

Then log in to the guest and create an extra config file for each IP.

1. Log in to Proxmox, select CT XXX > Network > add one NIC with XXX.XXX.XXX.XXX/32 and gateway XXX.XXX.XXX.1.

2. Log in to the CT and run cd /etc/sysconfig/network-scripts

3. Run ls to list the network config files; you will see:

ifcfg-eth0

4. Now we can add the extra IP by creating a new file: vi ifcfg-eth0:0

DEVICE=eth0:0            # alias of eth0
ONBOOT=yes               # bring up at boot
BOOTPROTO=none           # static configuration
IPADDR=10.10.10.10       # the extra IP
NETMASK=255.255.255.255  # /32, a single address
GATEWAY=10.10.10.1       # gateway of the extra IP's subnet

and save it.

5. In the same way, create one file per extra IP, each with its own IP and gateway, and save it (vi ifcfg-eth0:1, vi ifcfg-eth0:2, vi ifcfg-eth0:3, vi ifcfg-eth0:4, and so on).

6. Then run service network restart.

It may take up to 1-2 minutes for the IP to respond to ping, so please wait.
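Steps 4-6 above can also be scripted. A minimal sketch, assuming a CentOS-style guest; the gateway and extra IPs are illustrative, and a scratch directory stands in for /etc/sysconfig/network-scripts here:

```shell
# Generate one alias config file (ifcfg-eth0:N) per extra IP.
# On a real guest, set CONF_DIR=/etc/sysconfig/network-scripts.
CONF_DIR="${CONF_DIR:-./network-scripts}"
GATEWAY="10.10.10.1"                  # gateway of the extra-IP subnet (illustrative)
EXTRA_IPS="10.10.10.10 10.10.10.11"   # the extra IPs to add (illustrative)

mkdir -p "$CONF_DIR"
i=0
for ip in $EXTRA_IPS; do
    cat > "$CONF_DIR/ifcfg-eth0:$i" <<EOF
DEVICE=eth0:$i
ONBOOT=yes
BOOTPROTO=none
IPADDR=$ip
NETMASK=255.255.255.255
GATEWAY=$GATEWAY
EOF
    i=$((i + 1))
done

ls "$CONF_DIR"
# On the real guest, finish with: service network restart
```

The `service network restart` step is left as a comment since it only makes sense inside the container itself.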
 
On reboot LXC loses all extra IPs.

I am adding eth1 with a new IP, but on reboot it is gone.

I'm a couple of upgrades behind (5.1-43) - is this a regression? I've had no issues with multiple IPs on a container, or with kworker CPU utilization. One container has three IPs, and they persist through container or server reboots with no special configuration (single node, no HA).
 
My apologies for the necrobump, but I upgraded to 5.3-6 over the weekend and am experiencing the same issue starting LXC containers again. Is there a known regression and/or a fix?
 
The issue is still present but less frequently encountered in the 4.15.x line. See: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1779678

I saw it as recently as 4.15.18-8-pve and moved to custom 4.18 and 4.19 kernels afterwards. As in the bug report, I haven't seen the issue on these kernels but they both break AppArmor at the version level packaged with Debian Stretch. You'll need to rebuild the apparmor_parser (and libapparmor due to dependency) from source or use third party packages if you want to do the same.
 
