PVE 2.3 vmbr0 not passing UDP traffic correctly

jasone

New Member
Jul 1, 2013
2
0
1
Hello,

I have a 4 node PVE 2.3 cluster. This used to work until last week, so I'm preplexed on why it just stopped working. The only thing that was changed was the addition of another PVE node.

We use PXE booting to do our installs from a kickstart server for CentOS. Quite often the DHCP/BOOTP process will fail to even get an address. When it does make it past getting an address, the pxelinux menu may or may not load, and when it does, it always dies somewhere in TFTP downloading the vmlinux or initrd.img.

Both the DHCP server, and the TFTP server are on different networks, going through a router. I configured a DHCP server locally on the same physical segment that the PVE cluster is on, and DHCP worked 98% of the time. (still a few occasions where it failed, but I experienced that level of failure when I used to do this by hand with qemu/kvm on another system).

What I don't understand though, is why a PHYSICAL server on this network segment has absolutely no problem getting a DHCP address, loading pxelinux, and downloading the installer via TFTP. I tested this by rebooting my 3rd PVE node, and having it boot over the network. Repeatedly doing this worked 100% of the time. Its only when guests running on that same node try to do this is the failure rate nearly 100%.

Any ideas on what is causing this? It seems to be something fundamentally wrong with the bridge. At first I suspected duplicate IP, or arptables having issues, but the IP isn't the issue, and arptables isn't even on these Debian installs.

I have attached a network diagram to help (4th node not pictured, btw). Keep in mind, this works just fine when its a physical server (in this case vm-c1-b3)... It doesn't work when a guest is on vm-c1-b3.

Thanks,
Jason
 

Attachments

  • proxmox-2013-07-01.jpg
    proxmox-2013-07-01.jpg
    61.4 KB · Views: 18
Hi, I have notice the problem with kvm guest machines, and virtio drivers, with bad checksum udp error in /var/log/messages

on proxmox 3, with recent iptables you can do on your proxmox host

iptables -A POSTROUTING -t mangle -p udp --dport bootpc -j CHECKSUM --checksum-fill

for proxmox 2.3, I don't known because iptables is too old. (Or you need to backport it manually from wheezy).

you can try e1000 as network card for your guests as workaround.
 
Last edited:
Hi, I have notice the problem with kvm guest machines, and virtio drivers, with bad checksum udp error in /var/log/messages

on proxmox 3, with recent iptables you can do on your proxmox host

iptables -A POSTROUTING -t mangle -p udp --dport bootpc -j CHECKSUM --checksum-fill

for proxmox 2.3, I don't known because iptables is too old. (Or you need to backport it manually from wheezy).

you can try e1000 as network card for your guests as workaround.

It doesn't seem to matter which network emulation I do. And the odd thing is, is that this used to work just fine, and pretty much stopped working when we added the 4th node to the cluster.
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!