Hello,
Has anyone noticed slow network performance on AMD EPYC CPUs? I have now confirmed this on two generations:
- AMD EPYC 9454P (single socket)
- AMD EPYC 7502 (dual socket)
Network configuration:
- vmbr bridge dedicated to private VM traffic
VM config:
- CPU type: host
- 10 cores (single socket)
- 10GB RAM
- Storage: VirtIO SCSI
- VirtIO network
- MTU 1500
VMs are on the same Proxmox node.
With iperf3/iperf between the two VMs, I get about 15 Gbps. If you enable multiqueue on virtio-net, that goes up to close to 50 Gbps.
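In case anyone wants to compare numbers the same way, here is a minimal sketch for pulling the receiver-side throughput out of iperf3's JSON report (`iperf3 -J`). The function name and the sample report are made up for illustration, and it assumes a TCP test, where the report has an `end.sum_received.bits_per_second` field.

```python
import json


def received_gbps(iperf3_json: str) -> float:
    """Return receiver-side throughput in Gbit/s from an `iperf3 -J` TCP report."""
    report = json.loads(iperf3_json)
    # For TCP tests iperf3 summarises all streams under end.sum_received.
    return report["end"]["sum_received"]["bits_per_second"] / 1e9


# Hypothetical, trimmed-down report; a real one has many more fields.
sample_report = json.dumps({"end": {"sum_received": {"bits_per_second": 15e9}}})
print(f"{received_gbps(sample_report):.1f} Gbit/s")
```

On the client VM you would save the report with something like `iperf3 -c <server-ip> -J > result.json` and pass the file contents to the function.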
I set up an NFS server on one of the VMs, mounted it on the other, and copied a 50 GB file; that only reaches about 500 MB/s. But if I upload from a VM that is on another cluster (routed network), I easily see 1 GB/s.
VM-to-VM traffic inside the same Proxmox node is poor, but as soon as the TX end is outside the cluster and node, throughput is good.
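To put the NFS numbers in the same units as the iperf results, here is a quick conversion sketch. It assumes decimal units (1 MB = 10^6 bytes, 1 Gbit = 10^9 bits); the helper name is just for illustration.

```python
def mb_per_s_to_gbps(mb_per_s: float) -> float:
    """Convert MB/s (decimal megabytes) to Gbit/s (decimal gigabits)."""
    return mb_per_s * 8 / 1000


iperf_single_queue_gbps = 15.0            # iperf3 between the two VMs, no multiqueue
nfs_same_node = mb_per_s_to_gbps(500)     # NFS copy, both VMs on the same node
nfs_remote = mb_per_s_to_gbps(1000)       # upload from a VM on another cluster

print(f"NFS same node: {nfs_same_node:.1f} Gbit/s")
print(f"NFS remote:    {nfs_remote:.1f} Gbit/s")
print(f"Same-node NFS uses {nfs_same_node / iperf_single_queue_gbps:.0%} of the single-queue iperf rate")
```

So the same-node NFS copy runs at roughly 4 Gbit/s, well below even the single-queue iperf figure, while the routed upload reaches about 8 Gbit/s.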
Any ideas on this? I tried disabling SMT and saw slightly better performance. I'm wondering if there are any other best practices for AMD CPUs like these.
I have never seen anything like this on Intel Xeon Scalable CPUs. I have also easily seen 100G iperf results between VMs on the same node with Intel CPUs.