Good afternoon,
We recently switched from an old Hyper-V cluster to a ProxMox Cluster.
However, since the migration there have been many complaints about performance.
I myself have no experience with ProxMox so I am 100% newbie.
That's why I came to ask the specialists how I can optimize the cluster or where mistakes have been made.
The cluster itself was set up by an external party that sets up clusters more often, but they let me know that the cluster looks good and recommended that I come here for advice.
There are currently 80 VMs running on the cluster.
These are mainly virtual PFsenses, domain controllers, remote desktop servers and some application servers (MSSQL).
Complaints:
There is a delay on typing in outlook, for example.
Opening PDF files is slower than before (on Hyper-V).
The server does not feel as snappy as before.
etc..
Specifications (per node):
Networking (per node):
-
I know we shouldn't do storage tests directly on the VM, but did it anyway. :-D
Attached is a print screen.
Attached also an export FIO tests on the node (recommended by ChatGPT :-D).
Test 1:
fio --name=cephtest --rw=randrw --rwmixread=70 --bs=4k --size=10G --numjobs=4 --iodepth=16 --ioengine=libaio --runtime=60 --group_reporting --filename=/dev/rbd7
Test 2:
fio --name=cephfile --rw=randrw --rwmixread=70 --bs=4k --size=10G --numjobs=4 --iodepth=16 --ioengine=libaio --runtime=60 --group_reporting --filename=/mnt/testfio/testfile
-
Where should I start looking?
Because the performance should be better since the Hyper-Vs were really old machines with ISCSI storage over 2Gbps network.
I have already researched some things I found on this forum, google & ChatGPT.
But nothing I have tried has offered a solution.
Then I think of deactivating ballooning, setting CPU to “HOST”, QEMU agent, Write Back cache, etc....
Thanks in advance!
Kind Regards,
We recently switched from an old Hyper-V cluster to a ProxMox Cluster.
However, since the migration there have been many complaints about performance.
I myself have no experience with ProxMox so I am 100% newbie.
That's why I came to ask the specialists how I can optimize the cluster or where mistakes have been made.
The cluster itself was set up by an external party that sets up clusters more often, but they let me know that the cluster looks good and recommended that I come here for advice.
There are currently 80 VMs running on the cluster.
These are mainly virtual PFsenses, domain controllers, remote desktop servers and some application servers (MSSQL).
Complaints:
There is a delay on typing in outlook, for example.
Opening PDF files is slower than before (on Hyper-V).
The server does not feel as snappy as before.
etc..
Specifications (per node):
- 3 Node Cluster
- 1U Supermicro AS-1115SV-WTNRT
- 1x AMD EPYC™ Siena 8324P 32C/64T - 2.65Ghz - 3.0Ghz boost - 128MB - 180W - SP6
- 6x Samsung 96GB DDR-5 5600Mhz Registered ECC
- 4x Samsung DC PM9A3, 3.84TB , NVMe PCIe4.0x4, V4 TLC V, U2/2.5 inch Enterprise SSD, medium endurance
- OS Boot: 2x Samsung PM9A1 - 512GB NVME M.2 Gen4 - workstation class - 6.9GB/s read - 5GB/s write - short form factor - low endurance
Networking (per node):
- CoroSync 2x 10Gbps
- Management Network / Live Migrations 2x 25Gbps
- Ceph Storage 2x 25Gbps
- VM Network 2x 25Gbps
- CEPH Storage
- 12x OSD
-
I know we shouldn't do storage tests directly on the VM, but did it anyway. :-D
Attached is a print screen.
Attached also an export FIO tests on the node (recommended by ChatGPT :-D).
Test 1:
fio --name=cephtest --rw=randrw --rwmixread=70 --bs=4k --size=10G --numjobs=4 --iodepth=16 --ioengine=libaio --runtime=60 --group_reporting --filename=/dev/rbd7
Test 2:
fio --name=cephfile --rw=randrw --rwmixread=70 --bs=4k --size=10G --numjobs=4 --iodepth=16 --ioengine=libaio --runtime=60 --group_reporting --filename=/mnt/testfio/testfile
-
Where should I start looking?
Because the performance should be better since the Hyper-Vs were really old machines with ISCSI storage over 2Gbps network.
I have already researched some things I found on this forum, google & ChatGPT.
But nothing I have tried has offered a solution.
Then I think of deactivating ballooning, setting CPU to “HOST”, QEMU agent, Write Back cache, etc....
Thanks in advance!
Kind Regards,
Attachments
-
Test 2.txt2.9 KB · Views: 0
-
Test 1.txt3.2 KB · Views: 0
-
2025-07-18 09_36_36-pve1 - Proxmox Virtual Environment - Work - Microsoft Edge.jpg30.7 KB · Views: 7
-
2025-07-18 09_55_01-pve1 - Proxmox Virtual Environment - Work - Microsoft Edge.jpg17.9 KB · Views: 7
-
2025-07-18 09_50_26-pve1 - Proxmox Virtual Environment - Work - Microsoft Edge.jpg37.7 KB · Views: 7