Good afternoon. I have a production HA setup with shared iscsi. I fear I've misconfigured things and am stuck with the poor performance I'm experiencing.
I have 5 HA hosts. 4 active with guests and #5 for voting only.
- Each is dual port 802.3ad LACP bound 10 GIG MTU 9000
- HA configured iSCSI with then 2x LVM's over the top. The spinning and SSD's below on the qNAP.
- Each is dual port gigabit to 2x 5120 HP's for redundant host communication traffic. Each host's gigabit ports are set for "balance-alb".
- 256 Gigs of RAM each with a mix of EPYC procs on the 4 active hosts and a Xeon on the voting member with 16 Gigs of RAM on it.
- Because of the mix of EPYCS ranging from 7542, 7351, 7302, and a 7401 each guest is set to x86-64-v2-AES for the most compatible setup for migration. I tried others and when I would migrate a guest, the guest would reset, so I had to settle on that proc config for the guests.
qNAP TS-h1886XU-RP-R2-D1622-32G iSCSI
- Spinning LUN - SATA
- SSD LUN - SATA
- Dual port 802.3ad LACP bound 10 GIG MTU 9000
- 128 Gigs of RAM with a 4 core, 8 thread Xeon
Netgear XS724EM 24-Port 10 Gig Switch
- Each machine's port is configured for 802.3ad LACP support which the switch's documentation states it is capable of doing.
- Storage traffic and migration traffic only.
- Max MTU 9216
2x HP 5120 48 port gigabit switches for guests and redundant host communication.
- Guest traffic
- Host HA communication
I get poor performance even on the SSD side which only has 6 guests at the moment (4 domain controllers and 2 database apps). Even poorer performance on the spinners. Either I've configured something wrong or the tuning is off OR I've hit my limit and won't get anything more out of it. Throwing cores at the guests doesn't help much. It's the I/O that's eating me up, I think.
I guess my question is can I get more out of this configuration, or do I need to do something different, which just isn't in the budget right now? This is what I had to work with so I made lemonade to get HA and some redundancy.
I've read that multipath is ideal but really only talks about it being a redundant option and not a performant one. I know LACP isn't true 20 gig but I thought I'd be seeing more out of this setup... again... unless I missed something.
Please be kind. I need advice. Thank you.
I have 5 HA hosts. 4 active with guests and #5 for voting only.
- Each is dual port 802.3ad LACP bound 10 GIG MTU 9000
- HA configured iSCSI with then 2x LVM's over the top. The spinning and SSD's below on the qNAP.
- Each is dual port gigabit to 2x 5120 HP's for redundant host communication traffic. Each host's gigabit ports are set for "balance-alb".
- 256 Gigs of RAM each with a mix of EPYC procs on the 4 active hosts and a Xeon on the voting member with 16 Gigs of RAM on it.
- Because of the mix of EPYCS ranging from 7542, 7351, 7302, and a 7401 each guest is set to x86-64-v2-AES for the most compatible setup for migration. I tried others and when I would migrate a guest, the guest would reset, so I had to settle on that proc config for the guests.
qNAP TS-h1886XU-RP-R2-D1622-32G iSCSI
- Spinning LUN - SATA
- SSD LUN - SATA
- Dual port 802.3ad LACP bound 10 GIG MTU 9000
- 128 Gigs of RAM with a 4 core, 8 thread Xeon
Netgear XS724EM 24-Port 10 Gig Switch
- Each machine's port is configured for 802.3ad LACP support which the switch's documentation states it is capable of doing.
- Storage traffic and migration traffic only.
- Max MTU 9216
2x HP 5120 48 port gigabit switches for guests and redundant host communication.
- Guest traffic
- Host HA communication
I get poor performance even on the SSD side which only has 6 guests at the moment (4 domain controllers and 2 database apps). Even poorer performance on the spinners. Either I've configured something wrong or the tuning is off OR I've hit my limit and won't get anything more out of it. Throwing cores at the guests doesn't help much. It's the I/O that's eating me up, I think.
I guess my question is can I get more out of this configuration, or do I need to do something different, which just isn't in the budget right now? This is what I had to work with so I made lemonade to get HA and some redundancy.
I've read that multipath is ideal but really only talks about it being a redundant option and not a performant one. I know LACP isn't true 20 gig but I thought I'd be seeing more out of this setup... again... unless I missed something.
Please be kind. I need advice. Thank you.
Last edited: