We switched from a HP hardware raid (HP P420i) system to our new Dell R730xd system with 128GB RAM and ZFS. The integrated PERC H730 Mini controller has been configured for HBA mode. There are 30 VMs running (KVM with zvol plugin, thin-provisioning), with assigned memory of 33GB (sum). Most VMs are using the IDE-Bus (no Virtio) because we migrated them from physical machines some years ago.
When the Dell system is freshly booted, it behaves really fine - it needs some time to load the data from the Toshiba SATA harddisks, but then it's stable. ARC is limited to 64GB and the system begins slowly to fill this until the limit is reached. After some hours/days with disk activity (e.g. rsync backup jobs at night) the first I/O problems are beginning - some VMs behave "sluggy" and unresponsive, others are working. Today, one of the bigger VMs showed very high I/O wait (> 80-90%) and was nearly unresponsive. After a reboot of the Dell host, performance is back again.
Unfortunatly I cannot find the bottleneck in the system - in my opinion, there could be 3 main reasons:
1) The H730 controller cannot deliver the performance which is needed
2) The SATA drives are too slow
3) There's not enough RAM for the VMs
Some info about the system:
zpool status
pveperf /tank
zpool iostat -v 2 300
vmstat 1
uptime + arc-stats
/etc/modprobe.d/zfs.conf
free
When the Dell system is freshly booted, it behaves really fine - it needs some time to load the data from the Toshiba SATA harddisks, but then it's stable. ARC is limited to 64GB and the system begins slowly to fill this until the limit is reached. After some hours/days with disk activity (e.g. rsync backup jobs at night) the first I/O problems are beginning - some VMs behave "sluggy" and unresponsive, others are working. Today, one of the bigger VMs showed very high I/O wait (> 80-90%) and was nearly unresponsive. After a reboot of the Dell host, performance is back again.
Unfortunatly I cannot find the bottleneck in the system - in my opinion, there could be 3 main reasons:
1) The H730 controller cannot deliver the performance which is needed
2) The SATA drives are too slow
3) There's not enough RAM for the VMs
Some info about the system:
zpool status
Code:
pool: rpool
state: ONLINE
scan: none requested
config:
NAME STATE READ WRITE CKSUM
rpool ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
sda2 ONLINE 0 0 0
sdb2 ONLINE 0 0 0
errors: No known data errors
pool: tank
state: ONLINE
scan: none requested
config:
NAME STATE READ WRITE CKSUM
tank ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
ata-TOSHIBA_DT01ACA200_13S8D22AS ONLINE 0 0 0
ata-TOSHIBA_DT01ACA200_43N2AXSGS ONLINE 0 0 0
mirror-1 ONLINE 0 0 0
ata-TOSHIBA_DT01ACA200_43N2EV5GS ONLINE 0 0 0
ata-TOSHIBA_DT01ACA200_43N2JP8GS ONLINE 0 0 0
mirror-2 ONLINE 0 0 0
ata-TOSHIBA_DT01ACA200_43O0ZHLAS ONLINE 0 0 0
ata-TOSHIBA_DT01ACA200_43O2H8VGS ONLINE 0 0 0
mirror-3 ONLINE 0 0 0
ata-TOSHIBA_DT01ACA200_43O2K1GGS ONLINE 0 0 0
ata-TOSHIBA_DT01ACA200_53VE7WTGS ONLINE 0 0 0
logs
mirror-4 ONLINE 0 0 0
ata-SAMSUNG_MZ7KM120HAFD-00005_S2HPNX0H500037-part1 ONLINE 0 0 0
ata-SAMSUNG_MZ7KM120HAFD-00005_S2HPNX0H500035-part1 ONLINE 0 0 0
errors: No known data errors
pveperf /tank
Code:
CPU BOGOMIPS: 115208.52
REGEX/SECOND: 2461409
HD SIZE: 5874.93 GB (tank)
FSYNCS/SECOND: 5621.81
DNS EXT: 32.33 ms
DNS INT: 97.47 ms
zpool iostat -v 2 300
Code:
capacity operations bandwidth
pool alloc free read write read write
------------------------------------------------------- ----- ----- ----- ----- ----- -----
rpool 1.97G 109G 0 61 0 237K
mirror 1.97G 109G 0 61 0 237K
sda2 - - 0 23 0 266K
sdb2 - - 0 23 0 266K
------------------------------------------------------- ----- ----- ----- ----- ----- -----
tank 1.29T 5.96T 33 2.56K 118K 14.5M
mirror 329G 1.49T 9 537 23.2K 2.93M
ata-TOSHIBA_DT01ACA200_13S8D22AS - - 3 40 20.0K 2.94M
ata-TOSHIBA_DT01ACA200_43N2AXSGS - - 5 42 24.0K 2.94M
mirror 329G 1.49T 6 821 13.5K 4.17M
ata-TOSHIBA_DT01ACA200_43N2EV5GS - - 2 51 14.0K 4.19M
ata-TOSHIBA_DT01ACA200_43N2JP8GS - - 3 54 14.0K 4.19M
mirror 329G 1.49T 4 502 22.0K 2.88M
ata-TOSHIBA_DT01ACA200_43O0ZHLAS - - 2 25 16.0K 2.90M
ata-TOSHIBA_DT01ACA200_43O2H8VGS - - 1 26 10.0K 2.90M
mirror 329G 1.49T 13 752 59.2K 3.83M
ata-TOSHIBA_DT01ACA200_43O2K1GGS - - 5 50 56.0K 3.85M
ata-TOSHIBA_DT01ACA200_53VE7WTGS - - 4 48 24.0K 3.85M
logs - - - - - -
mirror 4.97M 15.9G 0 5 0 674K
ata-SAMSUNG_MZ7KM120HAFD-00005_S2HPNX0H500037-part1 - - 0 5 0 674K
ata-SAMSUNG_MZ7KM120HAFD-00005_S2HPNX0H500035-part1 - - 0 5 0 674K
------------------------------------------------------- ----- ----- ----- ----- ----- -----
vmstat 1
Code:
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
r b swpd free buff cache si so bi bo in cs us sy id wa st
4 1 0 64548264 18282908 100120 0 0 584 633 82 643 8 2 81 9 0
2 0 0 64549680 18283156 100216 0 0 304 596 34982 74746 9 1 89 1 0
3 0 0 64549676 18283648 100016 0 0 260 60 34683 75484 9 1 89 1 0
2 0 0 64544416 18284772 99816 0 0 1792 65060 35431 78989 10 3 86 2 0
4 0 0 64544316 18285244 100224 0 0 1812 2080 34002 74553 10 1 87 1 0
4 0 0 64540464 18285784 100052 0 0 3976 64 34523 74093 11 1 87 1 0
4 0 0 64539544 18286044 100328 0 0 444 220 35031 73226 14 2 84 1 0
3 2 0 64537408 18286492 100116 0 0 2568 392 34367 73940 11 2 86 1 0
3 0 0 64536724 18286848 100220 0 0 576 21924 34567 77946 11 2 86 1 0
2 0 0 64535928 18287112 100352 0 0 828 2732 34894 75539 11 1 87 1 0
2 0 0 64533608 18287368 100260 0 0 96 1112 34360 74720 8 1 90 0 0
4 0 0 64533348 18287576 100316 0 0 176 1592 33807 72820 9 1 89 1 0
2 1 0 64532112 18288024 100204 0 0 1132 108 33895 74384 10 1 88 1 0
2 0 0 64512964 18293328 100008 0 0 16332 18192 36548 84743 9 2 88 1 0
2 1 0 64507796 18295756 99480 0 0 5444 4576 34483 77125 10 1 87 2 0
uptime + arc-stats
Code:
18:50:27 up 2:58, 2 users, load average: 1.97, 2.00, 1.95
time read miss miss% dmis dm% pmis pm% mmis mm% arcsz c
18:50:31 3 0 0 0 0 0 0 0 0 45G 64G
18:50:32 28 7 25 5 19 2 100 2 11 45G 64G
18:50:33 443 67 15 62 14 5 50 3 60 45G 64G
18:50:34 2.6K 47 1 13 0 34 64 5 22 45G 64G
18:50:35 4.5K 15 0 15 0 0 0 0 0 45G 64G
18:50:36 1.0K 290 28 33 4 257 71 1 4 45G 64G
18:50:37 694 169 24 11 2 158 49 2 0 45G 64G
18:50:38 90 18 20 11 13 7 100 2 100 45G 64G
18:50:39 3 1 33 1 33 0 0 0 0 45G 64G
18:50:40 94 7 7 7 7 0 0 0 0 45G 64G
/etc/modprobe.d/zfs.conf
Code:
options zfs zfs_arc_max=68719476736
free
Code:
total used free shared buffers cached
Mem: 131915736 94668292 37247444 55156 23380992 100564
-/+ buffers/cache: 71186736 60729000
Swap: 8388604 0 8388604