Hello,
I'm experiencing random system lockups with the following error message:
task txg_sync blocked for more than 300 seconds
The last system lockup occurred a couple of hours ago, and as far as I can tell the problem has something to do with disk I/O.
The only thing I have tried so far is tweaking /etc/sysctl.conf (see the bottom of this post).
I have Proxmox 3.4-6/102d4547 on a Dell PowerEdge T110 II equipped with:
- 16GB RAM
- 2x 500GB SATA Disks (Seagate ST500NM0011)
- 2x 250GB SSD Drives (Samsung SSD 850 EVO).
No more disks can be added.
Code:
pool                                                     alloc   free   read  write   read  write
-------------------------------------------------------  -----  -----  -----  -----  -----  -----
rpool                                                     114G   350G    300     66  1.24M   316K
  mirror                                                  114G   350G    300     62  1.17M   204K
    ata-ST500NM0011_Z1M0QP69-part3                           -      -    106     12   931K   340K
    ata-ST500NM0011_Z1M0PYK0-part3                           -      -    105     12   897K   340K
logs                                                         -      -      -      -      -      -
  mirror                                                 1.99M  29.7G      0      3  74.6K   112K
    ata-Samsung_SSD_850_EVO_250GB_S21PNXAG406075Z-part1      -      -      0      3  75.0K   112K
    ata-Samsung_SSD_850_EVO_250GB_S21PNXAG406071R-part1      -      -      0      3  75.0K   112K
cache                                                        -      -      -      -      -      -
  ata-Samsung_SSD_850_EVO_250GB_S21PNXAG406075Z-part2     709M   202G      0      2  2.81K   322K
  ata-Samsung_SSD_850_EVO_250GB_S21PNXAG406071R-part2     778M   202G      0      3  3.18K   354K
-------------------------------------------------------  -----  -----  -----  -----  -----  -----
There are 2 VMs on this system:
- 1x Windows Server 2003 - RAM: 2GB
- 1x Linux CentOS - RAM: 4GB
Here's my /etc/modprobe.d/zfs.conf:
Code:
# MIN: 4Gb
options zfs zfs_arc_min=4294967296
# MAX: 8Gb
options zfs zfs_arc_max=8589934592
# L2ARC tuning
options zfs l2arc_noprefetch=0
options zfs l2arc_write_max=26214400
options zfs l2arc_write_boost=52428800
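To make the raw byte counts in the zfs.conf above easier to read, here is a minimal Python sketch that converts them to 1024-based units. The values are copied from the file; the helper name `to_binary_units` is made up for illustration and is not part of any ZFS tooling.

```python
# Values copied from the /etc/modprobe.d/zfs.conf shown above.
ZFS_OPTIONS = {
    "zfs_arc_min": 4294967296,
    "zfs_arc_max": 8589934592,
    "l2arc_write_max": 26214400,
    "l2arc_write_boost": 52428800,
}


def to_binary_units(n):
    """Format a byte count using 1024-based units (hypothetical helper)."""
    units = ["B", "KiB", "MiB", "GiB", "TiB"]
    value = float(n)
    for unit in units:
        # Stop at the first unit where the value drops below 1024,
        # or at the largest unit we know about.
        if value < 1024 or unit == units[-1]:
            return f"{value:g} {unit}"
        value /= 1024


for name, raw in ZFS_OPTIONS.items():
    print(f"{name} = {raw} bytes = {to_binary_units(raw)}")
```

With these inputs the ARC limits come out as 4 GiB and 8 GiB, and the two L2ARC write limits as 25 MiB and 50 MiB.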
and my /etc/sysctl.conf:
Code:
net.ipv4.tcp_syncookies=1
# -- added after first lockup
vm.swappiness=0
# -- added after second lockup
kernel.panic = 5
kernel.hung_task_panic = 1
kernel.hung_task_timeout_secs = 300
# -- added after third lockup
# see: https://lonesysadmin.net/2013/12/22/better-linux-disk-caching-performance-vm-dirty_ratio/
vm.dirty_ratio = 5
vm.dirty_background_ratio = 10
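One thing in the sysctl settings above that can be checked mechanically is the ordering of the two dirty-page thresholds: vm.dirty_background_ratio is normally set lower than vm.dirty_ratio, since background writeback is meant to start before writers get throttled. A minimal Python sketch, with the two values hard-coded from the file above (the helper name `parse_sysctl` is made up for illustration):

```python
# The two dirty-page settings, copied from the /etc/sysctl.conf shown above.
SYSCTL_TEXT = """\
vm.dirty_ratio = 5
vm.dirty_background_ratio = 10
"""


def parse_sysctl(text):
    """Parse 'key = value' lines, skipping blanks and comments (hypothetical helper)."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip()
    return settings


settings = parse_sysctl(SYSCTL_TEXT)
hard = int(settings["vm.dirty_ratio"])
background = int(settings["vm.dirty_background_ratio"])

if background < hard:
    print("background threshold is below the hard threshold")
else:
    print(
        f"vm.dirty_background_ratio ({background}) is not below "
        f"vm.dirty_ratio ({hard})"
    )
```

For the values posted here the second branch is taken, so the two numbers may have been swapped when they were added.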
And here's the pveperf output:
Code:
CPU BOGOMIPS: 24742.36
REGEX/SECOND: 1377638
HD SIZE: 327.50 GB (rpool/ROOT/pve-1)
FSYNCS/SECOND: 294.61
DNS EXT: 76.29 ms
DNS INT: 72.69 ms
I need your help to prevent this from happening again.
Thanks.