Hello,
One of my proxmox servers became unresponsive after an update of the kernel from 2.6.32.34 to 2.6.32.39.
Basically, it works correctly after reboot for about 24h and then start to be excessively slow (~2-3mn for a simple ls) and
in the logs, I have things like :
I tryed to change the vm.dirty_ratio to 5, desactivated all bios energy saving options but nothing changed.
In the meantime the 2.6.32.40 kernel was released, thus I tryed to upgrade, but it seems that this didn't fix the problem.
I restarted on 2.6.32.34 kernel and it works.
The server is a HP ProLiant BL460c Gen8 (with Intel Xeon and HP Smart Array P220i Controller) running pvemanager 3.4-8
I have another HP server in the same bay (ProLiant BL465c G7 but with AMD Opteron processors and Smart Array P711m Controller) running the same versions of proxmox without any problem.
Any suggestion ?
Thanks.
One of my proxmox servers became unresponsive after an update of the kernel from 2.6.32.34 to 2.6.32.39.
Basically, it works correctly after reboot for about 24h and then start to be excessively slow (~2-3mn for a simple ls) and
in the logs, I have things like :
Code:
Sep 7 06:30:04 lpnhevictor kernel: INFO: task gpg:520457 blocked for more than 120 seconds.
Sep 7 06:30:04 lpnhevictor kernel: Not tainted 2.6.32-40-pve #1
Sep 7 06:30:04 lpnhevictor kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 7 06:30:04 lpnhevictor kernel: gpg D ffff880825a1c7c0 0 520457 520150 0 0x00000000
Sep 7 06:30:04 lpnhevictor kernel: ffff8808258e5d58 0000000000000086 0000000000000000 ffff8808609dde00
Sep 7 06:30:04 lpnhevictor kernel: 0000000000000003 ffff880049c5de00 ffff88083c7fc348 ffff88083c75e810
Sep 7 06:30:04 lpnhevictor kernel: 0001ca380b804814 ffffffff8105825f 000000011e0a7c58 0000000000001a26
Sep 7 06:30:04 lpnhevictor kernel: Call Trace:
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff8105825f>] ? __dequeue_entity+0x2f/0x50
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff815625f4>] schedule_timeout+0x204/0x300
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff81058153>] ? __wake_up+0x53/0x70
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff81561e37>] wait_for_completion+0xd7/0x110
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff8106da50>] ? default_wake_function+0x0/0x20
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff81152f20>] ? lru_add_drain_per_cpu+0x0/0x10
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff810a00d6>] flush_work+0x76/0xc0
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff8109e3f0>] ? wq_barrier_func+0x0/0x20
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff810a0303>] schedule_on_each_cpu+0x103/0x160
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff81152495>] lru_add_drain_all+0x15/0x20
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff81171391>] __mlock+0x41/0x110
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff81171473>] sys_mlock+0x13/0x20
Sep 7 06:30:04 lpnhevictor kernel: [<ffffffff8100b162>] system_call_fastpath+0x16/0x1b
Sep 7 06:31:40 lpnhevictor pmxcfs[3271]: [status] notice: received log
Sep 7 06:32:04 lpnhevictor kernel: INFO: task gpg:520457 blocked for more than 120 seconds.
Sep 7 06:32:04 lpnhevictor kernel: Not tainted 2.6.32-40-pve #1
...
Sep 7 17:00:04 lpnhevictor kernel: INFO: task bash:691295 blocked for more than 120 seconds.
Sep 7 17:00:04 lpnhevictor kernel: Not tainted 2.6.32-40-pve #1
Sep 7 17:00:04 lpnhevictor kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 7 17:00:04 lpnhevictor kernel: bash D ffff880826198880 0 691295 691294 0 0x00000004
Sep 7 17:00:04 lpnhevictor kernel: ffff880838465bf8 0000000000000086 0000000000000000 ffff88083fc22800
Sep 7 17:00:04 lpnhevictor kernel: 0000000000000000 0000000000000010 ffff880838465c28 ffffffff8105ca23
Sep 7 17:00:04 lpnhevictor kernel: 0001ec935695e36d ffff88083c75ec00 00000001204b8046 0000000000000ea1
Sep 7 17:00:04 lpnhevictor kernel: Call Trace:
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff8105ca23>] ? perf_event_task_sched_out+0x33/0x70
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff815625f4>] schedule_timeout+0x204/0x300
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff81058153>] ? __wake_up+0x53/0x70
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff81561e37>] wait_for_completion+0xd7/0x110
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff8106da50>] ? default_wake_function+0x0/0x20
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff810a00d6>] flush_work+0x76/0xc0
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff8109e3f0>] ? wq_barrier_func+0x0/0x20
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff810a0172>] flush_delayed_work+0x52/0x70
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff81344775>] tty_flush_to_ldisc+0x15/0x20
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff81340e4c>] n_tty_read+0x1dc/0x960
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff810a6a3c>] ? remove_wait_queue+0x3c/0x50
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff8106da50>] ? default_wake_function+0x0/0x20
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff8133b692>] tty_read+0x92/0xf0
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff811ae18e>] vfs_read+0x9e/0x190
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff811ae2ca>] sys_read+0x4a/0x90
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff810957c4>] ? sys_rt_sigprocmask+0xa4/0x120
Sep 7 17:00:04 lpnhevictor kernel: [<ffffffff8100b162>] system_call_fastpath+0x16/0x1b
Sep 7 17:01:57 lpnhevictor pmxcfs[3271]: [status] notice: received log
Sep 7 17:02:04 lpnhevictor kernel: INFO: task bash:691295 blocked for more than 120 seconds.
I tryed to change the vm.dirty_ratio to 5, desactivated all bios energy saving options but nothing changed.
In the meantime the 2.6.32.40 kernel was released, thus I tryed to upgrade, but it seems that this didn't fix the problem.
I restarted on 2.6.32.34 kernel and it works.
The server is a HP ProLiant BL460c Gen8 (with Intel Xeon and HP Smart Array P220i Controller) running pvemanager 3.4-8
I have another HP server in the same bay (ProLiant BL465c G7 but with AMD Opteron processors and Smart Array P711m Controller) running the same versions of proxmox without any problem.
Any suggestion ?
Thanks.