Hello everyone,
I setup a new Proxmox box and was transferring data from another server (heavy network and disk load) when I lost all network connectivity. I could not ping the host or the guests. After about 30 minutes it came back alive!
In the logs I found this:
This repeats itself for about 30 minutes.
Do I have a ZFS (four SATA drives in ZFS RAID 10)? or RAM issue?
Thank you in advance for your help.
Kevin
I setup a new Proxmox box and was transferring data from another server (heavy network and disk load) when I lost all network connectivity. I could not ping the host or the guests. After about 30 minutes it came back alive!
In the logs I found this:
Code:
[Jul 28 22:55:32 <local hostname> pveproxy[92077]: problem with client 68.2.26.152; Connection reset by peer
Jul 28 22:55:32 <local hostname> pveproxy[92077]: Can't call method "on_drain" on an undefined value at /usr/share/perl5/PVE/HTTPServer.pm line 215.
Jul 28 22:55:33 <local hostname> pve-firewall[3323]: firewall update time (25.399 seconds)
Jul 28 22:55:35 <local hostname> pvestatd[3363]: status update time (28.210 seconds)
Jul 28 23:02:21 <local hostname> pveproxy[92077]: problem with client 68.2.26.152; Connection reset by peer
Jul 28 23:02:21 <local hostname> pveproxy[92077]: Can't call method "on_drain" on an undefined value at /usr/share/perl5/PVE/HTTPServer.pm line 215.
Jul 28 23:02:21 <local hostname> kernel: INFO: task kswapd0:102 blocked for more than 120 seconds.
Jul 28 23:02:21 <local hostname> kernel: Tainted: P --------------- 2.6.32-39-pve #1
Jul 28 23:02:21 <local hostname> kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jul 28 23:02:21 <local hostname> kernel: kswapd0 D ffff88042903cf00 0 102 2 0 0x00000000
Jul 28 23:02:21 <local hostname> kernel: ffff88042903f890 0000000000000046 ffff88042903f840 ffffffffa06c407f
Jul 28 23:02:21 <local hostname> kernel: 0000000000000008 00007f0bd2e8d000 ffff88015e275000 0000000000005000
Jul 28 23:02:21 <local hostname> kernel: 0000000000000000 ffff8801b7420000 0000000103827332 ffff8801b7420000
Jul 28 23:02:21 <local hostname> kernel: Call Trace:
Jul 28 23:02:21 <local hostname> kernel: [<ffffffffa06c407f>] ? kvm_handle_hva+0xef/0x120 [kvm]
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81182600>] ? wait_for_discard+0x0/0x20
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff8118260e>] wait_for_discard+0xe/0x20
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81562940>] __wait_on_bit+0x60/0x90
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81182600>] ? wait_for_discard+0x0/0x20
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff815629ec>] out_of_line_wait_on_bit+0x7c/0x90
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff810a6990>] ? wake_bit_function+0x0/0x50
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff8118572c>] get_swap_page+0x48c/0x660
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81181d07>] add_to_swap+0x17/0x90
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81156292>] shrink_page_list.constprop.21+0x3a2/0x8f0
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81156b53>] shrink_inactive_list+0x373/0xa70
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff8115751e>] shrink_lruvec+0x2ce/0x600
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81157a2d>] shrink_zone+0x1dd/0x410
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81158be2>] balance_pgdat+0xab2/0xc40
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff8108d004>] ? try_to_del_timer_sync+0x84/0xe0
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81158edd>] kswapd+0x16d/0x300
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff8106da62>] ? default_wake_function+0x12/0x20
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff810a6910>] ? autoremove_wake_function+0x0/0x40
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff81158d70>] ? kswapd+0x0/0x300
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff810a6040>] kthread+0x90/0xb0
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff8100c3ca>] child_rip+0xa/0x20
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff810a5fb0>] ? kthread+0x0/0xb0
Jul 28 23:02:21 <local hostname> kernel: [<ffffffff8100c3c0>] ? child_rip+0x0/0x20
Jul 28 23:02:21 <local hostname> kernel: INFO: task ksmd:103 blocked for more than 120 seconds.
Jul 28 23:02:21 <local hostname> kernel: Tainted: P --------------- 2.6.32-39-pve #1
Jul 28 23:02:21 <local hostname> kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jul 28 23:02:21 <local hostname> kernel: ksmd D ffff88042903c400 0 103 2 0 0x00000000
Jul 28 23:02:21 <local hostname> kernel: ffff880429043cf8 0000000000000046 0000000000000000 ffff880429778a00
Jul 28 23:02:21 <local hostname> kernel: ffff880039c9de00 0000000000000000 ffff88042830cd00 000000000001de00
Jul 28 23:02:21 <local hostname> kernel: 000035dd7b6768a8 ffff88042830cd00 0000000103829688 00000000000001a3
Jul 28 23:02:21 <local hostname> kernel: Call Trace:
This repeats itself for about 30 minutes.
Do I have a ZFS (four SATA drives in ZFS RAID 10)? or RAM issue?
Thank you in advance for your help.
Kevin