Hi, strange behaviour on my installation: HP Proliant G8 microserver with Proxmox 5.1 (updated to pve-test)
Many SEGFAULT error in syslog like:
Dec 20 10:24:55 pve kernel: [100995.340203] ml4[62621]: segfault at 10 ip 00007fc3882f6410 sp 00007fc376c6ac80 error 4 in libc-2.19.so[7fc38824a000+1a1000]
Dec 20 10:25:59 pve kernel: [101059.247552] ml2[63212]: segfault at 10 ip 00007fe43bce6410 sp 00007fe42ae5bc80 error 4 in libc-2.19.so[7fe43bc3a000+1a1000]
Dec 20 10:26:31 pve kernel: [101091.247526] ml4[63456]: segfault at 10 ip 00007faa89487410 sp 00007faa775fac80 error 4 in libc-2.19.so[7faa893db000+1a1000]
Dec 20 10:27:03 pve kernel: [101123.243031] ml2[63799]: segfault at 10 ip 00007f7378434410 sp 00007f73675a9c80 error 4 in libc-2.19.so[7f7378388000+1a1000]
Dec 20 10:27:35 pve kernel: [101155.244188] ml2[64051]: segfault at 10 ip 00007f86abbb5410 sp 00007f869ad2ac80 error 4 in libc-2.19.so[7f86abb09000+1a1000]
Dec 20 10:28:07 pve kernel: [101187.250438] ml2[64288]: segfault at 10 ip 00007f53cfd2d410 sp 00007f53beea2c80 error 4 in libc-2.19.so[7f53cfc81000+1a1000]
Sometimes I lost control of node (SSH stop respondig/webUI can't issue command) and need to hw reboot it.
1) Any idea about cause? Faulty MEMORY_MODULE (I've seen that error is always at same relative position in ip xxxxxxxxxxxxx410)?
2) Any suggestion how to debug?
Thx a lot
Many SEGFAULT error in syslog like:
Dec 20 10:24:55 pve kernel: [100995.340203] ml4[62621]: segfault at 10 ip 00007fc3882f6410 sp 00007fc376c6ac80 error 4 in libc-2.19.so[7fc38824a000+1a1000]
Dec 20 10:25:59 pve kernel: [101059.247552] ml2[63212]: segfault at 10 ip 00007fe43bce6410 sp 00007fe42ae5bc80 error 4 in libc-2.19.so[7fe43bc3a000+1a1000]
Dec 20 10:26:31 pve kernel: [101091.247526] ml4[63456]: segfault at 10 ip 00007faa89487410 sp 00007faa775fac80 error 4 in libc-2.19.so[7faa893db000+1a1000]
Dec 20 10:27:03 pve kernel: [101123.243031] ml2[63799]: segfault at 10 ip 00007f7378434410 sp 00007f73675a9c80 error 4 in libc-2.19.so[7f7378388000+1a1000]
Dec 20 10:27:35 pve kernel: [101155.244188] ml2[64051]: segfault at 10 ip 00007f86abbb5410 sp 00007f869ad2ac80 error 4 in libc-2.19.so[7f86abb09000+1a1000]
Dec 20 10:28:07 pve kernel: [101187.250438] ml2[64288]: segfault at 10 ip 00007f53cfd2d410 sp 00007f53beea2c80 error 4 in libc-2.19.so[7f53cfc81000+1a1000]
Sometimes I lost control of node (SSH stop respondig/webUI can't issue command) and need to hw reboot it.
1) Any idea about cause? Faulty MEMORY_MODULE (I've seen that error is always at same relative position in ip xxxxxxxxxxxxx410)?
2) Any suggestion how to debug?
Thx a lot
pveversion -v
proxmox-ve: 5.1-31 (running kernel: 4.13.13-1-pve)
pve-manager: 5.1-40 (running version: 5.1-40/ea05b379)
pve-kernel-4.13.8-3-pve: 4.13.8-30
pve-kernel-4.13.13-1-pve: 4.13.13-31
libpve-http-server-perl: 2.0-8
lvm2: 2.02.168-pve6
corosync: 2.4.2-pve3
libqb0: 1.0.1-1
pve-cluster: 5.0-19
qemu-server: 5.0-18
pve-firmware: 2.0-3
libpve-common-perl: 5.0-25
libpve-guest-common-perl: 2.0-14
libpve-access-control: 5.0-7
libpve-storage-perl: 5.0-17
pve-libspice-server1: 0.12.8-3
vncterm: 1.5-3
pve-docs: 5.1-12
pve-qemu-kvm: 2.9.1-5
pve-container: 2.0-18
pve-firewall: 3.0-5
pve-ha-manager: 2.0-4
ksm-control-daemon: 1.2-2
glusterfs-client: 3.8.8-1
lxc-pve: 2.1.1-2
lxcfs: 2.0.8-1
criu: 2.11.1-1~bpo90
novnc-pve: 0.6-4
smartmontools: 6.5+svn4324-1
zfsutils-linux: 0.7.3-pve1~bpo9
proxmox-ve: 5.1-31 (running kernel: 4.13.13-1-pve)
pve-manager: 5.1-40 (running version: 5.1-40/ea05b379)
pve-kernel-4.13.8-3-pve: 4.13.8-30
pve-kernel-4.13.13-1-pve: 4.13.13-31
libpve-http-server-perl: 2.0-8
lvm2: 2.02.168-pve6
corosync: 2.4.2-pve3
libqb0: 1.0.1-1
pve-cluster: 5.0-19
qemu-server: 5.0-18
pve-firmware: 2.0-3
libpve-common-perl: 5.0-25
libpve-guest-common-perl: 2.0-14
libpve-access-control: 5.0-7
libpve-storage-perl: 5.0-17
pve-libspice-server1: 0.12.8-3
vncterm: 1.5-3
pve-docs: 5.1-12
pve-qemu-kvm: 2.9.1-5
pve-container: 2.0-18
pve-firewall: 3.0-5
pve-ha-manager: 2.0-4
ksm-control-daemon: 1.2-2
glusterfs-client: 3.8.8-1
lxc-pve: 2.1.1-2
lxcfs: 2.0.8-1
criu: 2.11.1-1~bpo90
novnc-pve: 0.6-4
smartmontools: 6.5+svn4324-1
zfsutils-linux: 0.7.3-pve1~bpo9