D
Deleted member 55253
Guest
Hello,
I have two Dell servers, both identical hardware.
pve1 at 192.168.1.11, pve2 at 192.168.1.12
One keeps crashing every 30 minutes to 6 hours, so no recurring task is causing this as far as I can tell.
syslog and kern.log don't tell me anything useful, though I do see two common items before each crash:
Syslog, just before the crash:
One crash:
An earlier crash:
The two repeating items i see are corosync and items dealing with file-replication_cfg, though I don't know what to make of it or where to begin fixing it.
pveversion reports:
pvecm status:
Any help would be appreciated.
Thanks!
I have two Dell servers, both identical hardware.
pve1 at 192.168.1.11, pve2 at 192.168.1.12
One keeps crashing every 30 minutes to 6 hours, so no recurring task is causing this as far as I can tell.
syslog and kern.log don't tell me anything useful, though I do see two common items before each crash:
Syslog, just before the crash:
One crash:
Code:
Nov 26 09:17:19 pve2 corosync[1793]: notice [TOTEM ] A new membership (192.168.1.12:1520) was formed. Members
Nov 26 09:17:19 pve2 corosync[1793]: warning [CPG ] downlist left_list: 0 received
Nov 26 09:17:19 pve2 corosync[1793]: notice [QUORUM] Members[1]: 2
Nov 26 09:17:19 pve2 corosync[1793]: notice [MAIN ] Completed service synchronization, ready to provide service.
Nov 26 09:17:19 pve2 corosync[1793]: [TOTEM ] A new membership (192.168.1.12:1520) was formed. Members
Nov 26 09:17:19 pve2 corosync[1793]: [CPG ] downlist left_list: 0 received
Nov 26 09:17:19 pve2 corosync[1793]: [QUORUM] Members[1]: 2
Nov 26 09:17:19 pve2 corosync[1793]: [MAIN ] Completed service synchronization, ready to provide service.
An earlier crash:
Code:
Nov 26 09:24:05 pve2 corosync[1813]: notice [TOTEM ] A new membership (192.168.1.12:1620) was formed. Members
Nov 26 09:24:05 pve2 corosync[1813]: [TOTEM ] A new membership (192.168.1.12:1620) was formed. Members
Nov 26 09:24:05 pve2 corosync[1813]: warning [CPG ] downlist left_list: 0 received
Nov 26 09:24:05 pve2 corosync[1813]: notice [QUORUM] Members[1]: 2
Nov 26 09:24:05 pve2 corosync[1813]: notice [MAIN ] Completed service synchronization, ready to provide service.
Nov 26 09:24:05 pve2 corosync[1813]: [CPG ] downlist left_list: 0 received
Nov 26 09:24:05 pve2 corosync[1813]: [QUORUM] Members[1]: 2
Nov 26 09:24:05 pve2 corosync[1813]: [MAIN ] Completed service synchronization, ready to provide service.
Nov 26 09:24:06 pve2 pvesr[6181]: trying to acquire cfs lock 'file-replication_cfg' ...
Nov 26 09:24:06 pve2 corosync[1813]: notice [TOTEM ] A new membership (192.168.1.12:1624) was formed. Members
Nov 26 09:24:06 pve2 corosync[1813]: [TOTEM ] A new membership (192.168.1.12:1624) was formed. Members
Nov 26 09:24:06 pve2 corosync[1813]: warning [CPG ] downlist left_list: 0 received
Nov 26 09:24:06 pve2 corosync[1813]: notice [QUORUM] Members[1]: 2
Nov 26 09:24:06 pve2 corosync[1813]: notice [MAIN ] Completed service synchrNov 26 09:28:17 pve2 systemd-modules-load[453]: Inserted module 'iscsi_tcp'
The two repeating items i see are corosync and items dealing with file-replication_cfg, though I don't know what to make of it or where to begin fixing it.
pveversion reports:
Code:
root@pve2:~# pveversion -v
proxmox-ve: 5.2-2 (running kernel: 4.15.18-8-pve)
pve-manager: 5.2-10 (running version: 5.2-10/6f892b40)
pve-kernel-4.15: 5.2-11
pve-kernel-4.15.18-8-pve: 4.15.18-28
pve-kernel-4.15.17-1-pve: 4.15.17-9
corosync: 2.4.2-pve5
criu: 2.11.1-1~bpo90
glusterfs-client: 3.8.8-1
ksm-control-daemon: 1.2-2
libjs-extjs: 6.0.1-2
libpve-access-control: 5.0-8
libpve-apiclient-perl: 2.0-5
libpve-common-perl: 5.0-41
libpve-guest-common-perl: 2.0-18
libpve-http-server-perl: 2.0-11
libpve-storage-perl: 5.0-30
libqb0: 1.0.1-1
lvm2: 2.02.168-pve6
lxc-pve: 3.0.2+pve1-3
lxcfs: 3.0.2-2
novnc-pve: 1.0.0-2
proxmox-widget-toolkit: 1.0-20
pve-cluster: 5.0-30
pve-container: 2.0-29
pve-docs: 5.2-9
pve-firewall: 3.0-14
pve-firmware: 2.0-6
pve-ha-manager: 2.0-5
pve-i18n: 1.0-6
pve-libspice-server1: 0.14.1-1
pve-qemu-kvm: 2.12.1-1
pve-xtermjs: 1.0-5
qemu-server: 5.0-38
smartmontools: 6.5+svn4324-1
spiceterm: 3.0-5
vncterm: 1.5-3
zfsutils-linux: 0.7.11-pve2~bpo1
pvecm status:
Code:
root@pve2:/etc/pve/ha# pvecm status
Quorum information
------------------
Date: Mon Nov 26 10:12:31 2018
Quorum provider: corosync_votequorum
Nodes: 2
Node ID: 0x00000002
Ring ID: 1/2020
Quorate: Yes
Votequorum information
----------------------
Expected votes: 2
Highest expected: 2
Total votes: 2
Quorum: 2
Flags: Quorate
Membership information
----------------------
Nodeid Votes Name
0x00000001 1 192.168.1.11
0x00000002 1 192.168.1.12 (local)
Any help would be appreciated.
Thanks!