This started after a power failure...
I have 3 servers. One is NOT doing this and is alive; the other two cannot be started.
The OSDs appear to be started and running, but the monitor fails to start.
So the Ceph cluster is down until I figure out how to fix this:
Code:
Dec 28 15:11:50 pve2 systemd[1]: ceph-mon@pve2.service: Failed with result 'signal'.
Dec 28 15:11:50 pve2 systemd[1]: ceph-mon@pve2.service: Main process exited, code=killed, status=6/ABRT
Dec 28 15:11:50 pve2 ceph-mon[856941]: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Dec 28 15:11:50 pve2 ceph-mon[856941]: 17: _start()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 16: __libc_start_main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 15: /lib/x86_64-linux-gnu/libc.so.6(+0x271ca) [0x7f7feac461ca]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 14: main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 13: (Monitor::preinit()+0x97a) [0x563038d1697a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 12: (Monitor::refresh_from_paxos(bool*)+0x163) [0x563038ce8a23]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 11: (LogMonitor::update_from_paxos(bool*)+0x53) [0x563038d734a3]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 10: (LogMonitor::log_external_backlog()+0xe29) [0x563038d70849]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 9: (std::__throw_invalid_argument(char const*)+0x40) [0x7f7feaaa0192]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 8: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa90d8) [0x7f7feaaa90d8]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 7: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e85) [0x7f7feaaa8e85]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 6: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e1a) [0x7f7feaaa8e1a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 5: /lib/x86_64-linux-gnu/libstdc++.so.6(+0x9d919) [0x7f7feaa9d919]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 4: abort()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 3: gsignal()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 2: /lib/x86_64-linux-gnu/libc.so.6(+0x8ad3c) [0x7f7feaca9d3c]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 1: /lib/x86_64-linux-gnu/libc.so.6(+0x3bfd0) [0x7f7feac5afd0]
Dec 28 15:11:50 pve2 ceph-mon[856941]: ceph version 17.2.7 (e303afc2e967a4705b40a7e5f76067c10eea0484) quincy (stable)
Dec 28 15:11:50 pve2 ceph-mon[856941]: in thread 7f7feb02ca00 thread_name:ceph-mon
Dec 28 15:11:50 pve2 ceph-mon[856941]: 0> 2023-12-28T15:11:50.992-0700 7f7feb02ca00 -1 *** Caught signal (Aborted) **
Dec 28 15:11:50 pve2 ceph-mon[856941]: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Dec 28 15:11:50 pve2 ceph-mon[856941]: 17: _start()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 16: __libc_start_main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 15: /lib/x86_64-linux-gnu/libc.so.6(+0x271ca) [0x7f7feac461ca]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 14: main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 13: (Monitor::preinit()+0x97a) [0x563038d1697a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 12: (Monitor::refresh_from_paxos(bool*)+0x163) [0x563038ce8a23]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 11: (LogMonitor::update_from_paxos(bool*)+0x53) [0x563038d734a3]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 10: (LogMonitor::log_external_backlog()+0xe29) [0x563038d70849]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 9: (std::__throw_invalid_argument(char const*)+0x40) [0x7f7feaaa0192]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 8: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa90d8) [0x7f7feaaa90d8]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 7: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e85) [0x7f7feaaa8e85]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 6: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e1a) [0x7f7feaaa8e1a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 5: /lib/x86_64-linux-gnu/libstdc++.so.6(+0x9d919) [0x7f7feaa9d919]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 4: abort()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 3: gsignal()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 2: /lib/x86_64-linux-gnu/libc.so.6(+0x8ad3c) [0x7f7feaca9d3c]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 1: /lib/x86_64-linux-gnu/libc.so.6(+0x3bfd0) [0x7f7feac5afd0]
Dec 28 15:11:50 pve2 ceph-mon[856941]: ceph version 17.2.7 (e303afc2e967a4705b40a7e5f76067c10eea0484) quincy (stable)
Dec 28 15:11:50 pve2 ceph-mon[856941]: in thread 7f7feb02ca00 thread_name:ceph-mon
Dec 28 15:11:50 pve2 ceph-mon[856941]: 0> 2023-12-28T15:11:50.992-0700 7f7feb02ca00 -1 *** Caught signal (Aborted) **
Dec 28 15:11:50 pve2 ceph-mon[856941]: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Dec 28 15:11:50 pve2 ceph-mon[856941]: 17: _start()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 16: __libc_start_main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 15: /lib/x86_64-linux-gnu/libc.so.6(+0x271ca) [0x7f7feac461ca]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 14: main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 13: (Monitor::preinit()+0x97a) [0x563038d1697a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 12: (Monitor::refresh_from_paxos(bool*)+0x163) [0x563038ce8a23]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 11: (LogMonitor::update_from_paxos(bool*)+0x53) [0x563038d734a3]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 10: (LogMonitor::log_external_backlog()+0xe29) [0x563038d70849]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 9: (std::__throw_invalid_argument(char const*)+0x40) [0x7f7feaaa0192]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 8: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa90d8) [0x7f7feaaa90d8]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 7: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e85) [0x7f7feaaa8e85]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 6: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e1a) [0x7f7feaaa8e1a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 5: /lib/x86_64-linux-gnu/libstdc++.so.6(+0x9d919) [0x7f7feaa9d919]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 4: abort()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 3: gsignal()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 2: /lib/x86_64-linux-gnu/libc.so.6(+0x8ad3c) [0x7f7feaca9d3c]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 1: /lib/x86_64-linux-gnu/libc.so.6(+0x3bfd0) [0x7f7feac5afd0]
Dec 28 15:11:50 pve2 ceph-mon[856941]: ceph version 17.2.7 (e303afc2e967a4705b40a7e5f76067c10eea0484) quincy (stable)
Dec 28 15:11:50 pve2 ceph-mon[856941]: in thread 7f7feb02ca00 thread_name:ceph-mon
Dec 28 15:11:50 pve2 ceph-mon[856941]: 0> 2023-12-28T15:11:50.992-0700 7f7feb02ca00 -1 *** Caught signal (Aborted) **
Dec 28 15:11:50 pve2 ceph-mon[856941]: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Dec 28 15:11:50 pve2 ceph-mon[856941]: 17: _start()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 16: __libc_start_main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 15: /lib/x86_64-linux-gnu/libc.so.6(+0x271ca) [0x7f7feac461ca]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 14: main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 13: (Monitor::preinit()+0x97a) [0x563038d1697a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 12: (Monitor::refresh_from_paxos(bool*)+0x163) [0x563038ce8a23]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 11: (LogMonitor::update_from_paxos(bool*)+0x53) [0x563038d734a3]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 10: (LogMonitor::log_external_backlog()+0xe29) [0x563038d70849]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 9: (std::__throw_invalid_argument(char const*)+0x40) [0x7f7feaaa0192]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 8: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa90d8) [0x7f7feaaa90d8]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 7: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e85) [0x7f7feaaa8e85]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 6: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e1a) [0x7f7feaaa8e1a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 5: /lib/x86_64-linux-gnu/libstdc++.so.6(+0x9d919) [0x7f7feaa9d919]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 4: abort()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 3: gsignal()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 2: /lib/x86_64-linux-gnu/libc.so.6(+0x8ad3c) [0x7f7feaca9d3c]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 1: /lib/x86_64-linux-gnu/libc.so.6(+0x3bfd0) [0x7f7feac5afd0]
Dec 28 15:11:50 pve2 ceph-mon[856941]: ceph version 17.2.7 (e303afc2e967a4705b40a7e5f76067c10eea0484) quincy (stable)
Dec 28 15:11:50 pve2 ceph-mon[856941]: in thread 7f7feb02ca00 thread_name:ceph-mon
Dec 28 15:11:50 pve2 ceph-mon[856941]: 2023-12-28T15:11:50.992-0700 7f7feb02ca00 -1 *** Caught signal (Aborted) **
Dec 28 15:11:50 pve2 ceph-mon[856941]: 17: _start()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 16: __libc_start_main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 15: /lib/x86_64-linux-gnu/libc.so.6(+0x271ca) [0x7f7feac461ca]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 14: main()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 13: (Monitor::preinit()+0x97a) [0x563038d1697a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 12: (Monitor::refresh_from_paxos(bool*)+0x163) [0x563038ce8a23]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 11: (LogMonitor::update_from_paxos(bool*)+0x53) [0x563038d734a3]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 10: (LogMonitor::log_external_backlog()+0xe29) [0x563038d70849]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 9: (std::__throw_invalid_argument(char const*)+0x40) [0x7f7feaaa0192]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 8: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa90d8) [0x7f7feaaa90d8]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 7: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e85) [0x7f7feaaa8e85]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 6: /lib/x86_64-linux-gnu/libstdc++.so.6(+0xa8e1a) [0x7f7feaaa8e1a]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 5: /lib/x86_64-linux-gnu/libstdc++.so.6(+0x9d919) [0x7f7feaa9d919]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 4: abort()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 3: gsignal()
Dec 28 15:11:50 pve2 ceph-mon[856941]: 2: /lib/x86_64-linux-gnu/libc.so.6(+0x8ad3c) [0x7f7feaca9d3c]
Dec 28 15:11:50 pve2 ceph-mon[856941]: 1: /lib/x86_64-linux-gnu/libc.so.6(+0x3bfd0) [0x7f7feac5afd0]
Dec 28 15:11:50 pve2 ceph-mon[856941]: ceph version 17.2.7 (e303afc2e967a4705b40a7e5f76067c10eea0484) quincy (stable)
Dec 28 15:11:50 pve2 ceph-mon[856941]: in thread 7f7feb02ca00 thread_name:ceph-mon
Dec 28 15:11:50 pve2 ceph-mon[856941]: *** Caught signal (Aborted) **
Dec 28 15:11:50 pve2 ceph-mon[856941]: what(): stoull
Dec 28 15:11:50 pve2 ceph-mon[856941]: terminate called after throwing an instance of 'std::invalid_argument'
Dec 28 15:11:50 pve2 systemd[1]: Started ceph-mon@pve2.service - Ceph cluster monitor daemon.
Dec 28 15:11:50 pve2 systemd[1]: Stopped ceph-mon@pve2.service - Ceph cluster monitor daemon.
Dec 28 15:11:50 pve2 systemd[1]: ceph-mon@pve2.service: Scheduled restart job, restart counter is at 3.
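The backtrace in the log above repeats several times, and the lines that name the actual exception (`what(): stoull`, `std::invalid_argument`) are easy to miss. A small filter along these lines can pull them out. This is only a sketch: the function name `mon_crash_summary` is hypothetical, and the patterns are taken from the log lines shown in this post.

```shell
# Hypothetical helper: keep only the lines that identify the crash cause.
# Reads journal text on stdin and prints the matching lines.
mon_crash_summary() {
    grep -E 'terminate called|what\(\)|Caught signal|Failed with result'
}

# Usage on the affected node (unit name taken from the log above):
#   journalctl -u ceph-mon@pve2.service -b --no-pager | mon_crash_summary
```

Filtering on stdin keeps the helper independent of how the journal is read, so the same function works on a saved log file.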
I have verified that /etc/ceph/ceph.conf and /etc/pve/ceph.conf are identical on all 3 servers.
pve1 runs and is probing.
pve2 throws this and won't start.
pve3 throws this and won't start.
Here is pveversion -v:
Code:
root@pve2:~# pveversion -v
proxmox-ve: 8.1.0 (running kernel: 6.5.11-7-pve)
pve-manager: 8.1.3 (running version: 8.1.3/b46aac3b42da5d15)
proxmox-kernel-helper: 8.1.0
pve-kernel-5.15: 7.4-4
proxmox-kernel-6.5: 6.5.11-7
proxmox-kernel-6.5.11-7-pve-signed: 6.5.11-7
proxmox-kernel-6.2.16-20-pve: 6.2.16-20
proxmox-kernel-6.2: 6.2.16-20
proxmox-kernel-6.2.16-15-pve: 6.2.16-15
pve-kernel-5.15.108-1-pve: 5.15.108-2
pve-kernel-5.15.30-2-pve: 5.15.30-3
ceph: 17.2.7-pve1
ceph-fuse: 17.2.7-pve1
corosync: 3.1.7-pve3
criu: 3.17.1-2
glusterfs-client: 10.3-5
ifupdown2: 3.2.0-1+pmx7
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-4
libknet1: 1.28-pve1
libproxmox-acme-perl: 1.5.0
libproxmox-backup-qemu0: 1.4.1
libproxmox-rs-perl: 0.3.3
libpve-access-control: 8.0.7
libpve-apiclient-perl: 3.3.1
libpve-common-perl: 8.1.0
libpve-guest-common-perl: 5.0.6
libpve-http-server-perl: 5.0.5
libpve-network-perl: 0.9.5
libpve-rs-perl: 0.8.7
libpve-storage-perl: 8.0.5
libspice-server1: 0.15.1-1
lvm2: 2.03.16-2
lxc-pve: 5.0.2-4
lxcfs: 5.0.3-pve4
novnc-pve: 1.4.0-3
proxmox-backup-client: 3.1.2-1
proxmox-backup-file-restore: 3.1.2-1
proxmox-kernel-helper: 8.1.0
proxmox-mail-forward: 0.2.2
proxmox-mini-journalreader: 1.4.0
proxmox-offline-mirror-helper: 0.6.3
proxmox-widget-toolkit: 4.1.3
pve-cluster: 8.0.5
pve-container: 5.0.8
pve-docs: 8.1.3
pve-edk2-firmware: 4.2023.08-2
pve-firewall: 5.0.3
pve-firmware: 3.9-1
pve-ha-manager: 4.0.3
pve-i18n: 3.1.5
pve-qemu-kvm: 8.1.2-6
pve-xtermjs: 5.3.0-3
qemu-server: 8.0.10
smartmontools: 7.3-pve1
spiceterm: 3.3.0
swtpm: 0.8.0+pve1
vncterm: 1.8.0
zfsutils-linux: 2.2.2-pve1
root@pve2:~#
INI:
root@pve1:~# cat /etc/ceph/ceph.conf
[global]
auth_client_required = cephx
auth_cluster_required = cephx
auth_service_required = cephx
cluster_network = 10.10.10.0/24
fsid = b3445d50-80e3-405e-b3cd-a5b7251876e2
mon_allow_pool_delete = true
mon_host = 10.10.10.2 10.10.10.3 10.10.10.4
ms_bind_ipv4 = true
ms_bind_ipv6 = false
osd_pool_default_min_size = 2
osd_pool_default_size = 3
public_network = 10.10.10.0/24
[client]
keyring = /etc/pve/priv/$cluster.$name.keyring
rbd_cache_size = 134217728
[mds]
keyring = /var/lib/ceph/mds/ceph-$id/keyring
[mds.pve1]
host = pve1
mds_standby_for_name = pve
[mds.pve2]
host = pve2
mds_standby_for_name = pve
[mds.pve3]
host = pve3
mds_standby_for_name = pve
[mon.pve1]
host = pve1
public_addr = 10.10.10.2
[mon.pve2]
host = pve2
public_addr = 10.10.10.3
[mon.pve3]
host = pve3
public_addr = 10.10.10.4
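The check that /etc/ceph/ceph.conf and /etc/pve/ceph.conf match can be repeated on each node with something like the sketch below. The function name `conf_same` is hypothetical; it only wraps `cmp -s`, which compares two files byte for byte. On a stock Proxmox install the first path is usually a symlink to the second, so the checksums are the more useful thing to compare between nodes.

```shell
# Hypothetical helper: report whether two config files have identical contents.
conf_same() {
    if cmp -s "$1" "$2"; then
        echo "identical"
    else
        echo "different"
    fi
}

# Usage on each node (paths from the post):
#   conf_same /etc/ceph/ceph.conf /etc/pve/ceph.conf
#   sha256sum /etc/ceph/ceph.conf /etc/pve/ceph.conf   # compare hashes across nodes
```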