Since update on one of my nodes it has become a bit unstable.. well 3 of my nodes - but they have different issues. Some dealing with network problems...
For node 5 - I have ceph monitor that is showing as "undefined" in the list of monitors on any node I connect to the gui with.
Any ideas where to start?
I did restart each node... reboot.. but it still acts like the old version installed... any command line interface commands I can run to get some more intel on what is going on?
For node 5 - I have ceph monitor that is showing as "undefined" in the list of monitors on any node I connect to the gui with.
root@node5:~# ceph mon stats
no valid command found; 10 closest matches:
mon ok-to-stop <ids>...
mon ok-to-add-offline
mon ok-to-rm <id>
mon scrub
mon metadata [<id>]
mon count-metadata <property>
mon versions
mon dump [<epoch:int>]
mon stat
mon getmap [<epoch:int>]
Error EINVAL: invalid command
root@node5:~# ceph osd status
ID HOST USED AVAIL WR OPS WR DATA RD OPS RD DATA STATE
0 stack1 4505M 927G 0 0 0 0 exists,up
1 node2 81.9G 849G 0 0 0 0 exists,up
2 node3 83.6G 847G 0 2457 0 0 exists,up
3 node4 85.4G 846G 0 0 0 0 exists,up
4 0 0 0 0 0 0 autoout,exists
5 0 0 0 0 0 0 exists
6 0 0 0 0 0 0 exists
root@node5:~# ceph osd crush tree --show-shadow
ID CLASS WEIGHT TYPE NAME
-2 hdd 6.36789 root default~hdd
-6 hdd 0.90970 host node2~hdd
1 hdd 0.90970 osd.1
-8 hdd 0.90970 host node3~hdd
2 hdd 0.90970 osd.2
-10 hdd 0.90970 host node4~hdd
3 hdd 0.90970 osd.3
-12 hdd 0.90970 host node5~hdd
4 hdd 0.90970 osd.4
-16 hdd 0.90970 host node6~hdd
6 hdd 0.90970 osd.6
-14 hdd 0.90970 host node8~hdd
5 hdd 0.90970 osd.5
-4 hdd 0.90970 host stack1~hdd
0 hdd 0.90970 osd.0
-1 6.36789 root default
-5 0.90970 host node2
1 hdd 0.90970 osd.1
-7 0.90970 host node3
2 hdd 0.90970 osd.2
-9 0.90970 host node4
3 hdd 0.90970 osd.3
-11 0.90970 host node5
4 hdd 0.90970 osd.4
-15 0.90970 host node6
6 hdd 0.90970 osd.6
-13 0.90970 host node8
5 hdd 0.90970 osd.5
-3 0.90970 host stack1
0 hdd 0.90970 osd.0
root@node5:~# ip a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: enp2s0f0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master vmbr0 state UP group default qlen 1000
link/ether 00:0e:b6:5c:a3:e8 brd ff:ff:ff:ff:ff:ff
3: enp2s0f1: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 00:0e:b6:5c:a3:e9 brd ff:ff:ff:ff:ff:ff
4: eno1: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 00:0e:b6:a8:54:b3 brd ff:ff:ff:ff:ff:ff
5: eno2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 00:0e:b6:a8:54:b4 brd ff:ff:ff:ff:ff:ff
6: enp1s0f2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 00:0e:b6:a8:54:b5 brd ff:ff:ff:ff:ff:ff
7: enp1s0f3: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 00:0e:b6:a8:54:b6 brd ff:ff:ff:ff:ff:ff
8: vmbr0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
link/ether 00:0e:b6:5c:a3:e8 brd ff:ff:ff:ff:ff:ff
inet 10.0.1.5/16 scope global vmbr0
valid_lft forever preferred_lft forever
inet6 fe80::20e:b6ff:fe5c:a3e8/64 scope link
valid_lft forever preferred_lft forever
root@node5:~#
Any ideas where to start?
I did restart each node... reboot.. but it still acts like the old version installed... any command line interface commands I can run to get some more intel on what is going on?
Last edited: