Hi! So, I started to install and configure Ceph before I fully knew how I wanted to have it configured (mistake #1).
Now, after following https://forum.proxmox.com/threads/changing-ceph-public-network.33083/ I've made some changes, but they're now really incorrect to the point where "monmaptool --print tmpfile returns "couldn't open tmpfile: (2) No such file or directory".. Attempting to edit in the WebUI returns "(500) got timeout" and starting the monitor shows success, but on webpage reload, shows quorum, no.
I have the IP addresses I now want and that config will not change, but how do I get this back up and running? I have nothing in the pools yet, as this is the first I'm I finally have the separate subnet working...
/etc/pve/ceph.conf and /etc/ceph/ceph.conf show:
Edit: I just did the most recent Proxmox update to all three nodes, and "ceph mon getmap -o tmpfile" now hangs for a while again and eventually returns "error (110) connection timed out. [errno 110] error connecting to the cluster
Now, after following https://forum.proxmox.com/threads/changing-ceph-public-network.33083/ I've made some changes, but they're now really incorrect to the point where "monmaptool --print tmpfile returns "couldn't open tmpfile: (2) No such file or directory".. Attempting to edit in the WebUI returns "(500) got timeout" and starting the monitor shows success, but on webpage reload, shows quorum, no.
I have the IP addresses I now want and that config will not change, but how do I get this back up and running? I have nothing in the pools yet, as this is the first I'm I finally have the separate subnet working...
Code:
root@PVE-1:~# monmaptool --print tmpfile
monmaptool: monmap file tmpfile
epoch 9
fsid 7770d4e7-3305-4ca8-b780-508825023a70
last_changed 2019-05-07 21:52:22.105900
created 2019-04-24 22:05:40.902816
0: 10.9.220.1:6789/0 mon.PVE-1
1: 10.9.220.2:6789/0 mon.PVE-2
2: 10.9.220.49:6789/0 mon.PVE-Witness
3: 172.16.0.1:6789/0 mon.0
4: 172.16.0.2:6789/0 mon.1
5: 172.16.0.254:6789/0 mon.2
/etc/pve/ceph.conf and /etc/ceph/ceph.conf show:
Code:
[global]
auth client required = cephx
auth cluster required = cephx
auth service required = cephx
cluster network = 172.16.0.0/24
fsid = 7770d4e7-3305-4ca8-b780-508825023a70
keyring = /etc/pve/priv/$cluster.$name.keyring
mon allow pool delete = true
osd journal size = 5120
osd pool default min size = 2
osd pool default size = 2
public network = 172.16.0.0/24
[osd]
keyring = /var/lib/ceph/osd/ceph-$id/keyring
[mon.PVE-Witness]
host = PVE-Witness
mon addr = 172.16.0.254:6789
[mon.PVE-1]
host = PVE-1
mon addr = 172.16.0.1:6789
Edit: I just did the most recent Proxmox update to all three nodes, and "ceph mon getmap -o tmpfile" now hangs for a while again and eventually returns "error (110) connection timed out. [errno 110] error connecting to the cluster
Last edited: