Hi everyone,
I have an 8-node cluster that has been running for a while, with Ceph on the same cluster. Yesterday I tried to install and enable the Ceph dashboard. Afterwards the Ceph manager service failed to restart, and all the manager services stopped and could not be started. Here is what I did, step by step. I have also attached the output of "pveversion -v" and my "ceph.conf" file:
1. apt install ceph-mgr-dashboard (I have 3 manager services and installed it on all 3 manager nodes) = ok
2. Then, on node #2 (which has a manager service), I ran: ceph mgr module enable dashboard --force. (I had to use --force, otherwise it would not enable.)
3. On node #2: ceph config-key set mgr/dashboard/server_addr ::
4. On node #2: ceph mgr module disable dashboard
5. On node #2: ceph mgr module enable dashboard
6. On node #2: systemctl restart ceph-mgr@[servername].service. This is where my problem started: the manager service failed to restart, all the other manager services showed as stopped, and I could not restart them. I got an error message and exit code 1.
7. I had to destroy all the Ceph managers and then re-create them in the Proxmox GUI. Ceph itself kept running OK the whole time.
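For reference, steps 1 to 6 can be collected into one script, as a rough sketch. The run helper, the DRY_RUN and MGR_ID variables, and the use of the node's hostname as the manager name are my own placeholders, not anything from Ceph or Proxmox. By default it only prints the commands instead of running them:

```shell
# Recap of steps 1-6. Prints each command by default;
# set DRY_RUN=0 to really execute them (assumption: run as root on a manager node).
DRY_RUN="${DRY_RUN:-1}"
# Hypothetical placeholder: the manager instance name, assumed here to be the node's hostname.
MGR_ID="${MGR_ID:-$(uname -n)}"

# Print the command in dry-run mode, otherwise execute it unchanged.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

run apt install ceph-mgr-dashboard                       # step 1 (done on all 3 manager nodes)
run ceph mgr module enable dashboard --force             # step 2
run ceph config-key set mgr/dashboard/server_addr ::     # step 3
run ceph mgr module disable dashboard                    # step 4
run ceph mgr module enable dashboard                     # step 5
run systemctl restart "ceph-mgr@${MGR_ID}.service"       # step 6 (this is where it failed)
```

Note that step 1 was repeated on each of the 3 manager nodes, while steps 2 to 6 were only run on node #2.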
I am not sure what I did wrong. Any help would be much appreciated.