I need some help with ceph:
In my home lab I have a few Intel nucs that I run a proxmox cluster ver. 8.2.4 on.
A while ago, on one of the nodes (mox04) the ceph monitor crashed and I was unable to restart it. I did a "destroy monitor" from the GUI. This caused the node to hang and I had to power cycle it.
The node works fine now, except that I'm not able to create a monitor on the node.
The node name or IP address does not appear in the Ceph configuration files (/etc/pve/ceph.conf) or in the output of Ceph commands (ceph -s, ceph mon dump).
I used strace to trace the execution of the 'pveceph mon create' command, uploaded the result to Goggle Gemini and asked it to analyze the result:
The answer was:
The strace output shows that the information claiming the monitor exists comes from the Proxmox cluster file system (pmxcfs). The relevant lines in the strace are:
write(5, "M)\0\0\0{\"prefix\":\"mon metadata\",\"f"..., 46) = 46
read(5, ">3\0\0\0", 5) = 5
read(5, "{\"return_code\":0,\"status_message"..., 51) = 51
This sequence indicates that the pveceph tool is sending a request to pmxcfs to retrieve monitor metadata. The response from pmxcfs includes a status message indicating that the monitor address is already in use.
This suggests that the monitor information is stored in the pmxcfs database, even though it doesn't appear in the Ceph configuration files or the output of Ceph commands.
In my home lab I have a few Intel nucs that I run a proxmox cluster ver. 8.2.4 on.
A while ago, on one of the nodes (mox04) the ceph monitor crashed and I was unable to restart it. I did a "destroy monitor" from the GUI. This caused the node to hang and I had to power cycle it.
The node works fine now, except that I'm not able to create a monitor on the node.
Code:
# ceph mon remove mox04
mon.mox04 does not exist or has already been removed
# pveceph mon create
monitor address '10.0.0.104' already in use
The node name or IP address does not appear in the Ceph configuration files (/etc/pve/ceph.conf) or in the output of Ceph commands (ceph -s, ceph mon dump).
I used strace to trace the execution of the 'pveceph mon create' command, uploaded the result to Goggle Gemini and asked it to analyze the result:
The answer was:
The strace output shows that the information claiming the monitor exists comes from the Proxmox cluster file system (pmxcfs). The relevant lines in the strace are:
write(5, "M)\0\0\0{\"prefix\":\"mon metadata\",\"f"..., 46) = 46
read(5, ">3\0\0\0", 5) = 5
read(5, "{\"return_code\":0,\"status_message"..., 51) = 51
This sequence indicates that the pveceph tool is sending a request to pmxcfs to retrieve monitor metadata. The response from pmxcfs includes a status message indicating that the monitor address is already in use.
This suggests that the monitor information is stored in the pmxcfs database, even though it doesn't appear in the Ceph configuration files or the output of Ceph commands.
Last edited: