Hi,
Proxmox 7.2, Dell PowerEdge R640, first member of cluster of 5.
From this morning I started to see the errors, listed below, in the syslog file. Manually running the commands generates the following error :
root@pm1:~# systemctl show chrony
Failed to get properties: Transport endpoint is not connected
Backups are failing too to local disk with error :
ERROR: Backup of VM 108 failed - start failed: org.freedesktop.DBus.Error.NoReply: Did not receive a reply. Possible causes include: the remote application did not send a reply, the message bus security policy blocked the reply, the reply timeout expired, or the network connection was broken.
Virtual machines are all running fine, network connectivity seems fine, network mount points are fine, I can write to the mounted filesystems etc and nothing useful in dmesg that I can see.
Any ideas where to start to to fix this other than a reboot (I currently moving all the VM's off the server)
Mar 26 10:44:19 pm1-bt pvedaemon[31780]: command 'systemctl show chrony' failed: exit code 1
Mar 26 10:45:49 pm1-bt pvedaemon[31780]: command 'systemctl show corosync' failed: exit code 1
Mar 26 10:47:01 pm1-bt pmxcfs[687781]: [dcdb] notice: data verification successful
Mar 26 10:47:19 pm1-bt pvedaemon[31780]: command 'systemctl show cron' failed: exit code 1
Mar 26 10:48:49 pm1-bt pvedaemon[31780]: command 'systemctl show ksmtuned' failed: exit code 1
Mar 26 10:50:19 pm1-bt pvedaemon[31780]: command 'systemctl show postfix@-' failed: exit code 1
Mar 26 10:51:49 pm1-bt pvedaemon[31780]: command 'systemctl show pve-cluster' failed: exit code 1
Mar 26 10:53:19 pm1-bt pvedaemon[31780]: command 'systemctl show pve-firewall' failed: exit code 1
Mar 26 10:54:49 pm1-bt pvedaemon[31780]: command 'systemctl show pve-ha-crm' failed: exit code 1
Mar 26 10:56:19 pm1-bt pvedaemon[31780]: command 'systemctl show pve-ha-lrm' failed: exit code 1
Note : This server is version Proxmox 7.2 where all the other nodes are 6.3. Its been working for 245 days without issue, but its a difference that might be the reason for the failure?
Best Regards,
Nigel
Proxmox 7.2, Dell PowerEdge R640, first member of cluster of 5.
From this morning I started to see the errors, listed below, in the syslog file. Manually running the commands generates the following error :
root@pm1:~# systemctl show chrony
Failed to get properties: Transport endpoint is not connected
Backups are failing too to local disk with error :
ERROR: Backup of VM 108 failed - start failed: org.freedesktop.DBus.Error.NoReply: Did not receive a reply. Possible causes include: the remote application did not send a reply, the message bus security policy blocked the reply, the reply timeout expired, or the network connection was broken.
Virtual machines are all running fine, network connectivity seems fine, network mount points are fine, I can write to the mounted filesystems etc and nothing useful in dmesg that I can see.
Any ideas where to start to to fix this other than a reboot (I currently moving all the VM's off the server)
Mar 26 10:44:19 pm1-bt pvedaemon[31780]: command 'systemctl show chrony' failed: exit code 1
Mar 26 10:45:49 pm1-bt pvedaemon[31780]: command 'systemctl show corosync' failed: exit code 1
Mar 26 10:47:01 pm1-bt pmxcfs[687781]: [dcdb] notice: data verification successful
Mar 26 10:47:19 pm1-bt pvedaemon[31780]: command 'systemctl show cron' failed: exit code 1
Mar 26 10:48:49 pm1-bt pvedaemon[31780]: command 'systemctl show ksmtuned' failed: exit code 1
Mar 26 10:50:19 pm1-bt pvedaemon[31780]: command 'systemctl show postfix@-' failed: exit code 1
Mar 26 10:51:49 pm1-bt pvedaemon[31780]: command 'systemctl show pve-cluster' failed: exit code 1
Mar 26 10:53:19 pm1-bt pvedaemon[31780]: command 'systemctl show pve-firewall' failed: exit code 1
Mar 26 10:54:49 pm1-bt pvedaemon[31780]: command 'systemctl show pve-ha-crm' failed: exit code 1
Mar 26 10:56:19 pm1-bt pvedaemon[31780]: command 'systemctl show pve-ha-lrm' failed: exit code 1
Note : This server is version Proxmox 7.2 where all the other nodes are 6.3. Its been working for 245 days without issue, but its a difference that might be the reason for the failure?
Best Regards,
Nigel