I have an 8-node PVE 8.0.4 cluster made up of 2 Dell R610 and 6 Dell R710 servers. The most recent additions were 3 R710s, which were added some months after the first 5 servers were clustered. The VM ISO images I am using are Ubuntu 22.04.3-live-server-amd64.iso images that I have used to create VMs on the other nodes without issue. However, on the three most recently added nodes, prox-6, prox-7, and prox-8, which are all functional and joined to the quorate 8-node cluster, some VMs are unable to connect to the noVNC console. When the attempt is made, the console shows a red strip at the top of the screen with "X Failed to connect to server".
Why is this happening? Of the three newly added nodes, one is able to display its VM's console and the other two cannot.
There are also differences in the syslogs on these nodes.
On prox-7, VM 117 reaches the noVNC shell, with this syslog output:
Nov 20 12:52:10 prox-7 systemd[1]: user@0.service: Deactivated successfully.
Nov 20 12:52:10 prox-7 systemd[1]: Stopped user@0.service - User Manager for UID 0.
Nov 20 12:52:10 prox-7 systemd[1]: Stopping user-runtime-dir@0.service - User Runtime Directory /run/user/0...
Nov 20 12:52:10 prox-7 systemd[1]: run-user-0.mount: Deactivated successfully.
Nov 20 12:52:10 prox-7 systemd[1]: user-runtime-dir@0.service: Deactivated successfully.
Nov 20 12:52:10 prox-7 systemd[1]: Stopped user-runtime-dir@0.service - User Runtime Directory /run/user/0.
Nov 20 12:52:10 prox-7 systemd[1]: Removed slice user-0.slice - User Slice of UID 0.
Nov 20 12:52:10 prox-7 systemd[1]: user-0.slice: Consumed 1.153s CPU time.
The other two nodes show this in their syslogs instead (example from prox-6):
Nov 20 12:53:34 prox-6 pmxcfs[1123]: [status] notice: received log
Nov 20 12:53:34 prox-6 sshd[136491]: Connection closed by 192.168.0.112 port 34026 [preauth]
Nov 20 12:53:34 prox-6 pmxcfs[1123]: [status] notice: received log
All other nodes log entries like these when a VM successfully connects to the noVNC shell (example from prox-5):
Nov 20 13:02:38 prox-5 systemd[1]: user@0.service: Deactivated successfully.
Nov 20 13:02:38 prox-5 systemd[1]: Stopped user@0.service - User Manager for UID 0.
Nov 20 13:02:39 prox-5 systemd[1]: Stopping user-runtime-dir@0.service - User Runtime Directory /run/user/0...
Nov 20 13:02:39 prox-5 systemd[1]: run-user-0.mount: Deactivated successfully.
Nov 20 13:02:39 prox-5 systemd[1]: user-runtime-dir@0.service: Deactivated successfully.
Nov 20 13:02:39 prox-5 systemd[1]: Stopped user-runtime-dir@0.service - User Runtime Directory /run/user/0.
Nov 20 13:02:39 prox-5 systemd[1]: Removed slice user-0.slice - User Slice of UID 0.
Nov 20 13:02:39 prox-5 systemd[1]: user-0.slice: Consumed 1.249s CPU time.
What might be causing these failures on two of the three new nodes?