Hi there,
I have a 2-node-Proxmox-Cluster using a dedicated QDevice. The nodes are "ernie" and "bert". This morning, for now reason "bert" went offline. As a result, VMs with HA configuration got moved to node "ernie" and started there. However, none of the moved VMs received a IPv4 via DHCP, they were not accessible from my LAN. I had to go to the Console and restart networking.
How can I debug this situation, starting with why bert went down? bert's syslog shows nothing special:
What additional info can I provide?
I have a 2-node-Proxmox-Cluster using a dedicated QDevice. The nodes are "ernie" and "bert". This morning, for now reason "bert" went offline. As a result, VMs with HA configuration got moved to node "ernie" and started there. However, none of the moved VMs received a IPv4 via DHCP, they were not accessible from my LAN. I had to go to the Console and restart networking.
How can I debug this situation, starting with why bert went down? bert's syslog shows nothing special:
Code:
Jan 02 08:00:14 bert sshd[2527962]: Accepted publickey for root from 192.168.30.8 port 37234 ssh2: RSA SHA256:IsUxs2JXFlny/OXUxigsCBx69WQeZb8xX1fdoGjyEQU
Jan 02 08:00:14 bert sshd[2527962]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0)
Jan 02 08:00:14 bert systemd-logind[797]: New session 6526 of user root.
Jan 02 08:00:14 bert systemd[1]: Started session-6526.scope - Session 6526 of User root.
Jan 02 08:00:14 bert sshd[2527962]: pam_env(sshd:session): deprecated reading of user environment enabled
Jan 02 08:00:15 bert sshd[2527962]: Received disconnect from 192.168.30.8 port 37234:11: disconnected by user
Jan 02 08:00:15 bert sshd[2527962]: Disconnected from user root 192.168.30.8 port 37234
Jan 02 08:00:15 bert sshd[2527962]: pam_unix(sshd:session): session closed for user root
Jan 02 08:00:15 bert systemd-logind[797]: Session 6526 logged out. Waiting for processes to exit.
Jan 02 08:00:15 bert systemd[1]: session-6526.scope: Deactivated successfully.
Jan 02 08:00:15 bert systemd-logind[797]: Removed session 6526.
Jan 02 08:00:25 bert systemd[1]: Stopping user@0.service - User Manager for UID 0...
Jan 02 08:00:25 bert systemd[2527791]: Activating special unit exit.target...
Jan 02 08:00:25 bert systemd[2527791]: Stopped target default.target - Main User Target.
Jan 02 08:00:25 bert systemd[2527791]: Stopped target basic.target - Basic System.
Jan 02 08:00:25 bert systemd[2527791]: Stopped target paths.target - Paths.
Jan 02 08:00:25 bert systemd[2527791]: Stopped target sockets.target - Sockets.
Jan 02 08:00:25 bert systemd[2527791]: Stopped target timers.target - Timers.
Jan 02 08:00:25 bert systemd[2527791]: Closed dirmngr.socket - GnuPG network certificate management daemon.
Jan 02 08:00:25 bert systemd[2527791]: Closed gpg-agent-browser.socket - GnuPG cryptographic agent and passphrase cache (access for web browsers).
Jan 02 08:00:25 bert systemd[2527791]: Closed gpg-agent-extra.socket - GnuPG cryptographic agent and passphrase cache (restricted).
Jan 02 08:00:25 bert systemd[2527791]: Closed gpg-agent-ssh.socket - GnuPG cryptographic agent (ssh-agent emulation).
Jan 02 08:00:25 bert systemd[2527791]: Closed gpg-agent.socket - GnuPG cryptographic agent and passphrase cache.
Jan 02 08:00:25 bert systemd[2527791]: Removed slice app.slice - User Application Slice.
Jan 02 08:00:25 bert systemd[2527791]: Reached target shutdown.target - Shutdown.
Jan 02 08:00:25 bert systemd[2527791]: Finished systemd-exit.service - Exit the Session.
Jan 02 08:00:25 bert systemd[2527791]: Reached target exit.target - Exit the Session.
Jan 02 08:00:25 bert systemd[1]: user@0.service: Deactivated successfully.
Jan 02 08:00:25 bert systemd[1]: Stopped user@0.service - User Manager for UID 0.
Jan 02 08:00:25 bert systemd[1]: Stopping user-runtime-dir@0.service - User Runtime Directory /run/user/0...
Jan 02 08:00:25 bert systemd[1]: run-user-0.mount: Deactivated successfully.
Jan 02 08:00:25 bert systemd[1]: user-runtime-dir@0.service: Deactivated successfully.
Jan 02 08:00:25 bert systemd[1]: Stopped user-runtime-dir@0.service - User Runtime Directory /run/user/0.
Jan 02 08:00:25 bert systemd[1]: Removed slice user-0.slice - User Slice of UID 0.
Jan 02 08:00:25 bert systemd[1]: user-0.slice: Consumed 4.245s CPU time.
Jan 02 08:17:01 bert CRON[2534266]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)
Jan 02 08:17:01 bert CRON[2534267]: (root) CMD (cd / && run-parts --report /etc/cron.hourly)
Jan 02 08:17:01 bert CRON[2534266]: pam_unix(cron:session): session closed for user root
Jan 02 08:20:50 bert pmxcfs[1072]: [dcdb] notice: data verification successful
-- Reboot --
What additional info can I provide?