one node in cluster reboot randomly

smartcoder

New Member
Oct 30, 2024
3
0
1
Hello

I am contacting you because I currently have a proxmox cluster in production composed of 4 servers.

One of the servers restarts randomly after a few weeks.

I don't know why?

My version is 8.2.
 
Nov 29 03:18:06 pve4 systemd-logind[3591]: Removed session 259444.
Nov 29 03:18:13 pve4 pvestatd[4337]: got timeout
Nov 29 03:18:14 pve4 pvestatd[4337]: status update time (5.822 seconds)
Nov 29 03:18:23 pve4 pvestatd[4337]: got timeout
Nov 29 03:18:24 pve4 pvestatd[4337]: status update time (5.859 seconds)
Nov 29 03:18:33 pve4 pvestatd[4337]: got timeout
Nov 29 03:18:34 pve4 pvestatd[4337]: status update time (5.824 seconds)
Nov 29 03:18:43 pve4 pvestatd[4337]: got timeout
Nov 29 03:18:44 pve4 pvestatd[4337]: status update time (5.894 seconds)
Nov 29 03:18:53 pve4 pvestatd[4337]: got timeout
Nov 29 03:18:54 pve4 pvestatd[4337]: status update time (5.834 seconds)
Nov 29 03:19:03 pve4 pvestatd[4337]: got timeout
Nov 29 03:19:03 pve4 pvestatd[4337]: status update time (5.835 seconds)
Nov 29 03:19:06 pve4 sshd[3634915]: Accepted password for root from 172.16.111.106 port 49335 ssh2
Nov 29 03:19:06 pve4 sshd[3634915]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0)
Nov 29 03:19:06 pve4 systemd-logind[3591]: New session 259445 of user root.
Nov 29 03:19:06 pve4 systemd[1]: Started session-259445.scope - Session 259445 of User root.
Nov 29 03:19:06 pve4 sshd[3634915]: pam_env(sshd:session): deprecated reading of user environment enabled
Nov 29 03:19:06 pve4 sshd[3634915]: Received disconnect from 172.16.111.106 port 49335:11: Connection terminated by the client.
Nov 29 03:19:06 pve4 sshd[3634915]: Disconnected from user root 172.16.111.106 port 49335
Nov 29 03:19:06 pve4 sshd[3634915]: pam_unix(sshd:session): session closed for user root
Nov 29 03:19:06 pve4 systemd[1]: session-259445.scope: Deactivated successfully.
Nov 29 03:19:06 pve4 systemd-logind[3591]: Session 259445 logged out. Waiting for processes to exit.
Nov 29 03:19:06 pve4 systemd-logind[3591]: Removed session 259445.
Nov 29 03:19:14 pve4 pvestatd[4337]: got timeout
Nov 29 03:19:14 pve4 pvestatd[4337]: status update time (5.820 seconds)
Nov 29 03:19:22 pve4 pmxcfs[4152]: [status] notice: received log
Nov 29 03:19:22 pve4 pvedaemon[3410916]: got timeout
Nov 29 03:19:24 pve4 pvestatd[4337]: got timeout
Nov 29 03:19:24 pve4 pvestatd[4337]: status update time (5.807 seconds)
Nov 29 03:19:26 pve4 pmxcfs[4152]: [status] notice: received log
Nov 29 03:19:34 pve4 pvestatd[4337]: got timeout
Nov 29 03:19:34 pve4 pvestatd[4337]: status update time (5.845 seconds)
Nov 29 03:19:43 pve4 pvestatd[4337]: got timeout
Nov 29 03:19:44 pve4 pvestatd[4337]: status update time (5.827 seconds)
Nov 29 03:19:53 pve4 pvestatd[4337]: got timeout
Nov 29 03:19:54 pve4 pvestatd[4337]: status update time (5.879 seconds)
Nov 29 03:20:03 pve4 pvestatd[4337]: got timeout
Nov 29 03:20:04 pve4 pvestatd[4337]: status update time (5.920 seconds)
Nov 29 03:20:06 pve4 sshd[3635635]: Accepted password for root from 172.16.111.106 port 49524 ssh2
Nov 29 03:20:06 pve4 sshd[3635635]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0)
Nov 29 03:20:06 pve4 systemd-logind[3591]: New session 259446 of user root.
Nov 29 03:20:06 pve4 systemd[1]: Started session-259446.scope - Session 259446 of User root.
Nov 29 03:20:06 pve4 sshd[3635635]: pam_env(sshd:session): deprecated reading of user environment enabled
Nov 29 03:20:06 pve4 sshd[3635635]: Received disconnect from 172.16.111.106 port 49524:11: Connection terminated by the client.
Nov 29 03:20:06 pve4 sshd[3635635]: Disconnected from user root 172.16.111.106 port 49524
Nov 29 03:20:06 pve4 sshd[3635635]: pam_unix(sshd:session): session closed for user root
Nov 29 03:20:06 pve4 systemd[1]: session-259446.scope: Deactivated successfully.
Nov 29 03:20:06 pve4 systemd-logind[3591]: Session 259446 logged out. Waiting for processes to exit.
Nov 29 03:20:06 pve4 systemd-logind[3591]: Removed session 259446.
Nov 29 03:20:13 pve4 pvestatd[4337]: got timeout
Nov 29 03:20:13 pve4 pvestatd[4337]: status update time (5.851 seconds)
-- Reboot --
Nov 29 03:24:03 pve4 kernel: Linux version 6.8.12-9-pve (build@proxmox) (gcc (Debian 12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40) #1 SMP PREEMPT_DYNAMIC PMX 6.8.12-9 (2025-03-16T19:18Z) ()
Nov 29 03:24:03 pve4 kernel: Command line: initrd=\EFI\proxmox\6.8.12-9-pve\initrd.img-6.8.12-9-pve root=ZFS=rpool/ROOT/pve-1 boot=zfs
Nov 29 03:24:03 pve4 kernel: KERNEL supported cpus:
Nov 29 03:24:03 pve4 kernel: Intel GenuineIntel
Nov 29 03:24:03 pve4 kernel: AMD AuthenticAMD
Nov 29 03:24:03 pve4 kernel: Hygon HygonGenuine
Nov 29 03:24:03 pve4 kernel: Centaur CentaurHauls
Nov 29 03:24:03 pve4 kernel: zhaoxin Shanghai
Nov 29 03:24:03 pve4 kernel: BIOS-provided physical RAM map:
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x0000000000000000-0x000000000009ffff] usable
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x00000000000a0000-0x00000000000fffff] reserved
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x0000000000100000-0x00000000682fefff] usable
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x00000000682ff000-0x000000006ebfefff] reserved
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x000000006ebff000-0x000000006f9fefff] ACPI NVS
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x000000006f9ff000-0x000000006fffefff] ACPI data
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x000000006ffff000-0x000000006fffffff] usable
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x0000000070000000-0x000000008fffffff] reserved
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x00000000fe000000-0x00000000fe010fff] reserved
Nov 29 03:24:03 pve4 kernel: BIOS-e820: [mem 0x0000000100000000-0x000000906fffffff] usable
Nov 29 03:24:03 pve4 kernel: NX (Execute Disable) protection: active
Nov 29 03:24:03 pve4 kernel: APIC: Static calls initialized
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b949020-0x4b95105f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b949020-0x4b95105f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b934020-0x4b94885f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b934020-0x4b94885f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b8d2020-0x4b93365f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b8d2020-0x4b93365f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b870020-0x4b8d165f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b870020-0x4b8d165f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b80e020-0x4b86f65f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b80e020-0x4b86f65f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b7ac020-0x4b80d65f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b7ac020-0x4b80d65f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b7a5020-0x4b7ab45f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b7a5020-0x4b7ab45f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b79e020-0x4b7a445f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b79e020-0x4b7a445f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b797020-0x4b79d45f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: e820: update [mem 0x4b797020-0x4b79d45f] usable ==> usable
Nov 29 03:24:03 pve4 kernel: extended physical RAM map:
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x0000000000000000-0x000000000009ffff] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x00000000000a0000-0x00000000000fffff] reserved
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x0000000000100000-0x000000004b79701f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b797020-0x000000004b79d45f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b79d460-0x000000004b79e01f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b79e020-0x000000004b7a445f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b7a4460-0x000000004b7a501f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b7a5020-0x000000004b7ab45f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b7ab460-0x000000004b7ac01f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b7ac020-0x000000004b80d65f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b80d660-0x000000004b80e01f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b80e020-0x000000004b86f65f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b86f660-0x000000004b87001f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b870020-0x000000004b8d165f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b8d1660-0x000000004b8d201f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b8d2020-0x000000004b93365f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b933660-0x000000004b93401f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b934020-0x000000004b94885f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b948860-0x000000004b94901f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b949020-0x000000004b95105f] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000004b951060-0x00000000682fefff] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x00000000682ff000-0x000000006ebfefff] reserved
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000006ebff000-0x000000006f9fefff] ACPI NVS
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000006f9ff000-0x000000006fffefff] ACPI data
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x000000006ffff000-0x000000006fffffff] usable
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x0000000070000000-0x000000008fffffff] reserved
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x00000000fe000000-0x00000000fe010fff] reserved
Nov 29 03:24:03 pve4 kernel: reserve setup_data: [mem 0x0000000100000000-0x000000906fffffff] usable