I have been experiencing random freezes for a while now with this computer. It is a perfect device for proxmox due to its high power and very low energy consumption.
Freeze-ups occur approximately every 1 to 2 days.
System:
I've done:
During some freezes I have been able to have a screen connected. The screen simply freezes and does not show any errors. I can't enter commands or interact
In syslog I see this the last time it happened (The logs on the different freezes do not seem to be related and each log is different):
To "fix it" I press and hold the power button on the device.
I'm going crazy with this, I've been trying to fix it for weeks. I love Proxmox and really don't want to switch to xcp-ng or another platform.
Freeze-ups occur approximately every 1 to 2 days.
System:
- Gigabyte GB-BRR7H-4800 (rev. 1.0)
- 64 GB RAM: 2x Kingston KVR26S19D8/32 SO-DIMM DDR4 2666Mhz 32GB C
- NVMe: Samsung 980 SSD 1TB PCIe 3.0 NVMe M.2
- SSD: SanDisk 1TB
I've done:
- memtest86 for 20h. No errors
- I have put additional ventilation for the device to lower the temperature (no component rises above 40ºC).
- Alternative Linux Kernels.
- Change the installation drive from NVMe to SSD and vice versa.
- Stress the CPU and RAM of several VMs to try to force the freeze. This does not cause freezes of any kind.
During some freezes I have been able to have a screen connected. The screen simply freezes and does not show any errors. I can't enter commands or interact
In syslog I see this the last time it happened (The logs on the different freezes do not seem to be related and each log is different):
Code:
Jul 29 05:17:01 pve CRON[715550]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)
Jul 29 05:17:01 pve CRON[715551]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jul 29 05:17:01 pve CRON[715550]: pam_unix(cron:session): session closed for user root
Jul 29 06:06:47 pve systemd[1]: Starting Daily apt upgrade and clean activities...
Jul 29 06:06:47 pve systemd[1]: apt-daily-upgrade.service: Succeeded.
Jul 29 06:06:47 pve systemd[1]: Finished Daily apt upgrade and clean activities.
Jul 29 06:12:31 pve postfix/qmgr[1127]: 426884C083A: from=<root@pve.xxxxxxx.es>, size=1140, nrcpt=1 (queue active)
Jul 29 06:12:32 pve postfix/smtp[734720]: connect to gmail-smtp-in.l.google.com[2a00:1450:400c:c00::1a]:25: Network is unreachable
Jul 29 06:13:02 pve postfix/smtp[734720]: connect to gmail-smtp-in.l.google.com[173.194.76.27]:25: Connection timed out
Jul 29 06:13:32 pve postfix/smtp[734720]: connect to alt1.gmail-smtp-in.l.google.com[142.250.153.27]:25: Connection timed out
Jul 29 06:13:32 pve postfix/smtp[734720]: connect to alt1.gmail-smtp-in.l.google.com[2a00:1450:4013:c16::1a]:25: Network is unreachable
Jul 29 06:13:32 pve postfix/smtp[734720]: connect to alt2.gmail-smtp-in.l.google.com[2a00:1450:4025:c03::1a]:25: Network is unreachable
Jul 29 06:13:32 pve postfix/smtp[734720]: 426884C083A: to=<xxxxxx@gmail.com>, relay=none, delay=183295, delays=183234/0.01/61/0, dsn=4.4.1, status=deferred (connect to alt2.gmail-smtp-in.l.google.com[2a00:1450:4025:c03::1a]:25: Network is unreachable)
Jul 29 06:17:01 pve CRON[736270]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)
Jul 29 06:17:01 pve CRON[736271]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jul 29 06:17:01 pve CRON[736270]: pam_unix(cron:session): session closed for user root
Jul 29 06:25:01 pve CRON[739027]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)
Jul 29 06:25:01 pve CRON[739028]: (root) CMD (test -x /usr/sbin/anacron || ( cd / && run-parts --report /etc/cron.daily ))
Jul 29 06:25:01 pve CRON[739027]: pam_unix(cron:session): session closed for user root
Jul 29 06:42:37 pve systemd: systemd-journald.service: Main process exited, code=killed, status=6/ABRT
Jul 29 06:46:06 pve systemd: systemd-journald.service: Failed with result 'watchdog'.
Jul 29 06:47:14 pve systemd: systemd-journald.service: Consumed 58.594s CPU time.
-- Reboot --
Jul 29 16:38:33 pve kernel: Linux version 5.15.30-2-pve (build@proxmox) (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2) #1 SMP PVE 5.15.30-3 (Fri, 22 Apr 2022 18:08:27 +0200) ()
Jul 29 16:38:33 pve kernel: Command line: BOOT_IMAGE=/boot/vmlinuz-5.15.30-2-pve root=/dev/mapper/pve-root ro quiet
To "fix it" I press and hold the power button on the device.
I'm going crazy with this, I've been trying to fix it for weeks. I love Proxmox and really don't want to switch to xcp-ng or another platform.