Hi there, I've been having issues with my setup since the beginning. My server runs stably for at most 1-2 weeks before it hard crashes; sometimes it crashes after only a day or two. If I hook up a monitor after a crash (with my GPU plugged in), there is no output on the screen. In my router's device list, the host and all of my VMs/CTs no longer show up. This has happened with both Proxmox 5.4 and 6.0.
System Specifications:
- CPU: AMD Ryzen 7 1700
- Motherboard: ASRock B450M Pro4
- Memory: 4x 8GB DDR4-3000 RAM
- GPU: GeForce GTX 650 (Removed)
- Storage:
  - 1x 250 GB SSD
  - 1x 10 TB HDD
- pveversion -v output:
proxmox-ve: 6.0-2 (running kernel: 5.0.15-1-pve)
pve-manager: 6.0-4 (running version: 6.0-4/2a719255)
pve-kernel-5.0: 6.0-5
pve-kernel-helper: 6.0-5
pve-kernel-5.0.15-1-pve: 5.0.15-1
ceph-fuse: 12.2.11+dfsg1-2.1
corosync: 3.0.2-pve2
criu: 3.11-3
glusterfs-client: 5.5-3
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.10-pve1
libpve-access-control: 6.0-2
libpve-apiclient-perl: 3.0-2
libpve-common-perl: 6.0-2
libpve-guest-common-perl: 3.0-1
libpve-http-server-perl: 3.0-2
libpve-storage-perl: 6.0-5
libqb0: 1.0.5-1
lvm2: 2.03.02-pve3
lxc-pve: 3.1.0-61
lxcfs: 3.0.3-pve60
novnc-pve: 1.0.0-60
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.0-5
pve-cluster: 6.0-4
pve-container: 3.0-3
pve-docs: 6.0-4
pve-edk2-firmware: 2.20190614-1
pve-firewall: 4.0-5
pve-firmware: 3.0-2
pve-ha-manager: 3.0-2
pve-i18n: 2.0-2
pve-qemu-kvm: 4.0.0-3
pve-xtermjs: 3.13.2-1
qemu-server: 6.0-5
smartmontools: 7.0-pve2
spiceterm: 3.1-1
vncterm: 1.6-1
zfsutils-linux: 0.8.1-pve1
What I've tried so far:
- I've checked the syslog and the kernel log after each crash. At the point of the crash they contain only ASCII null characters (^@^@^@...), which gives me no meaningful explanation.
- The other logs don't mention the crash at all.
- I've tested all four modules of my RAM, all in different spots using memtest86+ for a full day. Memtest reported no errors.
- I've blacklisted all nouveau drivers as I have already taken out my graphics card anyway. (This did actually help make the system more stable than it used to be, but it still crashes often.)
- I've checked for any settings in BIOS that could be causing any power issues and have disabled them.
- I've tried running the RAM at the BIOS default speed of 2133 MHz.
- I've reset the BIOS settings.
- I've updated the BIOS to version 2.00.
- I've done a full reinstall of Proxmox.
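
For anyone who wants to check their own logs for the same null-character symptom, here is a minimal Python sketch of how those runs can be located. It makes several assumptions: the log is at /var/log/syslog, a run of 8 or more consecutive NUL bytes counts as a crash marker, and the function name find_nul_runs is purely illustrative (it is not part of Proxmox or any library). It prints the last readable lines before each run, which are the last entries written before the system went down.

```python
import re
from pathlib import Path

# ^@ in a log viewer is a NUL byte (0x00). The minimum run length of 8
# is an arbitrary assumption to skip stray single NULs.
NUL_RUN = re.compile(rb"\x00{8,}")


def find_nul_runs(data: bytes, context_lines: int = 3):
    """Return (offset, length, last readable lines before the run) for each NUL run."""
    results = []
    for match in NUL_RUN.finditer(data):
        # Everything before the run, with any earlier NULs stripped out.
        before = data[:match.start()].replace(b"\x00", b"")
        lines = before.decode("utf-8", errors="replace").splitlines()
        context = lines[-context_lines:] if context_lines > 0 else []
        results.append((match.start(), match.end() - match.start(), context))
    return results


if __name__ == "__main__":
    log_path = Path("/var/log/syslog")  # assumed location; adjust as needed
    try:
        data = log_path.read_bytes()
    except OSError:
        data = b""  # file missing or unreadable: nothing to report
    for offset, length, context in find_nul_runs(data):
        print(f"NUL run of {length} bytes at offset {offset}; last lines before it:")
        for line in context:
            print("   ", line)
```

If the lines before each run show nothing unusual, that at least confirms the machine stopped abruptly rather than logging an error on the way down.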