Hi,
my PVE server crashes sometimes, but I don't have any idea why. It happens at intervals of several days to multiple weeks.
The problem is that I don't have any log entry that could help me.
What happens:
- no VM reachable
- I can still ping the server
- No SSH connection to Proxmox or any VM
- If I work directly on the server (keyboard, ...), I can still ping the outside
- The server itself also seems to work fine; background processes keep running
- The LEDs of the network devices are still flashing
- The SSH service (systemctl status ssh) was "active"
- omv_backup is my backup server, which wasn't running at the time
To narrow down when it happens, one VM sends me a life signal every 5 minutes. I received the last signal at 1:36 AM this morning.
At around 6:21 I tried to start/stop the server via the console. In the end, I had to restart the server.
-> Why can't I see any error message? I could remove all external/internal devices, but I would like to know what's going on. Is there a more verbose log level, ...?
System:
ASRock N100M + 32 GB RAM
2 WD Red 2TB SSDs as ZFS storage
Onboard LAN as a backup
1 added USB 2.5G NIC
Additional SATA PCIe adapter
- added a PVE report "export"
- added a (quite large) journalctl output
Strangely, as I write this, I noticed that something apparently still worked:
Bash:
2024-04-20 02:41:17 starting update
2024-04-20 02:41:17 start download http://download.proxmox.com/images/aplinfo-pve-8.dat.asc
2024-04-20 02:41:17 download finished: 200 OK
2024-04-20 02:41:17 start download http://download.proxmox.com/images/aplinfo-pve-8.dat.gz
2024-04-20 02:41:24 download finished: 200 OK
2024-04-20 02:41:24 signature verification: gpgv: Signature made Fri Mar 1 10:49:10 2024 CET
2024-04-20 02:41:24 signature verification: gpgv: using RSA key F4E136C67CDCE41AE6DE6FC81140AF8F639E0C39
2024-04-20 02:41:24 signature verification: gpgv: Good signature from "Proxmox Bookworm Release Key <proxmox-release@proxmox.com>"
2024-04-20 02:41:24 update successful
2024-04-20 02:41:24 start download https://releases.turnkeylinux.org/pve/aplinfo.dat.asc
2024-04-20 02:41:39 download failed: 500 Can't connect to releases.turnkeylinux.org:443 (SSL connect attempt failed error:0A000126:SSL routines::unexpected eof while reading)
2024-04-20 02:41:39 update failed - no signature file '/var/lib/pve-manager/apl-info/pveam-releases.turnkeylinux.org.tmp.1500417.asc'
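The entries above are from 02:41, which is after the last life signal (1:36) and before I went to the console (6:21). To look at only that window in a larger log, something like the following could help. It is just a minimal sketch: it assumes every relevant line starts with a `YYYY-MM-DD HH:MM:SS` timestamp like the log above (the default journalctl output uses a different format), and the function name `lines_in_window` and the first sample line are made up for illustration.

```python
from datetime import datetime

# Assumption: timestamps look like "2024-04-20 02:41:17" (19 characters).
TS_FORMAT = "%Y-%m-%d %H:%M:%S"


def lines_in_window(lines, start, end):
    """Return the log lines whose leading timestamp lies in [start, end]."""
    selected = []
    for line in lines:
        try:
            stamp = datetime.strptime(line[:19], TS_FORMAT)
        except ValueError:
            continue  # line does not start with a timestamp, skip it
        if start <= stamp <= end:
            selected.append(line)
    return selected


# Window between the last life signal and the console attempt.
last_signal = datetime(2024, 4, 20, 1, 36)
console_attempt = datetime(2024, 4, 20, 6, 21)

sample = [
    "2024-04-20 01:30:00 hypothetical entry before the window",
    "2024-04-20 02:41:17 starting update",
    "2024-04-20 02:41:39 download failed: 500 Can't connect to releases.turnkeylinux.org:443",
    "no timestamp on this line",
]

for line in lines_in_window(sample, last_signal, console_attempt):
    print(line)
```

With the sample above, only the two 02:41 lines fall inside the window; the earlier entry and the line without a timestamp are skipped.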