Proxmox random file system corruption

guimolins

New Member
Jan 3, 2025
2
0
1
Dear all,

Yesterday I performed a clean installation of Proxmox within a Supermicro server for a small business. I created a simple Ubuntu LXC container and set up an OpenVPN Server in it for remote access. Today I noticed I was no longer able to log into the Proxmox web server ui. I remoted into the Supermicro IPMI to find the error shown in the attached image and no form of input. I decided to reboot the server but now I am taken directly to shell, so my assumption is the OS is gone. What happened? I don't mind performing a new install but I would like to know how to avoid this in the future. Any help would be appreciated.

Best regards
 

Attachments

  • iKVM_capture.jpg
    iKVM_capture.jpg
    56.3 KB · Views: 22
Looks like one of the drives had a problem. Probably the Proxmox installation drive. Unfortunately the logs could probably not be written to the drive, so you can't see the actual problem. It could also be memory corruption causing disk corruption. Check the cables, run a memtest and run a SMART long self test on that drive.
 
  • Like
Reactions: Kingneutron
Assuming the disk/s are good (have you checked them?) - are you using a HW raid-controller, update the firmware on that controller.
 
  • Like
Reactions: guimolins
Looks like one of the drives had a problem. Probably the Proxmox installation drive. Unfortunately the logs could probably not be written to the drive, so you can't see the actual problem. It could also be memory corruption causing disk corruption. Check the cables, run a memtest and run a SMART long self test on that drive.
Ok I will definetely try this. The hardware is second hand from what I was told and it only has a single 1TB NVME SSD, so a drive corruption/malfunction is possible. Many thanks!