Earlier this year, I purchased a refurbished Dell R720xd server so I could run a few VMs. I installed Proxmox VE on it as the OS, and it has been running without a flaw ever since. I have four VMs on it (2 Linux, 1 Windows 10, and TrueNAS). Earlier this week, I finally decided to check for updates via the web GUI. It downloaded and installed the updates, then needed a reboot. The hang started after that reboot: the R720xd gets to the firmware initialization screen, flashes a message too quickly to read, then the screen goes blank and nothing else happens. I can still get to the iDRAC controller just fine, but Proxmox isn't loading.
Info on the R720xd:
2 - Intel Xeon CPUs E5-2697 v2
8 - 32GB DDR3 PC3L-12800L RAM sticks (7 now, as I discovered one of the sticks had gone bad; I'm guessing it was limping along until the reboot)
1 - Dell Broadcom 5720 Quad-Port Gig Network Daughter card
7 - 8TB SAS drives
2 pools: 1 for the Proxmox OS, the VMs, and their storage, and a ~35TB pool for TrueNAS
If any other information on the server would be helpful, let me know.
Things I've tried so far:
-Rebooting
-Powering down completely
-Powering down, pulling the Ethernet cables and power cords, holding the power button for 20 seconds, plugging power back in, letting it sit for a few minutes, then powering back up
-I can still get into the BIOS, Lifecycle Controller, and iDRAC on boot, so I ran a full hardware test. That is where I discovered the bad RAM stick; no other issues were detected
-I read about someone with a similar issue who fixed it by reseating all the drives, so I tried that, but it didn't help
-I recorded the message that flashes by too fast to read and played it back. It said that it was disconnecting the UEFI drivers. I read somewhere that this means the system is trying to boot from a USB stick. A USB stick is how I originally installed Proxmox on the server, so I thought that could be it, but the BIOS is set to boot from the hard drive.
-I read somewhere that a kernel was missing in Proxmox 8 and that the fix involved editing some files. The problem is that I have no idea how to get to those files. Since Proxmox is the OS and it isn't loading, I can't reach anything like that.
-When the server is "stuck", I can still see that all four ports on the Ethernet card are connected to my network. I can see their MAC addresses, but not the VMs themselves. (Passthrough issue?)
It could very well be that the update didn't break anything and the reboot itself exposed some other problem. I don't know. Is it possible to update or repair Proxmox from a USB drive in the server without erasing everything on the drives? What else can I check on the Proxmox side to tell whether the update or some other server issue is to blame? Keep in mind that the hardware test claims everything is OK. I had rebooted the server on the previous version of Proxmox quite a few times without any issues. It always just came back up, spun up the VMs, and away I went.
Help would be greatly appreciated.