Proxmox running for 1-3 days then all VM's inaccessible/fail while Proxmox GUI and PC still on. [Cautionary Tale]

greyskull

Member
Jul 13, 2023
3
0
6
Hopefully I've fixed my own problem with your help via searching the forums and various search engines. I'm sure there is still room for improvement and hopefully it will not reoccur because of underlying thermal issues.

I moved homes and after the move it took me a while to re-establish my server when I noticed VM's going offline though they looked fine in the GUI. At first I figured something hardware related got loose during the move and I'd deal with it as life allowed. After some time and digging I started investigating my logs and I'm not really sure why it was working flawlessly for so long before with such a problematic etc/default/grub and /proc/cmdline

within etc/default/grub there was;
GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on iommu=pt initcall_blacklist=sysfb_init pcie_acs_override=downstream,multifunction pci=nommconf'"

clearly an unnecessary ' before the parentheses "

which resulted in /proc/cmdline;
BOOT_IMAGE=/boot/vmlinuz-6.8.12-8-pve root=/dev/mapper/pve-root ro quiet intel_iommu=on iommu=pt initcall_blacklist=sysfb_init pcie_acs_override=downstream,multifunction pci=nommconf"

the parentheses should not have been there which can be fixed by removing the error ' and running command update-grub once done.

Now my logs are a LOT cleaner however reading various forums, not all pertaining directly to ProxMox but Unix/Linux as a whole, I realize there are probably a lot of things I could be doing more efficiently by really utilizing and learning logs and the cmdline/ssh/bash.
 
Good you managed to solve it.
In order to mark the thread as solved, Please edit the first post of the thread and select Solved from the pull-down menu.