My Proxmox cluster consists of three nodes. The first is a Dell notebook without a battery that usually runs only Home Assistant, the second is a Lenovo ThinkCentre M715 Gen 2 that currently runs only an Ubuntu VM, and the third is a VM on my Synology NAS that exists only to provide a third vote for quorum and otherwise does nothing. I set up the cluster just to see how it works; in the long term the ThinkCentre will probably be my only Proxmox VE host.
I wrote a web interface, running on my Synology, that triggers TV recordings from a Kathrein TechnoTrend TT-Smart C2821 receiver. The recordings are made by calling cvlc on the Ubuntu VM on the ThinkCentre, which writes the IP stream to a file. The calls are scheduled with the at command on that Ubuntu VM, but I don't think that causes the problem.
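To make the setup concrete, here is a minimal sketch of how such a recording job could be put together. It is not my exact script: the function names, the stream URL, the output path and the choice of the `file/ts` output are assumptions for illustration only. It just builds the cvlc command line and the job text that would be piped to `at`, without running anything.

```python
import shlex


def build_record_command(stream_url, out_path, duration_s):
    """Build a cvlc argument list that dumps a stream to a file.

    Hypothetical helper: URL, path and duration are placeholders.
    """
    return [
        "cvlc",
        stream_url,
        f"--sout=file/ts:{out_path}",  # write the stream to a TS file
        f"--run-time={duration_s}",    # stop playback after this many seconds
        "vlc://quit",                  # make VLC exit when the item ends
    ]


def build_at_job(start_time, command):
    """Return the argv for `at` and the job text to send to its stdin."""
    return ["at", start_time], shlex.join(command) + "\n"


if __name__ == "__main__":
    cmd = build_record_command("http://192.0.2.10/stream", "/tmp/rec.ts", 3600)
    at_argv, job_text = build_at_job("20:15", cmd)
    print(at_argv)
    print(job_text, end="")
```

In practice the job text would be handed to `at` on standard input, for example via `subprocess.run(at_argv, input=job_text, text=True)`.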
Sometimes the Proxmox host goes down: neither the Ubuntu VM nor the Proxmox host is reachable. This always happens while a TV recording is running, every few weeks, maybe on every 30th recording or every 10th hour of recording or something like that. Once it happened twice within 24 hours; other times it runs for weeks and records flawlessly. I have no idea what triggers it.
After asking in a German Ubuntu forum, I found out that an invalid file handle shows up in the strace output. I changed the setup so that recordings are no longer written directly to the NAS but are stored locally in the VM and moved to the NAS destination afterwards. I thought that helped, but it didn't (it happened again yesterday). But even if something fails, it fails inside the VM! That should not be able to take the Proxmox host offline!
The ThinkCentre runs headless. I measure its power consumption: it usually draws around 9 W, and when it becomes unreachable it draws around 20 W. I then toggle the power remotely (the measuring plug can do this), and after booting it is available again ...
Any idea which logging would help to analyze the problem? How can the Proxmox host go down at all, regardless of what cvlc is doing wrong inside the VM?