[TUTORIAL] Hardware watchdog at a per-VM level

Walhalla · Jun 3, 2024

Hi, I have had some problems with the OOM killer "at home" and was searching for a solution, at least to detect and recover a killed VM.

My idea was to monitor the log file on the host for any OOM killer entries and, if one appears, restart the affected VM (I could double-check that the VM is really down with a simple ping first).
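Roughly what I have in mind, as a minimal Python sketch and not a tested solution. Assumptions: the kernel logs OOM kills in the usual "Killed process <pid> (<name>)" form, the script runs as root on the Proxmox host where `qm` is available, and the function names and the VM ID/IP address passed in are only placeholders for illustration. It does not map the killed PID back to a VM ID; it only checks whether a given VM is down.

```python
import re
import subprocess

# Assumed kernel log format, e.g.
# "Out of memory: Killed process 1234 (kvm) total-vm:..."
OOM_RE = re.compile(r"Killed process (\d+) \(([^)]+)\)")


def find_oom_kills(lines):
    """Return a list of (pid, process_name) for each OOM-kill line found."""
    kills = []
    for line in lines:
        match = OOM_RE.search(line)
        if match:
            kills.append((int(match.group(1)), match.group(2)))
    return kills


def vm_is_running(vmid):
    """Ask Proxmox for the VM state via `qm status <vmid>`."""
    result = subprocess.run(
        ["qm", "status", str(vmid)], capture_output=True, text=True
    )
    return "status: running" in result.stdout


def answers_ping(address):
    """Send one ping with a 2 second timeout; True if the guest replies."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", "2", address],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def recover_vm(vmid, address):
    """Start the VM only if Proxmox reports it stopped and ping fails."""
    if vm_is_running(vmid) or answers_ping(address):
        return False
    subprocess.run(["qm", "start", str(vmid)], check=True)
    return True
```

The log lines could come from something like `journalctl -k` output or a kernel log file, depending on how the host is set up. If `find_oom_kills` returns an entry whose process name is `kvm`, the script would then call `recover_vm` for each VM it is supposed to watch.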

Is this a stupid idea, or impossible for some reason?

LnxBil · Jun 6, 2024

Walhalla said:
Hi, I have had some problems with the OOM killer "at home" and was searching for a solution, at least to detect and recover a killed VM.

My idea was to monitor the log file on the host for any OOM killer entries and, if one appears, restart the affected VM (I could double-check that the VM is really down with a simple ping first).

In case of an OOM, the VM is killed, so you need to start it. If the VM was just killed, starting it will probably yield another OOM. You need to fix the OOM condition in order to get a stable system.

cweakland · Mar 9, 2025

darknezz said:
How do you add the watchdog to a Home Assistant VM?

It looks like it has already been implemented: https://github.com/home-assistant/operating-system/pull/2627
