I'm periodically having issues with LXC containers crashing the host node.
The errors on the node are the classic nmi_watchdog "stuck" messages, and I believe that so far I have been treating the symptom instead of the cause.
Today, I had a very interesting "customer". His container was using 100% of his CPU (1...