Hi, I have a Proxmox 7 installation that's been going strong until just this morning. From what I can gather, any running QEMU process causes **very** fast memory allocation on the host until eventually, everything is OOM killed. The PVE node is itself a virtual machine. For the purposes of this thread "host" will refer to the hypervisor in which PVE is running, not PVE itself.
Here's what I know:
- On 1 August, the PVE node's packages were updated, and the host rebooted. No problems, and no memory leak.
- This morning, 11 August, there were no packages on PVE to be updated (so, nothing in that regard changed from 1 August to 11 August), and the host was rebooted.
- Upon starting up, *something* begins to allocate ~100-200MB/second of memory until the machine crashes.
- With the fleeting seconds of usability I had with each boot, I manually disabled each container and VM set to start when the PVE node does
- The leak stopped happening as soon as no QEMU/KVM guests were set to start with the PVE node
- Starting any QEMU/KVM guest, any at all, causes the memory leak to occur. The leak continues until the guest is stopped.
- The amount of memory consumed is not dependent on the amount of memory assigned to the guest. For example, a guest only assigned 512MB of memory can still consume all 16GB of memory on the PVE node.
- The memory is not returned when the guest stops.
- Nothing in htop or ps is revealing what is using all of this memory.
- Rolling back to an earlier kernel revision does not solve the problem.
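
In case anyone wants to reproduce the numbers above, here is a minimal sketch of how the growth rate can be measured when htop and ps show nothing. It assumes a Linux `/proc/meminfo` with the `MemTotal` and `MemAvailable` fields; the function names (`parse_meminfo`, `used_kb`, `growth_mb_per_s`, `watch`) are just illustrative names, not part of any Proxmox tooling.

```python
import time


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of field name -> integer value.

    Most fields are in kB; a few (e.g. HugePages_Total) are plain counts.
    """
    fields = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def used_kb(fields):
    """Memory in use from the kernel's point of view: MemTotal - MemAvailable."""
    return fields["MemTotal"] - fields["MemAvailable"]


def growth_mb_per_s(used_before_kb, used_after_kb, seconds):
    """Average growth in MB/s between two samples taken `seconds` apart."""
    return (used_after_kb - used_before_kb) / 1024 / seconds


def read_meminfo(path="/proc/meminfo"):
    """Read and parse the live meminfo file (Linux only)."""
    with open(path) as f:
        return parse_meminfo(f.read())


def watch(samples=5, interval=1.0):
    """Print the growth rate plus a few kernel-side counters for each sample.

    Slab, SUnreclaim and PageTables are not counted in per-process RSS,
    so growth there would not show up against any process in htop or ps.
    """
    prev = read_meminfo()
    for _ in range(samples):
        time.sleep(interval)
        cur = read_meminfo()
        rate = growth_mb_per_s(used_kb(prev), used_kb(cur), interval)
        print(
            f"used={used_kb(cur) // 1024} MB  rate={rate:+.1f} MB/s  "
            f"slab={cur.get('Slab', 0) // 1024} MB  "
            f"sunreclaim={cur.get('SUnreclaim', 0) // 1024} MB  "
            f"pagetables={cur.get('PageTables', 0) // 1024} MB"
        )
        prev = cur
```

Calling `watch()` on the PVE node right after starting a guest prints one line per second. If `used` climbs while none of the kernel-side counters move, the growth is somewhere this sketch does not look, so treat it as a starting point only.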
I'm ready to post logs, version numbers, etc.; just let me know what you need. I did some cursory searching on both Google and this forum, but it looks like I'm the first person to notice this problem.