I'm having an issue with a few Proxmox 6.x hosts where the total *used* RAM as shown in top and free are roughly double or more the amount of RAM used by the VMs themselves. There's a mixture of LXC and KVM, and in some cases I believe the Windows VMs do not have the balloon driver installed, but even if I add up the total allocated RAM (not just used RAM) of all the containers/VMs, I still get a number roughly half of what's actually showing as used.
I'm well aware of zfs caching and how you can trick yourself into thinking you don't actually have any free RAM, but in this case I'm pretty confident this isn't that -- for one thing, I'd expect zfs cache to show up as buff/cache in top and free. But what I'm seeing is way too much RAM under "used" without large amounts in free, buff/cache, or avail. Additionally htop shows it as green memory usage (rather than yellow for cache), and I'm actually waking up the OOM killer, which is killing some of my VMs.
Any suggestions? It doesn't seem to be caching, and the amount of RAM allocated to the containers/VMs doesn't seem to be nearly enough to account for the used RAM. These machines have ~128GB RAM, so what I'm seeing is something like 110GB used (not including buff/cache) out of 128GB, but less than half of that actually allocated to containers or VMs, and maybe 1/4 of that if you only counted RAM actually used within the containers/VMs.
I'm well aware of zfs caching and how you can trick yourself into thinking you don't actually have any free RAM, but in this case I'm pretty confident this isn't that -- for one thing, I'd expect zfs cache to show up as buff/cache in top and free. But what I'm seeing is way too much RAM under "used" without large amounts in free, buff/cache, or avail. Additionally htop shows it as green memory usage (rather than yellow for cache), and I'm actually waking up the OOM killer, which is killing some of my VMs.
Any suggestions? It doesn't seem to be caching, and the amount of RAM allocated to the containers/VMs doesn't seem to be nearly enough to account for the used RAM. These machines have ~128GB RAM, so what I'm seeing is something like 110GB used (not including buff/cache) out of 128GB, but less than half of that actually allocated to containers or VMs, and maybe 1/4 of that if you only counted RAM actually used within the containers/VMs.