[SOLVED] Proxmox Memory Leak in VM? or ZFS issue?

utorth

Member
Jul 20, 2020
One Proxmox node
4 VMs
1 ZFS NFS share

Every once in a while, when I move files from the VM that I have set up to access the ZFS NFS share, the VM crashes. After it reboots, the total memory available on the VM dwindles until it starts crashing again. The only fix I have found is to reboot the node completely; after that, everything returns to normal.

Here's top on the VM after a few days of running normally (when I haven't accessed the ZFS NFS share):

Code:
top - 11:10:23 up 3 days, 23:08,  2 users,  load average: 0.70, 0.56, 0.48
Tasks: 268 total,   1 running, 267 sleeping,   0 stopped,   0 zombie
%Cpu(s):  0.0 us,  1.1 sy,  4.1 ni, 89.8 id,  3.6 wa,  0.0 hi,  1.4 si,  0.0 st
KiB Mem : 65806168 total,   352844 free,  3962160 used, 61491164 buff/cache
KiB Swap: 33030140 total, 32992836 free,    37304 used. 61302144 avail Mem

Code:
[root@localhost ~]# free
              total        used        free      shared  buff/cache   available
Mem:       65806168     3962512      411388        5560    61432268    61301984
Swap:      33030140       37304    32992836

Here's top on the VM when it's crashing. The KiB Mem total starts at the maximum of 64 GB, and then RAM starts disappearing. In this case it's already down to 52 GB total after 18 minutes. It slowly goes down to zero, and then the VM crashes again.

Code:
top - 10:08:55 up 18 min,  2 users,  load average: 0.04, 0.27, 0.21
Tasks: 253 total,   1 running, 252 sleeping,   0 stopped,   0 zombie
%Cpu(s):  0.1 us,  0.0 sy,  0.0 ni, 99.9 id,  0.0 wa,  0.0 hi,  0.0 si,  0.0 st
KiB Mem : 52477784 total, 50751324 free,  1109224 used,   617236 buff/cache
KiB Swap: 33030140 total, 33030140 free,        0 used. 50872556 avail Mem

Code:
[root@localhost ~]# free
              total        used        free      shared  buff/cache   available
Mem:       51453784     1107844    49712272        9184      633668    49849924
Swap:      33030140           0    33030140

After 40 minutes, total RAM on the VM is down to 38 GB:

Code:
top - 10:30:53 up 40 min,  2 users,  load average: 0.05, 0.19, 0.25
Tasks: 254 total,   1 running, 253 sleeping,   0 stopped,   0 zombie
%Cpu(s):  0.1 us,  0.1 sy,  0.0 ni, 99.8 id,  0.0 wa,  0.0 hi,  0.0 si,  0.0 st
KiB Mem : 38960984 total, 36769848 free,  1090380 used,  1100756 buff/cache
KiB Swap: 33030140 total, 33030140 free,        0 used. 37368412 avail Mem

Code:
[root@localhost ~]# free
              total        used        free      shared  buff/cache   available
Mem:       38551384     1089924    36352536        9188     1108924    36959300
Swap:      33030140           0    33030140

Any ideas?
------

root@proxmox:~# pveversion -v
proxmox-ve: 6.1-2 (running kernel: 5.3.18-3-pve)
pve-manager: 6.1-8 (running version: 6.1-8/806edfe1)
pve-kernel-helper: 6.1-8
pve-kernel-5.3: 6.1-6
pve-kernel-5.3.18-3-pve: 5.3.18-3
pve-kernel-5.3.18-1-pve: 5.3.18-1
pve-kernel-5.3.10-1-pve: 5.3.10-1
ceph-fuse: 12.2.11+dfsg1-2.1+b1
corosync: 3.0.3-pve1
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.15-pve1
libpve-access-control: 6.0-6
libpve-apiclient-perl: 3.0-3
libpve-common-perl: 6.0-17
libpve-guest-common-perl: 3.0-5
libpve-http-server-perl: 3.0-5
libpve-storage-perl: 6.1-5
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 3.2.1-1
lxcfs: 4.0.1-pve1
novnc-pve: 1.1.0-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.1-3
pve-cluster: 6.1-4
pve-container: 3.0-23
pve-docs: 6.1-6
pve-edk2-firmware: 2.20200229-1
pve-firewall: 4.0-10
pve-firmware: 3.0-7
pve-ha-manager: 3.0-9
pve-i18n: 2.0-4
pve-qemu-kvm: 4.1.1-4
pve-xtermjs: 4.3.0-1
qemu-server: 6.1-7
smartmontools: 7.1-pve2
spiceterm: 3.1-1
vncterm: 1.6-1
zfsutils-linux: 0.8.3-pve1
 
Hi,

Please upgrade to the latest version and try again.
apt update && apt full-upgrade

pve-qemu-kvm: 4.1.1-4 should be pve-qemu-kvm: 5.0.0-11
qemu-server: 6.1-7 should be qemu-server: 6.2-10
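
If it helps to check this after upgrading, here is a small Python sketch that parses `pveversion -v` style output and reports which of the two packages named above still differ from the suggested versions. The helper names and the `TARGETS` table are made up for illustration, and it only does a plain string comparison (not real Debian version ordering), so a newer version than the target is also reported as different.

```python
import re

# Versions suggested in the reply above
TARGETS = {"pve-qemu-kvm": "5.0.0-11", "qemu-server": "6.2-10"}

# Package names: lowercase letters, digits, '+', '.', '-'
_NAME = re.compile(r"^[a-z0-9][a-z0-9+.-]*$")


def parse_pveversion(text):
    """Map package name -> version from 'name: version ...' lines."""
    versions = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        name, rest = name.strip(), rest.strip()
        # Skip shell prompts, blank lines, and anything that is not 'pkg: ver'
        if not sep or not rest or not _NAME.match(name):
            continue
        versions[name] = rest.split()[0]
    return versions


def differing(text, targets=TARGETS):
    """List (package, installed, target) where installed != target."""
    installed = parse_pveversion(text)
    return [
        (pkg, installed.get(pkg), want)
        for pkg, want in targets.items()
        if installed.get(pkg) != want
    ]


sample = """root@proxmox:~# pveversion -v
proxmox-ve: 6.1-2 (running kernel: 5.3.18-3-pve)
pve-qemu-kvm: 4.1.1-4
qemu-server: 6.1-7
"""

for pkg, have, want in differing(sample):
    print(f"{pkg}: installed {have}, suggested {want}")
```

Replace `sample` with the real output of `pveversion -v` to run it against the node.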
 

Updated to latest, will monitor and report back.