Hi,
I found that only VM on server is going down randomly. From dmesg, I can find :
This node has 128GB of RAM and only VM where originally 125GB was assigned to as for now there is no other VM running on it. The VM is cpanel+cloudlinux and with couple of low traffic websites.
Interesting part is that from monitoring, I can see that total memory usage of VM was not more than 12-15GB when it was killed by OOM.
May anyone help me to how should troubleshoot this and how to avoid this happening in future ?
Have further reduced memory to ~96GB
I found that only VM on server is going down randomly. From dmesg, I can find :
Code:
[Thu Mar 23 02:51:34 2023] Out of memory: Killed process 538560 (kvm) total-vm:125506644kB, anon-rss:113157280kB, file-rss:3988kB, shmem-rss:4kB, UID:0 pgtables:237624kB oom_score_adj:0
[Sun Mar 26 05:09:06 2023] Out of memory: Killed process 1087978 (kvm) total-vm:127067788kB, anon-rss:113982728kB, file-rss:28kB, shmem-rss:0kB, UID:0 pgtables:240364kB oom_score_adj:0
This node has 128GB of RAM and only VM where originally 125GB was assigned to as for now there is no other VM running on it. The VM is cpanel+cloudlinux and with couple of low traffic websites.
Interesting part is that from monitoring, I can see that total memory usage of VM was not more than 12-15GB when it was killed by OOM.
May anyone help me to how should troubleshoot this and how to avoid this happening in future ?
Code:
qm config 5001
agent: 1,freeze-fs-on-backup=0
boot: order=scsi0
cipassword: **********
ciuser: root
cores: 40
cpu: host
ipconfig0: ip=x.x.x.x,gw=x.x.x.1
localtime: 0
machine: q35
memory: 96000
meta: creation-qemu=7.1.0,ctime=1677322719
name: ugi-nl-cl
net0: virtio=E6:10:88:42:58:7A,bridge=vmbr0
numa: 0
onboot: 1
ostype: l26
scsi0: local-lvm:vm-5001-disk-0,cache=writeback,discard=on,size=665360M,ssd=1
scsihw: virtio-scsi-pci
serial0: socket
smbios1: uuid=9163a2fc-5fc7-4be7-a844-26e6196798e1
sockets: 1
sshkeys:
vga: qxl
vmgenid: ee77a50f-a636-48a6-b112-2eaa71cd31ca
Code:
proxmox-ve: 7.3-1 (running kernel: 5.15.102-1-pve)
pve-manager: 7.4-3 (running version: 7.4-3/9002ab8a)
pve-kernel-5.15: 7.3-3
pve-kernel-helper: 7.2-14
pve-kernel-5.15.102-1-pve: 5.15.102-1
pve-kernel-5.15.74-1-pve: 5.15.74-1
ceph-fuse: 15.2.17-pve1
corosync: 3.1.7-pve1
criu: 3.15-1+pve-1
glusterfs-client: 9.2-1
ifupdown2: 3.1.0-1+pmx3
ksm-control-daemon: 1.4-1
libjs-extjs: 7.0.0-1
libknet1: 1.24-pve2
libproxmox-acme-perl: 1.4.4
libproxmox-backup-qemu0: 1.3.1-1
libproxmox-rs-perl: 0.2.1
libpve-access-control: 7.4-2
libpve-apiclient-perl: 3.2-1
libpve-common-perl: 7.3-3
libpve-guest-common-perl: 4.2-4
libpve-http-server-perl: 4.2-1
libpve-rs-perl: 0.7.5
libpve-storage-perl: 7.4-2
libspice-server1: 0.14.3-2.1
lvm2: 2.03.11-2.1
lxc-pve: 5.0.2-2
lxcfs: 5.0.3-pve1
novnc-pve: 1.4.0-1
proxmox-backup-client: 2.3.3-1
proxmox-backup-file-restore: 2.3.3-1
proxmox-mail-forward: 0.1.1-1
proxmox-mini-journalreader: 1.3-1
proxmox-widget-toolkit: 3.6.3
pve-cluster: 7.3-3
pve-container: 4.4-3
pve-docs: 7.4-2
pve-edk2-firmware: 3.20221111-2
pve-firewall: 4.3-1
pve-firmware: 3.6-4
pve-ha-manager: 3.6.0
pve-i18n: 2.11-1
pve-qemu-kvm: 7.2.0-8
pve-xtermjs: 4.16.0-1
qemu-server: 7.4-2
smartmontools: 7.2-pve3
spiceterm: 3.2-2
swtpm: 0.8.0~bpo11+3
vncterm: 1.7-1
zfsutils-linux: 2.1.9-pve1