Ubuntu 20.04-server + jellyfin + gpu passthrough = intermittent suspend

nopeimgood

New Member
Mar 25, 2021
2
1
1
30
Have an issue with a vm constantly getting put into suspended state after ~20 minutes. Have soft-rebooted, hard-rebooted, and re-built the vm with no fix. Have also tried a silly "keep alive" script in case the gpu-passthrough was expecting some terminal action before hibernating. Thinking there's either an issue with the gpu passthrough, with the disk space, or RAM (since I've experienced some memory leaks with jellyfin in the past). Relevant outputs below. Has anyone had this issue before?

config
Code:
$ qm config 104
balloon: 2000
bios: ovmf
bootdisk: scsi0
cores: 4
efidisk0: local-lvm:vm-104-disk-0,size=128K
hostpci0: 03:00,pcie=1,x-vga=1
machine: q35
memory: 8000
name: jellyfin
net0: virtio=8A:C7:0A:FE:C5:71,bridge=vmbr0,firewall=1
numa: 0
ostype: l26
scsi0: HDD-3TB:vm-104-disk-0,cache=writeback,size=1000G
scsihw: virtio-scsi-pci
smbios1: uuid=a436cd78-81d7-464a-8774-7a68d93dfe6d
sockets: 2
vmgenid: 905380e8-1f85-4c99-9b40-bff916872057

disk output
Code:
$ df -h --total
Filesystem                         Size  Used Avail Use% Mounted on
udev                               3.8G     0  3.8G   0% /dev
tmpfs                              726M  1.5M  724M   1% /run
/dev/mapper/ubuntu--vg-ubuntu--lv  982G  243G  690G  27% /
tmpfs                              3.6G     0  3.6G   0% /dev/shm
tmpfs                              5.0M     0  5.0M   0% /run/lock
tmpfs                              3.6G     0  3.6G   0% /sys/fs/cgroup
/dev/sda2                          976M  131M  778M  15% /boot
/dev/loop2                          33M   33M     0 100% /snap/snapd/11402
/dev/loop0                          71M   71M     0 100% /snap/lxd/19647
/dev/loop3                          72M   72M     0 100% /snap/lxd/16099
/dev/loop1                          56M   56M     0 100% /snap/core18/1988
/dev/loop5                          30M   30M     0 100% /snap/snapd/8542
/dev/loop4                          55M   55M     0 100% /snap/core18/1880
/dev/sda1                          511M  7.9M  504M   2% /boot/efi
tmpfs                              736M   16K  736M   1% /run/user/122
tmpfs                              736M  4.0K  736M   1% /run/user/1000
total                              997G  244G  704G  26% -

GPU passthrough works no problem
Code:
$ nvidia-smi
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 460.32.03    Driver Version: 460.32.03    CUDA Version: 11.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  GeForce GTX 1080    Off  | 00000000:01:00.0 Off |                  N/A |
| 33%   37C    P8     9W / 190W |     13MiB /  8119MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A       983      G   /usr/lib/xorg/Xorg                  8MiB |
|    0   N/A  N/A      1044      G   /usr/bin/gnome-shell                2MiB |
+-----------------------------------------------------------------------------+

and RAM usage
Code:
$ free
              total        used        free      shared  buff/cache   available
Mem:        7954468      732232     6445036        3380      777200     6950292
Swap:       4194300           0     4194300
 
I don't know why this was happening, but in /var/log/syslog I found some systemd-sleep logs. Haven't seen this in other ubuntu-server vms, but maybe helpful for someone else.

Fixed with this for now.
Code:
sudo systemctl mask sleep.target suspend.target hibernate.target hybrid-sleep.target

No suspends for ~60 mins so far.
 
  • Like
Reactions: leesteken
I don't know why this was happening, but in /var/log/syslog I found some systemd-sleep logs. Haven't seen this in other ubuntu-server vms, but maybe helpful for someone else.

Fixed with this for now.
Code:
sudo systemctl mask sleep.target suspend.target hibernate.target hybrid-sleep.target

No suspends for ~60 mins so far.
This helped me today. Thank you!