Proxmox VE 6.2-4, SMP PVE 5.4.41-1
I've passed through my Intel HD Graphics 630 to a vm running Ubuntu 18.04.2 (5.3.0-51-generic)
I use the quicksync feature on the iGD for plex transcoding. At first it seemed to be working as it should, but a friend mentioned he was having some trouble with the streams stopping and glitching a couple of times.
Then I tested some myself and put it under stress and quickly realized that as soon as there is a bit of load it buckles.
I was monitoring dmesg while I was testing and I keep getting GPU HANG and recovery timed out. When that happens everything stops, even froze the VM at some point.
I mentioned this over at the Plex-forum and they said the same thing as my googling have resulted in, that the kernel driver for i915 is having trouble.
There seems to be lots of threads regarding this problem if I google GPU HANG: ecode 9:0:0x00000000, hang on vecs0, some mentioning that it started somewhere after kernel 5.3 https://bbs.archlinux.org/viewtopic.php?id=250765
Anyone know if there is something I can do about this? Ive tried using a couple of different i915 module options recommended, but none seem to fix it.
What sort of logs would be needed to to some troubleshooting on this?
Edit: Here is some info about the VM as well
I've passed through my Intel HD Graphics 630 to a vm running Ubuntu 18.04.2 (5.3.0-51-generic)
I use the quicksync feature on the iGD for plex transcoding. At first it seemed to be working as it should, but a friend mentioned he was having some trouble with the streams stopping and glitching a couple of times.
Then I tested some myself and put it under stress and quickly realized that as soon as there is a bit of load it buckles.
I was monitoring dmesg while I was testing and I keep getting GPU HANG and recovery timed out. When that happens everything stops, even froze the VM at some point.
Code:
[mai19 17:35] i915 0000:00:10.0: GPU HANG: ecode 9:0:0x00000000, hang on vecs0
[ +0,001011] i915 0000:00:10.0: Resetting vecs0 for hang on vecs0
[ +7,981283] i915 0000:00:10.0: Resetting vecs0 for hang on vecs0
[ +1,986848] i915 0000:00:10.0: GPU recovery timed out, cancelling all in-flight rendering.
I mentioned this over at the Plex-forum and they said the same thing as my googling have resulted in, that the kernel driver for i915 is having trouble.
There seems to be lots of threads regarding this problem if I google GPU HANG: ecode 9:0:0x00000000, hang on vecs0, some mentioning that it started somewhere after kernel 5.3 https://bbs.archlinux.org/viewtopic.php?id=250765
Anyone know if there is something I can do about this? Ive tried using a couple of different i915 module options recommended, but none seem to fix it.
What sort of logs would be needed to to some troubleshooting on this?
Edit: Here is some info about the VM as well
Code:
root@prox:~# cat /etc/pve/qemu-server/103.conf
agent: 1
bios: seabios
bootdisk: scsi0
cores: 4
hostpci0: 00:02
memory: 9032
name: tools
net0: virtio=A6:DF:E3:89:A1:F4,bridge=vmbr0,firewall=1
numa: 0
onboot: 1
ostype: l26
scsi0: data:vm-103-disk-0,discard=on,size=62G,ssd=1
scsihw: virtio-scsi-pci
smbios1: uuid=6187827f-2b2c-4c32-b07c-bbc41aa8b6ea
sockets: 1
vmgenid: d464ca53-e753-4886-94db-90b0a86bcc1c
Last edited: