Hi!
i have a bit of a problem, when having load on my AMD RX Vega 64 GPU (Adrenaline 19.10.1 driver) that is passed through to a Windows 10 (1903) VM it crashed the kernel after a while on the host.
if the windows machine just ideling and not rendering more then the windows desktop it is stable for a long time.
This is pressent on both VE 5.4 and on VE 6.0
i manage to snag this up from the syslog on the host. (To long to post here so on Pastebin)
https://pastebin.com/TjnzGP4c
The Host is a SuperMicro Quad Xeon machine
Some Configuration of the Host:
root@pmox1:~# cat /etc/modprobe.d/blacklist.conf
blacklist radeon
blacklist nouveau
blacklist nvidia
blacklist amdgpu
blacklist snd_hda_intel
root@pmox1:~# lspci -nnv
04:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Vega 10 XL/XT [Radeon RX Vega 56/64] [1002:687f] (rev c1) (prog-if 00 [VGA controller])
Subsystem: Micro-Star International Co., Ltd. [MSI] Vega 10 XL/XT [Radeon RX Vega 56/64] [1462:3680]
Flags: bus master, fast devsel, latency 0, IRQ 97, NUMA node 0
Memory at a0000000 (64-bit, prefetchable) [size=256M]
Memory at b0000000 (64-bit, prefetchable) [size=2M]
I/O ports at 4000 [size=256
Memory at bb900000 (32-bit, non-prefetchable) [size=512K]
Expansion ROM at bb980000 [disabled] [size=128K]
Capabilities: <access denied>
Kernel driver in use: vfio-pci
Kernel modules: amdgpu
04:00.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Vega 10 HDMI Audio [Radeon Vega 56/64] [1002:aaf8]
Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Vega 10 HDMI Audio [Radeon Vega 56/64] [1002:aaf8]
Flags: bus master, fast devsel, latency 0, IRQ 94, NUMA node 0
Memory at bb9a0000 (32-bit, non-prefetchable) [size=16K]
Capabilities: <access denied>
Kernel driver in use: vfio-pci
Kernel modules: snd_hda_intel
I have tryed to google "DMAR: DRHD: handling fault status reg 40" but it seams that this error never have been writen about before.
Do anyone have any input how to solve this or where to look?
Thanks alot for reading!
i have a bit of a problem, when having load on my AMD RX Vega 64 GPU (Adrenaline 19.10.1 driver) that is passed through to a Windows 10 (1903) VM it crashed the kernel after a while on the host.
if the windows machine just ideling and not rendering more then the windows desktop it is stable for a long time.
This is pressent on both VE 5.4 and on VE 6.0
i manage to snag this up from the syslog on the host. (To long to post here so on Pastebin)
https://pastebin.com/TjnzGP4c
The Host is a SuperMicro Quad Xeon machine
Some Configuration of the Host:
root@pmox1:~# cat /etc/modprobe.d/blacklist.conf
blacklist radeon
blacklist nouveau
blacklist nvidia
blacklist amdgpu
blacklist snd_hda_intel
root@pmox1:~# lspci -nnv
04:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Vega 10 XL/XT [Radeon RX Vega 56/64] [1002:687f] (rev c1) (prog-if 00 [VGA controller])
Subsystem: Micro-Star International Co., Ltd. [MSI] Vega 10 XL/XT [Radeon RX Vega 56/64] [1462:3680]
Flags: bus master, fast devsel, latency 0, IRQ 97, NUMA node 0
Memory at a0000000 (64-bit, prefetchable) [size=256M]
Memory at b0000000 (64-bit, prefetchable) [size=2M]
I/O ports at 4000 [size=256
Memory at bb900000 (32-bit, non-prefetchable) [size=512K]
Expansion ROM at bb980000 [disabled] [size=128K]
Capabilities: <access denied>
Kernel driver in use: vfio-pci
Kernel modules: amdgpu
04:00.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Vega 10 HDMI Audio [Radeon Vega 56/64] [1002:aaf8]
Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Vega 10 HDMI Audio [Radeon Vega 56/64] [1002:aaf8]
Flags: bus master, fast devsel, latency 0, IRQ 94, NUMA node 0
Memory at bb9a0000 (32-bit, non-prefetchable) [size=16K]
Capabilities: <access denied>
Kernel driver in use: vfio-pci
Kernel modules: snd_hda_intel
I have tryed to google "DMAR: DRHD: handling fault status reg 40" but it seams that this error never have been writen about before.
Do anyone have any input how to solve this or where to look?
Thanks alot for reading!