(somewhat) random blue screens with Windows 10 Pro VM

dvdmeer

New Member
Dec 29, 2019
Hi,

I have been running my own Proxmox system (6.1-3) for some time and things have been great for the most part. Here is my hardware list:
AMD Ryzen 3950x
Asus ROG STRIX x570-E Gaming Motherboard
Corsair AX1200i PSU
G.Skill Trident Z Neo DDR4 64 GB (4x16)
Corsair H150i pro
AMD Sapphire Nitro+ Radeon RX 5700 XT
Gigabyte RTX 2070 Super Gaming OC
6 x Corsair ML120 Pro 120MM Case Fans
WD Red 10 TB 3.5" Disk
Seagate FireCuda 520 2TB
Corsair Commander Pro

Note that I have GPU passthrough working with my Nvidia card.

I use my system mainly for music production, so I installed my main OS (Windows 10 Pro - latest version with all updates) on the
Seagate FireCuda M.2 disk. I also have a 1 TB partition for all my audio samples + instruments.

For some time I have been getting blue screens in Windows 10 (video_dxgkrnl_fatal_error). At first this always seemed to happen
with my iCue software (used for RGB lighting): when I got an update for the software and uninstalled the old version, I got the
blue screen as soon as it tried to uninstall the driver. I would then do a complete reinstall of Windows 10 + all my software, and things
would be fine up to some point where the blue screens returned (I never figured out exactly which point because I forgot to make
snapshots / backups at each step of the install).
Now I decided to make backups at each stage of the install so I can pinpoint exactly when it happens. So far I have a backup of the
basic OS (no drivers), a backup with all drivers installed, and a backup with the basic software installed.
Up to this point I had no issues at all. Even my iCue software kept working.
Then I started installing my Ableton software and got the same blue screen during the installation. This has never happened before during
this installation. I was able to uninstall the software once, then tried reinstalling it, and the same thing happened.

Even though it always happens at the same point once it starts, it is still rather random which software triggers it: first my iCue
software and now my Ableton installation. I suspected that my M.2 disk was somehow the issue, so I restored my last backup to the
WD 10 TB disk instead. Unfortunately that was not the cause, and I got the same blue screen again.

I also noticed a lot (and I mean a lot) of messages like these in the log:

[117660.957017] kvm_get_msr_common: 665 callbacks suppressed
[117660.957018] kvm [6951]: vcpu5, guest rIP: 0xfffff8032288174b ignored rdmsr: 0xc0010064

Are these hardware-related? Are they serious, or can I safely ignore them? I know they can be turned off somewhere.
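
To get a feel for which registers are actually involved, the messages can be tallied per MSR address. This is only a minimal Python sketch, assuming the log lines look exactly like the two quoted above; the function name `tally_ignored_msrs` and the sample list are made up for illustration.

```python
import re
from collections import Counter

# Matches lines such as:
# [117660.957018] kvm [6951]: vcpu5, guest rIP: 0xfffff8032288174b ignored rdmsr: 0xc0010064
IGNORED_MSR = re.compile(
    r"kvm \[(?P<pid>\d+)\]: vcpu(?P<vcpu>\d+), guest rIP: 0x[0-9a-fA-F]+ "
    r"ignored (?P<op>rdmsr|wrmsr): (?P<msr>0x[0-9a-fA-F]+)"
)


def tally_ignored_msrs(lines):
    """Count ignored MSR accesses per (operation, MSR address)."""
    counts = Counter()
    for line in lines:
        match = IGNORED_MSR.search(line)
        if match:  # lines like "callbacks suppressed" simply do not match
            counts[(match["op"], match["msr"].lower())] += 1
    return counts


# Hypothetical input: the two lines from this post.
sample_log = [
    "[117660.957017] kvm_get_msr_common: 665 callbacks suppressed",
    "[117660.957018] kvm [6951]: vcpu5, guest rIP: 0xfffff8032288174b ignored rdmsr: 0xc0010064",
]

for (op, msr), n in tally_ignored_msrs(sample_log).items():
    print(f"{op} {msr}: {n}")
```

On a real host the lines could be fed in from the kernel log instead of the sample list. Note that "callbacks suppressed" means the kernel rate-limited the output, so the tally is a lower bound.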

I know that if I start a completely new installation from scratch, the blue screen will show up at some other point.
Note that the blue screen points to the video drivers, but the software that triggers the BSOD has nothing to do with the video
card drivers.

As a last try I thought it might be due to issues with my GPU passthrough, so I disabled it and used the default graphics card.
But even this did not help, and I got the same BSOD again during installation.

I would really like some help pinpointing this issue because it is driving me crazy.


Thanks
 
[117660.957017] kvm_get_msr_common: 665 callbacks suppressed
[117660.957018] kvm [6951]: vcpu5, guest rIP: 0xfffff8032288174b ignored rdmsr: 0xc0010064
Those are reads of unsupported registers (MSRs), which can be ignored.
Did you set 'ignore_msrs' somewhere? If not, try that.

Also, can you post your VM config? (qm config ID)
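
For checking whether the option is already active, here is a small Python sketch that reads the kvm module parameters from sysfs. It assumes the usual `/sys/module/kvm/parameters` location; the helper name `kvm_param` is made up for illustration. The setting itself is commonly made persistent with a line such as `options kvm ignore_msrs=1` in a file under `/etc/modprobe.d/`, but verify that against the documentation for your version.

```python
from pathlib import Path


def kvm_param(name, base="/sys/module/kvm/parameters"):
    """Return the current value of a kvm module parameter, or None if unreadable."""
    try:
        return Path(base, name).read_text().strip()
    except OSError:
        # Module not loaded, parameter missing, or no permission to read it.
        return None


# Assumption: both parameters exist on the running kernel; otherwise None is printed.
for name in ("ignore_msrs", "report_ignored_msrs"):
    print(name, "=", kvm_param(name))
```

A value of `Y` for `ignore_msrs` would mean unhandled MSR accesses are ignored instead of being passed to the guest as faults.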
 
I did set them now and the messages are gone, so thank you for that. I do have another serious crash, which may or may not have to do with my BIOS, but I'll put that in a separate post.
Here is my config:

agent: 1
balloon: 0
bios: ovmf
boot: cdn
bootdisk: sata0
cores: 8
cpu: host
efidisk0: local-lvm:vm-105-disk-0,size=4M
hostpci0: 07:00,pcie=1,rombar=0
hostpci1: 0d:00,pcie=1,x-vga=1
ide2: none,media=cdrom
machine: q35
memory: 32768
name: win10-pro-main-vm
net0: virtio=F6:0F:16:22:28:21,bridge=vmbr0,firewall=1
numa: 0
onboot: 1
ostype: win10
sata0: local-lvm:vm-105-disk-1,discard=on,size=80G,ssd=1
sata1: local-lvm:vm-105-disk-2,discard=on,size=1000G
scsihw: virtio-scsi-pci
smbios1: uuid=46aa8688-107c-4a71-90a8-c81877e94263
sockets: 1
vga: none
vmgenid: 618cce04-b7f1-410b-bb97-c2ac997f1225
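
In case it helps anyone comparing setups, a config like the one above can be picked apart with a few lines of Python to list the passed-through devices and their options. This is only a sketch, assuming the plain `key: value` layout shown above; the function names and the shortened sample are made up for illustration.

```python
def parse_qm_config(text):
    """Parse 'key: value' lines into a dict, splitting on the first colon only."""
    config = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            config[key.strip()] = value.strip()
    return config


def passthrough_devices(config):
    """Return {key: (pci_address, options)} for every hostpciN entry."""
    devices = {}
    for key, value in config.items():
        if key.startswith("hostpci"):
            address, *opts = value.split(",")
            devices[key] = (address, dict(o.split("=", 1) for o in opts if "=" in o))
    return devices


# Shortened copy of the config posted above.
sample_config = """\
bios: ovmf
hostpci0: 07:00,pcie=1,rombar=0
hostpci1: 0d:00,pcie=1,x-vga=1
machine: q35
vga: none
"""

config = parse_qm_config(sample_config)
for key, (address, opts) in passthrough_devices(config).items():
    print(key, address, opts)
print("vga =", config.get("vga"))
```

For this config it lists two passed-through devices, with `x-vga=1` set on `hostpci1` and `vga: none`.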
 
I would really like to know if this was ever figured out. I have the same symptom on my machine and am wondering whether I should create a new thread.
 
