High Temp of GPU?

killmasta93

Renowned Member
Aug 13, 2017
958
56
68
30
Hi
I was wondering if someone could shed some light, Currently i have zabbix monitoring the GPU temp which goes to 100C but dont understand why if the VMs dont use GPU as i dont have any passthough, Not sure how to trobleshoot the issue?
I already cleaned the Server and still shows high GPU

1705292969296.png
 
Have you ever touched the graphics card? Do you have sufficient cooling? Are all fans working?
 
nope fans working fine, we have ac 24/7, im going to see if i install the drivers of nvidia to see nvtop to see whats going on
 
Hi,

maybe the GPU is faulty? We once had also a GPU for testing purpose and not in use by VM's but the server fans were all on 100% because of the GPU. After removing, everything was back to normal and in the end it turned out the GPU was damaged.

Greetz
 
thanks for the reply currently i get this

Code:
00:00.0 Host bridge [0600]: Intel Corporation 5520 I/O Hub to ESI Port [8086:3406] (rev 13)
    Subsystem: Dell 5520 I/O Hub to ESI Port [1028:026e]
00:01.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 1 [8086:3408] (rev 13)
    Kernel driver in use: pcieport
00:03.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 3 [8086:340a] (rev 13)
    Kernel driver in use: pcieport
00:07.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 [8086:340e] (rev 13)
    Kernel driver in use: pcieport
00:14.0 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub System Management Registers [8086:342e] (rev 13)
    Subsystem: Device [0028:006e]
    Kernel driver in use: i7core_edac
    Kernel modules: i7core_edac
00:14.1 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub GPIO and Scratch Pad Registers [8086:3422] (rev 13)
    Subsystem: Device [0028:006e]
00:14.2 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub Control Status and RAS Registers [8086:3423] (rev 13)
    Subsystem: Device [0028:006e]
00:1a.0 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #4 [8086:3a37]
    Subsystem: Dell 82801JI (ICH10 Family) USB UHCI Controller [1028:026e]
    Kernel driver in use: uhci_hcd
00:1a.1 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #5 [8086:3a38]
    Subsystem: Dell 82801JI (ICH10 Family) USB UHCI Controller [1028:026e]
    Kernel driver in use: uhci_hcd
00:1a.2 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #6 [8086:3a39]
    Subsystem: Dell 82801JI (ICH10 Family) USB UHCI Controller [1028:026e]
    Kernel driver in use: uhci_hcd
00:1a.7 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #2 [8086:3a3c]
    Subsystem: Dell 82801JI (ICH10 Family) USB2 EHCI Controller [1028:026e]
    Kernel driver in use: ehci-pci
00:1b.0 Audio device [0403]: Intel Corporation 82801JI (ICH10 Family) HD Audio Controller [8086:3a3e]
    Subsystem: Dell 82801JI (ICH10 Family) HD Audio Controller [1028:026e]
    Kernel driver in use: snd_hda_intel
    Kernel modules: snd_hda_intel
00:1c.0 PCI bridge [0604]: Intel Corporation 82801JI (ICH10 Family) PCI Express Root Port 1 [8086:3a40]
    Kernel driver in use: pcieport
00:1c.5 PCI bridge [0604]: Intel Corporation 82801JI (ICH10 Family) PCI Express Root Port 6 [8086:3a4a]
    Kernel driver in use: pcieport
00:1d.0 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #1 [8086:3a34]
    Subsystem: Dell 82801JI (ICH10 Family) USB UHCI Controller [1028:026e]
    Kernel driver in use: uhci_hcd
00:1d.1 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #2 [8086:3a35]
    Subsystem: Dell 82801JI (ICH10 Family) USB UHCI Controller [1028:026e]
    Kernel driver in use: uhci_hcd
00:1d.2 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #3 [8086:3a36]
    Subsystem: Dell 82801JI (ICH10 Family) USB UHCI Controller [1028:026e]
    Kernel driver in use: uhci_hcd
00:1d.7 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #1 [8086:3a3a]
    Subsystem: Dell 82801JI (ICH10 Family) USB2 EHCI Controller [1028:026e]
    Kernel driver in use: ehci-pci
00:1e.0 PCI bridge [0604]: Intel Corporation 82801 PCI Bridge [8086:244e] (rev 90)
00:1f.0 ISA bridge [0601]: Intel Corporation 82801JIR (ICH10R) LPC Interface Controller [8086:3a16]
    Subsystem: Dell 82801JIR (ICH10R) LPC Interface Controller [1028:026e]
    Kernel driver in use: lpc_ich
    Kernel modules: lpc_ich
00:1f.2 SATA controller [0106]: Intel Corporation 82801JI (ICH10 Family) SATA AHCI Controller [8086:3a22]
    Subsystem: Dell 82801JI (ICH10 Family) SATA AHCI Controller [1028:026e]
    Kernel driver in use: ahci
    Kernel modules: ahci
00:1f.3 SMBus [0c05]: Intel Corporation 82801JI (ICH10 Family) SMBus Controller [8086:3a30]
    Subsystem: Dell 82801JI (ICH10 Family) SMBus Controller [1028:026e]
    Kernel driver in use: i801_smbus
    Kernel modules: i2c_i801
01:00.0 PCI bridge [0604]: Pericom Semiconductor PCI Express to PCI-XPI7C9X130 PCI-X Bridge [12d8:e130] (rev 04)
03:00.0 VGA compatible controller [0300]: NVIDIA Corporation G98 [GeForce 9300 GS] [10de:06e1] (rev a1)
    Kernel driver in use: nouveau
    Kernel modules: nvidiafb, nouveau
04:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller [10ec:8168] (rev 06)
    Subsystem: TP-LINK Technologies Co., Ltd. TG-3468 Gigabit PCI Express Network Adapter [7470:3468]
    Kernel driver in use: r8169
    Kernel modules: r8169
06:00.0 Ethernet controller [0200]: Broadcom Limited NetXtreme BCM5761 Gigabit Ethernet PCIe [14e4:1681] (rev 10)
    Subsystem: Dell NetXtreme BCM5761 Gigabit Ethernet PCIe [1028:026e]
    Kernel driver in use: tg3
    Kernel modules: tg3
20:07.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 [8086:340e] (rev 13)
    Kernel driver in use: pcieport
20:09.0 PCI bridge [0604]: Intel Corporation 7500/5520/5500/X58 I/O Hub PCI Express Root Port 9 [8086:3410] (rev 13)
    Kernel driver in use: pcieport
20:14.0 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub System Management Registers [8086:342e] (rev 13)
    Kernel modules: i7core_edac
20:14.1 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub GPIO and Scratch Pad Registers [8086:3422] (rev 13)
20:14.2 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub Control Status and RAS Registers [8086:3423] (rev 13)
22:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller [10ec:8168] (rev 06)
    Subsystem: TP-LINK Technologies Co., Ltd. TG-3468 Gigabit PCI Express Network Adapter [7470:3468]
    Kernel driver in use: r8169
    Kernel modules: r8169
23:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. Device [10ec:8161] (rev 15)
    Subsystem: Realtek Semiconductor Co., Ltd. Device [10ec:8168]
    Kernel driver in use: r8169
    Kernel modules: r8169
3f:00.0 Host bridge [0600]: Intel Corporation Xeon 5600 Series QuickPath Architecture Generic Non-core Registers [8086:2c70] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series QuickPath Architecture Generic Non-core Registers [8086:8086]
3f:00.1 Host bridge [0600]: Intel Corporation Xeon 5600 Series QuickPath Architecture System Address Decoder [8086:2d81] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series QuickPath Architecture System Address Decoder [8086:8086]
3f:02.0 Host bridge [0600]: Intel Corporation Xeon 5600 Series QPI Link 0 [8086:2d90] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series QPI Link 0 [8086:8086]
3f:02.1 Host bridge [0600]: Intel Corporation Xeon 5600 Series QPI Physical 0 [8086:2d91] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series QPI Physical 0 [8086:8086]
3f:02.2 Host bridge [0600]: Intel Corporation Xeon 5600 Series Mirror Port Link 0 [8086:2d92] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Mirror Port Link 0 [8086:8086]
3f:02.3 Host bridge [0600]: Intel Corporation Xeon 5600 Series Mirror Port Link 1 [8086:2d93] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Mirror Port Link 1 [8086:8086]
3f:02.4 Host bridge [0600]: Intel Corporation Xeon 5600 Series QPI Link 1 [8086:2d94] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series QPI Link 1 [8086:8086]
3f:02.5 Host bridge [0600]: Intel Corporation Xeon 5600 Series QPI Physical 1 [8086:2d95] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series QPI Physical 1 [8086:8086]
3f:03.0 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Registers [8086:2d98] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Registers [8086:8086]
3f:03.1 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Target Address Decoder [8086:2d99] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Target Address Decoder [8086:8086]
3f:03.2 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller RAS Registers [8086:2d9a] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller RAS Registers [8086:8086]
3f:03.4 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Test Registers [8086:2d9c] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Test Registers [8086:8086]
3f:04.0 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Control [8086:2da0] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Control [8086:8086]
3f:04.1 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Address [8086:2da1] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Address [8086:8086]
3f:04.2 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Rank [8086:2da2] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Rank [8086:8086]
3f:04.3 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Thermal Control [8086:2da3] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Thermal Control [8086:8086]
3f:05.0 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Control [8086:2da8] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Control [8086:8086]
3f:05.1 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Address [8086:2da9] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Address [8086:8086]
3f:05.2 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Rank [8086:2daa] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Rank [8086:8086]
3f:05.3 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Thermal Control [8086:2dab] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Thermal Control [8086:8086]
3f:06.0 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Control [8086:2db0] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Control [8086:8086]
3f:06.1 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Address [8086:2db1] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Address [8086:8086]
3f:06.2 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Rank [8086:2db2] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Rank [8086:8086]
3f:06.3 Host bridge [0600]: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Thermal Control [8086:2db3] (rev 02)
    Subsystem: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Thermal Control [8086:8086]
 
thanks for the reply currently i get this

03:00.0 VGA compatible controller [0300]: NVIDIA Corporation G98 [GeForce 9300 GS] [10de:06e1] (rev a1) Kernel driver in use: nouveau Kernel modules: nvidiafb, nouveau
The open-source driver nouveau is loaded for this GPU. That driver is known to have no access to NVidia's documentation for re-clocking (which prevents any decent performance) but I don't know if that applies to the 9300. Maybe investigate if that's the case? Maybe pass it through to a minimal VM with the NVidia driver to put it in power save mode? Or install the proprietary NVidia driver on Proxmox (which might break when Proxmox updates the kernel)?
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!