Very High load average on my Proxmox server

luciferlu

Active Member
Apr 22, 2019
26
2
43
44
微信图片_20230608141155.png
Except the high load average there is nothing wrong with this machine. Which information should I provide to check this situation?
 
Hi,
what is the output of uptime, what cpu is this lscpu?
 
The high load value says that there are a lot of processes in the run queue. Please check with top or ps what those thousands of processes are.
 
Hi,
what is the output of uptime, what cpu is this lscpu?
Bash:
root@pve:~# lscpu
Architecture:                    x86_64
CPU op-mode(s):                  32-bit, 64-bit
Byte Order:                      Little Endian
Address sizes:                   39 bits physical, 48 bits virtual
CPU(s):                          16
On-line CPU(s) list:             0-15
Thread(s) per core:              2
Core(s) per socket:              8
Socket(s):                       1
NUMA node(s):                    1
Vendor ID:                       GenuineIntel
CPU family:                      6
Model:                           158
Model name:                      Genuine Intel(R) CPU 0000 @ 1.70GHz
Stepping:                        12
Frequency boost:                 enabled
CPU MHz:                         841.403
CPU max MHz:                     1701.0000
CPU min MHz:                     800.0000
BogoMIPS:                        3399.90
Virtualization:                  VT-x
L1d cache:                       256 KiB
L1i cache:                       256 KiB
L2 cache:                        2 MiB
L3 cache:                        16 MiB
NUMA node0 CPU(s):               0-15
Vulnerability Itlb multihit:     KVM: Mitigation: Split huge pages
Vulnerability L1tf:              Not affected
Vulnerability Mds:               Mitigation; Clear CPU buffers; SMT vulnerable
Vulnerability Meltdown:          Not affected
Vulnerability Mmio stale data:   Mitigation; Clear CPU buffers; SMT vulnerable
Vulnerability Retbleed:          Mitigation; IBRS
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitizat
                                 ion
Vulnerability Spectre v2:        Mitigation; IBRS, IBPB conditional, RSB filling, PBRSB-eIBRS Not
                                 affected
Vulnerability Srbds:             Mitigation; Microcode
Vulnerability Tsx async abort:   Mitigation; Clear CPU buffers; SMT vulnerable
Flags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat
                                  pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx
                                  pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_goo
                                 d nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes6
                                 4 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pc
                                 id sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave
                                  avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpci
                                 d_single ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vp
                                 id ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpci
                                 d rtm mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xge
                                 tbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window
                                  hwp_epp md_clear flush_l1d arch_capabilities
root@pve:~# uptime
 15:56:36 up 97 days, 21:08,  7 users,  load average: 1500.95, 1501.00, 1501.00


The cpu is a 9900t es.
 
The high load value says that there are a lot of processes in the run queue. Please check with top or ps what those thousands of processes are.
Yes. But a week ago the value is 140 and now it is 1500. The cpu usage is always less than 10%.
 
you can see more details with

cat /proc/pressure/cpu
cat /proc/pressure/io
cat /proc/pressure/memory

(to see exactly if the load is because of cpu or disk or mem)
thanks
Bash:
cat: /proc/pressure/iocat: No such file or directory
root@pve:~# cat /proc/pressure/io                                                                 some avg10=1.89 avg60=1.57 avg300=1.38 total=187966635651                                         full avg10=1.77 avg60=1.45 avg300=1.29 total=175867279749
root@pve:~# cat /proc/pressure/cpu                                                                some avg10=0.00 avg60=0.01 avg300=0.04 total=76129377964                                          full avg10=0.00 avg60=0.00 avg300=0.00 total=0
root@pve:~# cat /proc/pressure/mem                                                                cat: /proc/pressure/mem: No such file or directory                                                root@pve:~# cat /proc/pressure/memory
some avg10=0.00 avg60=0.00 avg300=0.00 total=117412023272                                         full avg10=0.00 avg60=0.00 avg300=0.00 total=116506418415                                         root@pve:~#
/CODE]

I use a 9 disks  zfs pool  on this server.
Is this  the problem with so high load average?
 
Last edited:

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!