Hello,
we have a recurring issue where one of our proxmox backup servers thats under heavy load stops working about once a month. The pbs stopped working differently every time but it was always due to the an Error some where stateing "Too many open files".
This time the proxmox-backup service just exited and did not restart automatically (systemctl status proxmox-backup said dead / inactive). This caused all logins to fail and we had to login via ssh. The issue was easily fixed this time by running "systemctl start proxmox-backup".
This is the relevant section from the syslog:
After that the log gets spammed with failed login requests as the backend isen't running
Ulimit:
Versions (we know these aren't the latest but we can only update monthly):
Is this a know issue?
Can we simply increase the file limit from 1024 to prevent this from happening? (Should this then also be adjusted on the installation iso)
Shouldn't systemd always restart the services?
we have a recurring issue where one of our proxmox backup servers thats under heavy load stops working about once a month. The pbs stopped working differently every time but it was always due to the an Error some where stateing "Too many open files".
This time the proxmox-backup service just exited and did not restart automatically (systemctl status proxmox-backup said dead / inactive). This caused all logins to fail and we had to login via ssh. The issue was easily fixed this time by running "systemctl start proxmox-backup".
This is the relevant section from the syslog:
Code:
Apr 5 01:14:48 srv-k0m1rh proxmox-backup-api[818]: server error: error accepting connection: Too many open files (os error 24)
Apr 5 01:14:48 srv-k0m1rh proxmox-backup-api[818]: SET SHUTDOWN MODE
Apr 5 01:14:48 srv-k0m1rh proxmox-backup-api[818]: daemon shutting down...
Apr 5 01:14:48 srv-k0m1rh proxmox-backup-api[818]: daemon shut down...
Apr 5 01:14:48 srv-k0m1rh proxmox-backup-api[818]: server shutting down, waiting for active workers to complete
Apr 5 01:14:48 srv-k0m1rh proxmox-backup-api[818]: done - exit server
After that the log gets spammed with failed login requests as the backend isen't running
Code:
Apr 5 01:14:48 srv-k0m1rh proxmox-backup-proxy[903]: POST /api2/json/access/ticket: 400 Bad Request: [client [::ffff:10.91.102.101]:35546] connection error: Connection reset by peer (os err
or 104)
Ulimit:
Code:
$ulimit -a
core file size (blocks, -c) 0
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 63804
max locked memory (kbytes, -l) 65536
max memory size (kbytes, -m) unlimited
open files (-n) 1024
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) 8192
cpu time (seconds, -t) unlimited
max user processes (-u) 63804
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
Versions (we know these aren't the latest but we can only update monthly):
Code:
$proxmox-backup-manager versions --verbose
proxmox-backup 1.0-4 running kernel: 5.4.103-1-pve
proxmox-backup-server 1.0.13-1 running version: 1.0.9
pve-kernel-5.4 6.3-8
pve-kernel-helper 6.3-8
pve-kernel-5.4.103-1-pve 5.4.103-1
pve-kernel-5.4.101-1-pve 5.4.101-1
pve-kernel-5.4.78-2-pve 5.4.78-2
pve-kernel-5.4.65-1-pve 5.4.65-1
ifupdown2 3.0.0-1+pve3
libjs-extjs 6.0.1-10
proxmox-backup-docs 1.0.13-1
proxmox-backup-client 1.0.13-1
proxmox-mini-journalreader 1.1-1
proxmox-widget-toolkit 2.4-9
pve-xtermjs 4.7.0-3
smartmontools 7.2-pve2
zfsutils-linux 2.0.4-pve1
Is this a know issue?
Can we simply increase the file limit from 1024 to prevent this from happening? (Should this then also be adjusted on the installation iso)
Shouldn't systemd always restart the services?
Last edited: