Stromausfall: Server fährt nicht richtig hoch, webui nicht erreichbar

Apr 25, 2022
3
1
3
Hallo!

Nach einem Stromausfall wurde der Server nicht richtig heruntergefahren. Jetzt fährt er nicht mehr richtig hoch, die Webui lässt sich nicht starten.

root@xxxx:~# pvecm nodes ipcc_send_rec[1] failed: Connection refused ipcc_send_rec[2] failed: Connection refused ipcc_send_rec[3] failed: Connection refused Unable to load access control list: Connection refused

Nicht alle pve-Prozesse scheinen zu laufen:

root@xxxx:~# ps -e | grep pve 1737 ? 00:00:02 pvefw-logger 25833 ? 00:00:00 pvedaemon 25834 ? 00:00:01 pvedaemon worke 25835 ? 00:00:01 pvedaemon worke 25836 ? 00:00:01 pvedaemon worke 25866 ? 00:00:00 pveproxy 25867 ? 00:00:01 pveproxy worker 25868 ? 00:00:01 pveproxy worker 25869 ? 00:00:01 pveproxy worker 25924 ? 00:00:00 pve-lxc-syscall

Mit dem Befehl systemctl status 'pve*' sieht man, dass folgende Prozesse nicht richtig laufen/starten:
  • pve-daily-update.service
  • pve-ha-crm.service
  • Failed to start PVE Cluster HA Resource Manager Daemon.
  • pvestatd.service - PVE Status Daemon
  • Failed to start PVE Local HA Resource Manager Daemon
  • Failed to start Proxmox VE replication runner
  • Failed to start Proxmox VE firewall


Gibt es eine Möglichkeit, den Server wieder starten zu können?
 
Was steht denn im SYSLOG während/nach dem Bootvorgang?
Wenn du den Status der nicht gestarteten Dienste anzeigenden lässt, müssten die auch Auskunft über die Ursache der Startprobleme geben.
"systemctl status servicename"
 
So sehen die Statusmeldungen aus:

root@xxxx:~# systemctl status pve-daily-update ● pve-daily-update.service - Daily PVE download activities Loaded: loaded (/lib/systemd/system/pve-daily-update.service; static; vendor Active: failed (Result: exit-code) since Wed 2022-04-27 02:08:54 CEST; 14h ag Process: 24442 ExecStart=/usr/bin/pveupdate (code=exited, status=111) Main PID: 24442 (code=exited, status=111) Apr 27 02:08:54 hera systemd[1]: Starting Daily PVE download activities... Apr 27 02:08:54 hera pveupdate[24442]: ipcc_send_rec[1] failed: Connection refus Apr 27 02:08:54 hera pveupdate[24442]: ipcc_send_rec[2] failed: Connection refus Apr 27 02:08:54 hera pveupdate[24442]: ipcc_send_rec[3] failed: Connection refus Apr 27 02:08:54 hera pveupdate[24442]: Unable to load access control list: Conne Apr 27 02:08:54 hera systemd[1]: pve-daily-update.service: Main process exited, Apr 27 02:08:54 hera systemd[1]: pve-daily-update.service: Failed with result 'e Apr 27 02:08:54 hera systemd[1]: Failed to start Daily PVE download activities. root@xxxx:~# systemctl status pve-ha-crm.service ● pve-ha-crm.service - PVE Cluster HA Resource Manager Daemon Loaded: loaded (/lib/systemd/system/pve-ha-crm.service; enabled; vendor prese Active: failed (Result: exit-code) since Sun 2022-04-24 14:04:14 CEST; 3 days Process: 1473 ExecStart=/usr/sbin/pve-ha-crm start (code=exited, status=111) Apr 24 14:04:14 hera pve-ha-crm[1473]: ipcc_send_rec[1] failed: Connection refus Apr 24 14:04:14 hera pve-ha-crm[1473]: ipcc_send_rec[1] failed: Connection refus Apr 24 14:04:14 hera pve-ha-crm[1473]: ipcc_send_rec[2] failed: Connection refus Apr 24 14:04:14 hera pve-ha-crm[1473]: ipcc_send_rec[2] failed: Connection refus Apr 24 14:04:14 hera pve-ha-crm[1473]: ipcc_send_rec[3] failed: Connection refus Apr 24 14:04:14 hera pve-ha-crm[1473]: ipcc_send_rec[3] failed: Connection refus Apr 24 14:04:14 hera pve-ha-crm[1473]: Unable to load access control list: Conne Apr 24 14:04:14 hera systemd[1]: pve-ha-crm.service: Control process exited, cod Apr 24 14:04:14 hera systemd[1]: pve-ha-crm.service: Failed with result 'exit-co Apr 24 14:04:14 hera systemd[1]: Failed to start PVE Cluster HA Resource Manager root@xxxx:~# systemctl status pvestatd.service ● pvestatd.service - PVE Status Daemon Loaded: loaded (/lib/systemd/system/pvestatd.service; enabled; vendor preset: enabled) Active: failed (Result: exit-code) since Sun 2022-04-24 14:04:14 CEST; 3 days ago Process: 1455 ExecStart=/usr/bin/pvestatd start (code=exited, status=111) Apr 24 14:04:14 hera pvestatd[1455]: ipcc_send_rec[1] failed: Connection refused Apr 24 14:04:14 hera pvestatd[1455]: ipcc_send_rec[1] failed: Connection refused Apr 24 14:04:14 hera pvestatd[1455]: ipcc_send_rec[2] failed: Connection refused Apr 24 14:04:14 hera pvestatd[1455]: ipcc_send_rec[2] failed: Connection refused Apr 24 14:04:14 hera pvestatd[1455]: ipcc_send_rec[3] failed: Connection refused Apr 24 14:04:14 hera pvestatd[1455]: ipcc_send_rec[3] failed: Connection refused Apr 24 14:04:14 hera pvestatd[1455]: Unable to load access control list: Connection refused Apr 24 14:04:14 hera systemd[1]: pvestatd.service: Control process exited, code=exited, status=111/n/a Apr 24 14:04:14 hera systemd[1]: pvestatd.service: Failed with result 'exit-code'. Apr 24 14:04:14 hera systemd[1]: Failed to start PVE Status Daemon.

So sieht es in der SYSLOG aus:
Apr 27 00:00:01 hera rsyslogd: [origin software="rsyslogd" swVersion="8.1901.0" x-pid="918" x-info="https://www.rsyslog.com"] rsyslogd was HUPed Apr 27 00:00:01 hera rsyslogd: [origin software="rsyslogd" swVersion="8.1901.0" x-pid="918" x-info="https://www.rsyslog.com"] rsyslogd was HUPed Apr 27 00:00:01 hera systemd[1]: logrotate.service: Succeeded. Apr 27 00:00:01 hera systemd[1]: Started Rotate log files. Apr 27 00:00:01 hera pveproxy[25866]: restarting server Apr 27 00:00:01 hera pveproxy[25866]: starting 3 worker(s) Apr 27 00:00:01 hera pveproxy[25866]: worker 22621 started Apr 27 00:00:01 hera pveproxy[25866]: worker 22622 started Apr 27 00:00:01 hera pveproxy[25866]: worker 22623 started Apr 27 00:00:06 hera spiceproxy[1619]: worker exit Apr 27 00:00:06 hera spiceproxy[1483]: worker 1619 finished Apr 27 00:00:06 hera pveproxy[25869]: worker exit Apr 27 00:00:06 hera pveproxy[25868]: worker exit Apr 27 00:00:06 hera pveproxy[25867]: worker exit Apr 27 00:00:06 hera pveproxy[25866]: worker 25869 finished Apr 27 00:00:06 hera pveproxy[25866]: worker 25868 finished Apr 27 00:00:06 hera pveproxy[25866]: worker 25867 finished Apr 27 00:01:00 hera systemd[1]: Starting Proxmox VE replication runner... Apr 27 00:01:00 hera pvesr[22630]: ipcc_send_rec[1] failed: Connection refused Apr 27 00:01:00 hera pvesr[22630]: ipcc_send_rec[2] failed: Connection refused Apr 27 00:01:00 hera pvesr[22630]: ipcc_send_rec[3] failed: Connection refused Apr 27 00:01:00 hera pvesr[22630]: Unable to load access control list: Connection refused Apr 27 00:01:00 hera systemd[1]: pvesr.service: Main process exited, code=exited, status=111/n/a Apr 27 00:01:00 hera systemd[1]: pvesr.service: Failed with result 'exit-code'. Apr 27 00:01:00 hera systemd[1]: Failed to start Proxmox VE replication runner. Apr 27 00:02:00 hera systemd[1]: Starting Proxmox VE replication runner... Apr 27 00:02:00 hera pvesr[22643]: ipcc_send_rec[1] failed: Connection refused Apr 27 00:02:00 hera pvesr[22643]: ipcc_send_rec[2] failed: Connection refused Apr 27 00:02:00 hera pvesr[22643]: ipcc_send_rec[3] failed: Connection refused Apr 27 00:02:00 hera pvesr[22643]: Unable to load access control list: Connection refused Apr 27 00:02:00 hera systemd[1]: pvesr.service: Main process exited, code=exited, status=111/n/a Apr 27 00:02:00 hera systemd[1]: pvesr.service: Failed with result 'exit-code'. Apr 27 00:02:00 hera systemd[1]: Failed to start Proxmox VE replication runner. Apr 27 00:03:00 hera systemd[1]: Starting Proxmox VE replication runner... Apr 27 00:03:00 hera pvesr[22656]: ipcc_send_rec[1] failed: Connection refused Apr 27 00:03:00 hera pvesr[22656]: ipcc_send_rec[2] failed: Connection refused Apr 27 00:03:00 hera pvesr[22656]: ipcc_send_rec[3] failed: Connection refused Apr 27 00:03:00 hera pvesr[22656]: Unable to load access control list: Connection refused Apr 27 00:03:00 hera systemd[1]: pvesr.service: Main process exited, code=exited, status=111/n/a Apr 27 00:03:00 hera systemd[1]: pvesr.service: Failed with result 'exit-code'. Apr 27 00:03:00 hera systemd[1]: Failed to start Proxmox VE replication runner. Apr 27 00:04:00 hera systemd[1]: Starting Proxmox VE replication runner... Apr 27 00:04:00 hera pvesr[22671]: ipcc_send_rec[1] failed: Connection refused Apr 27 00:04:00 hera pvesr[22671]: ipcc_send_rec[2] failed: Connection refused Apr 27 00:04:00 hera pvesr[22671]: ipcc_send_rec[3] failed: Connection refused Apr 27 00:04:00 hera pvesr[22671]: Unable to load access control list: Connection refused Apr 27 00:04:00 hera systemd[1]: pvesr.service: Main process exited, code=exited, status=111/n/a Apr 27 00:04:00 hera systemd[1]: pvesr.service: Failed with result 'exit-code'. Apr 27 00:04:00 hera systemd[1]: Failed to start Proxmox VE replication runner. Apr 27 00:05:00 hera systemd[1]: Starting Proxmox VE replication runner... Apr 27 00:05:00 hera pvesr[22685]: ipcc_send_rec[1] failed: Connection refused Apr 27 00:05:00 hera pvesr[22685]: ipcc_send_rec[2] failed: Connection refused Apr 27 00:05:00 hera pvesr[22685]: ipcc_send_rec[3] failed: Connection refused Apr 27 00:05:00 hera pvesr[22685]: Unable to load access control list: Connection refused Apr 27 00:05:00 hera systemd[1]: pvesr.service: Main process exited, code=exited, status=111/n/a Apr 27 00:05:00 hera systemd[1]: pvesr.service: Failed with result 'exit-code'. Apr 27 00:05:00 hera systemd[1]: Failed to start Proxmox VE replication runner. Apr 27 00:06:00 hera systemd[1]: Starting Proxmox VE replication runner... Apr 27 00:06:00 hera pvesr[22700]: ipcc_send_rec[1] failed: Connection refused Apr 27 00:06:00 hera pvesr[22700]: ipcc_send_rec[2] failed: Connection refused Apr 27 00:06:00 hera pvesr[22700]: ipcc_send_rec[3] failed: Connection refused Apr 27 00:06:00 hera pvesr[22700]: Unable to load access control list: Connection refused Apr 27 00:06:00 hera systemd[1]: pvesr.service: Main process exited, code=exited, status=111/n/a Apr 27 00:06:00 hera systemd[1]: pvesr.service: Failed with result 'exit-code'. Apr 27 00:06:00 hera systemd[1]: Failed to start Proxmox VE replication runner. Apr 27 00:07:00 hera systemd[1]: Starting Proxmox VE replication runner... Apr 27 00:07:00 hera pvesr[22714]: ipcc_send_rec[1] failed: Connection refused Apr 27 00:07:00 hera pvesr[22714]: ipcc_send_rec[2] failed: Connection refused Apr 27 00:07:00 hera pvesr[22714]: ipcc_send_rec[3] failed: Connection refused Apr 27 00:07:00 hera pvesr[22714]: Unable to load access control list: Connection refused

Im Ordner /etc/pve/ haben folgende Dateien ein Datum vom 1. Januar 1970 (!!??):
  • .clusterlog
  • .debug
  • .members
  • .rrd
  • .version
  • .vmlist
 
Last edited:
Hi,
sofern du Backups deiner VMs hast, was du ja haben solltest. (PBS?)....
Host neu installieren von Stick und Restore der VMs....

Alles andere halte ich für "riskant", da nicht sicher zu sagen ist ob noch weitere Probleme kommen bedingt durch den harten Crash bei Stromausfall....
 
was sagt denn der log von corosync und pve-cluster?
 
Vielen Dank! Wir haben ein Backup der VMs und den Proxmox inzwischen neu hochgezogen. Die ersten Maschinen laufen schon wieder...
PBS: Vom Proxmox selbst hatten wir bisher kein Backup ... dies werden wir ändern!
 
  • Like
Reactions: itNGO