I am running a seven-node cluster, upon a reboot this morning I have been receiving the following error on one of my nodes:
I SSH'd into 'mouse' and checked the following:
Strange, because I made no changes to anything; so off to Google I went and it led me to try the following things...
Moving on to the cluster manager, rabbit - I checked the following:
And, that's all I've got; hoping someone can provide some help. Thanks terribly in advance.
Code:
Connection files (Error 500: hostname look 'mouse' failed - failed to get address info for: mouse: Name or service not known)
I SSH'd into 'mouse' and checked the following:
Code:
root@mouse:~# cat /etc/hosts
127.0.0.1 localhost.localdomain localhost
192.168.100.35 mouse.elephanteggs.com mouse
# The following lines are desirable for IPv6 capable hosts
::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
ff02::3 ip6-allhosts
Code:
root@mouse:~# hostnamectl
Static hostname: mouse
Icon name: computer-server
Chassis: server
Machine ID: 8fe85cdb52ef4002bf36639f7e4cc59d
Boot ID: 9d37017b5a5f4035a742703c5cc2cf89
Operating System: Debian GNU/Linux 11 (bullseye)
Kernel: Linux 5.15.60-1-pve
Architecture: x86-64
Code:
root@mouse:~# cat /etc/hostname
mouse
Strange, because I made no changes to anything; so off to Google I went and it led me to try the following things...
Code:
root@mouse:~# pvecm updatecerts
ipcc_send_rec[1] failed: Connection refused
ipcc_send_rec[2] failed: Connection refused
ipcc_send_rec[3] failed: Connection refused
Unable to load access control list: Connection refused
Code:
root@mouse:~# ls -l /etc/pve
total 0
Code:
root@mouse:~# tail -100 /var/log/syslog
Oct 3 10:55:27 mouse pveproxy[3264]: /etc/pve/local/pve-ssl.key: failed to load local private key (key_file or key) at /usr/share/perl5/PVE/APIServer/AnyEvent.pm line 1943.
Oct 3 10:55:27 mouse pveproxy[3265]: /etc/pve/local/pve-ssl.key: failed to load local private key (key_file or key) at /usr/share/perl5/PVE/APIServer/AnyEvent.pm line 1943.
Oct 3 10:55:27 mouse pveproxy[3266]: /etc/pve/local/pve-ssl.key: failed to load local private key (key_file or key) at /usr/share/perl5/PVE/APIServer/AnyEvent.pm line 1943.
Oct 3 10:55:32 mouse pveproxy[3264]: worker exit
Oct 3 10:55:32 mouse pveproxy[3265]: worker exit
Oct 3 10:55:32 mouse pveproxy[3266]: worker exit
Oct 3 10:55:32 mouse pveproxy[1467]: worker 3264 finished
Oct 3 10:55:32 mouse pveproxy[1467]: worker 3265 finished
Oct 3 10:55:32 mouse pveproxy[1467]: starting 2 worker(s)
Oct 3 10:55:32 mouse pveproxy[1467]: worker 3267 started
Oct 3 10:55:32 mouse pveproxy[1467]: worker 3266 finished
Oct 3 10:55:32 mouse pveproxy[1467]: worker 3268 started
Code:
root@mouse:~# systemctl restart pve-cluster.service
Job for pve-cluster.service failed because the control process exited with error code.
See "systemctl status pve-cluster.service" and "journalctl -xe" for details.
Code:
root@mouse:~# journalctl -xe
░░ Support: https://www.debian.org/support
░░
░░ Automatic restarting of the unit pve-cluster.service has been scheduled, as the result for
░░ the configured Restart= setting for the unit.
Oct 03 10:58:25 mouse systemd[1]: Stopped The Proxmox VE cluster filesystem.
░░ Subject: A stop job for unit pve-cluster.service has finished
░░ Defined-By: systemd
░░ Support: https://www.debian.org/support
░░
░░ A stop job for unit pve-cluster.service has finished.
░░
░░ The job identifier is 2560 and the job result is done.
Oct 03 10:58:25 mouse systemd[1]: pve-cluster.service: Start request repeated too quickly.
Oct 03 10:58:25 mouse systemd[1]: pve-cluster.service: Failed with result 'exit-code'.
░░ Subject: Unit failed
░░ Defined-By: systemd
░░ Support: https://www.debian.org/support
░░
░░ The unit pve-cluster.service has entered the 'failed' state with result 'exit-code'.
Oct 03 10:58:25 mouse systemd[1]: Failed to start The Proxmox VE cluster filesystem.
░░ Subject: A start job for unit pve-cluster.service has failed
░░ Defined-By: systemd
░░ Support: https://www.debian.org/support
░░
░░ A start job for unit pve-cluster.service has finished with a failure.
░░
░░ The job identifier is 2560 and the job result is failed.
Oct 03 10:58:29 mouse pveproxy[3389]: worker exit
Oct 03 10:58:29 mouse pveproxy[3390]: worker exit
Oct 03 10:58:29 mouse pveproxy[1467]: worker 3389 finished
Oct 03 10:58:29 mouse pveproxy[1467]: starting 1 worker(s)
Oct 03 10:58:29 mouse pveproxy[1467]: worker 3397 started
Oct 03 10:58:29 mouse pveproxy[3391]: worker exit
Oct 03 10:58:29 mouse pveproxy[1467]: worker 3390 finished
Oct 03 10:58:29 mouse pveproxy[1467]: starting 1 worker(s)
Oct 03 10:58:29 mouse pveproxy[1467]: worker 3398 started
Oct 03 10:58:29 mouse pveproxy[3397]: /etc/pve/local/pve-ssl.key: failed to load local private key (key_file or key) at /usr/share/perl5/PVE/APIServer/AnyEvent.pm line 1943.
Code:
root@mouse:~# systemctl status -l pve-cluster
● pve-cluster.service - The Proxmox VE cluster filesystem
Loaded: loaded (/lib/systemd/system/pve-cluster.service; enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Mon 2022-10-03 10:59:24 EDT; 2min 41s ago
Process: 3444 ExecStart=/usr/bin/pmxcfs (code=exited, status=255/EXCEPTION)
CPU: 11ms
Oct 03 10:59:24 mouse systemd[1]: pve-cluster.service: Scheduled restart job, restart counter is at 5.
Oct 03 10:59:24 mouse systemd[1]: Stopped The Proxmox VE cluster filesystem.
Oct 03 10:59:24 mouse systemd[1]: pve-cluster.service: Start request repeated too quickly.
Oct 03 10:59:24 mouse systemd[1]: pve-cluster.service: Failed with result 'exit-code'.
Oct 03 10:59:24 mouse systemd[1]: Failed to start The Proxmox VE cluster filesystem.
Moving on to the cluster manager, rabbit - I checked the following:
Code:
root@rabbit:~# cat /etc/hosts
127.0.0.1 localhost.localdomain localhost
192.168.100.5 rabbit.elephanteggs.com pinnacle
# The following lines are desirable for IPv6 capable hosts
::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
ff02::3 ip6-allhosts
Code:
root@rabbit:~# pvecm updatecerts
(re)generate node files
merge authorized SSH keys and known hosts
Code:
root@rabbit:~# pvecm nodes
Membership information
----------------------
Nodeid Votes Name
1 1 rabbit (local)
2 1 hedgehog
3 1 fennec
4 1 hamster
5 1 quokka
6 1 weasel
7 1 mouse
And, that's all I've got; hoping someone can provide some help. Thanks terribly in advance.