Proxmox hanging

Hyien

Member
Jun 18, 2021
Hi, I'm currently running pve 7.2-7.

After updating the following packages:
libpve-rs-perl:amd64 0.6.2 -> 0.7.1
pve-qemu-kvm:amd64 6.2.0-11 -> 7.0.0-2
libpve-storage-perl:amd64 7.2-7 -> 7.2-8
libproxmox-rs-perl:amd64 0.1.2 -> 0.2.0
qemu-server:amd64 7.2-3 -> 7.2-4
pve-edk2-firmware:amd64 3.20210831-2 -> 3.20220526-1

I noticed my clusters hanging. Every proxmox command would hang and require
a hard reboot to fix. After rebooting, it runs for a bit, then hangs again.
 
Have you read all the code changes for the beta packages? The point of updating to them is to try them out, find errors, and post logs related to a specific commit. If you only need a working server, just install directly from the ISO.
 
Can you check the syslog (via /var/log/syslog or journalctl) to see if there are any errors or related messages when it hangs?
Commands 'hanging' usually means that some disk I/O is blocking (so it is storage related).
 
I don't see any storage errors, but I do see a bunch of cluster-related errors:
pmxcfs[1551]: [status] notice: cpg_send_message retry 100
pmxcfs[1551]: [status] notice: cpg_send_message retried 100 times
pmxcfs[1551]: [status] crit: cpg_send_message failed: 6
pve-firewall[1577]: firewall update time (10.016 seconds)
pmxcfs[1551]: [status] notice: cpg_send_message retry 10
corosync[1557]: [KNET ] rx: host: 1 link: 0 is up
corosync[1557]: [KNET ] rx: host: 2 link: 0 is up
corosync[1557]: [KNET ] host: host: 1 (passive) best link: 0 (pri: 1)
corosync[1557]: [KNET ] host: host: 2 (passive) best link: 0 (pri: 1)
pmxcfs[1551]: [status] notice: cpg_send_message retry 20
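
For anyone wanting to triage a log excerpt like the one above, here is a rough Python sketch that counts lines per process and pulls out the cpg_send_message retry counts and failure codes. The function name `summarize` and the `SAMPLE` variable are made up for illustration, and the regexes assume the message format shown in this thread; other versions may log differently.

```python
import re
from collections import Counter

# Assumed line shape, taken from the excerpt in this thread:
#   "<process>[<pid>]: <message>", optionally preceded by a timestamp/hostname.
PROC_RE = re.compile(r"(?P<proc>[\w-]+)\[\d+\]: ")
RETRY_RE = re.compile(r"cpg_send_message retry (\d+)")
FAILED_RE = re.compile(r"cpg_send_message failed: (\d+)")

# Log lines copied from the post (hypothetical variable name).
SAMPLE = """
pmxcfs[1551]: [status] notice: cpg_send_message retry 100
pmxcfs[1551]: [status] notice: cpg_send_message retried 100 times
pmxcfs[1551]: [status] crit: cpg_send_message failed: 6
pve-firewall[1577]: firewall update time (10.016 seconds)
pmxcfs[1551]: [status] notice: cpg_send_message retry 10
corosync[1557]: [KNET ] rx: host: 1 link: 0 is up
corosync[1557]: [KNET ] rx: host: 2 link: 0 is up
corosync[1557]: [KNET ] host: host: 1 (passive) best link: 0 (pri: 1)
corosync[1557]: [KNET ] host: host: 2 (passive) best link: 0 (pri: 1)
pmxcfs[1551]: [status] notice: cpg_send_message retry 20
"""


def summarize(lines):
    """Count lines per process and collect cpg_send_message retries/failures."""
    per_process = Counter()
    max_retry = 0
    failure_codes = []
    for line in lines:
        proc = PROC_RE.search(line)
        if proc is None:
            continue  # blank line or a format this sketch does not know
        per_process[proc.group("proc")] += 1
        retry = RETRY_RE.search(line)
        if retry:
            max_retry = max(max_retry, int(retry.group(1)))
        failed = FAILED_RE.search(line)
        if failed:
            failure_codes.append(int(failed.group(1)))
    return {
        "per_process": dict(per_process),
        "max_retry": max_retry,
        "failure_codes": failure_codes,
    }


if __name__ == "__main__":
    print(summarize(SAMPLE.splitlines()))
```

To run it on a real log, the same function can be fed the lines of a file, e.g. `summarize(open("/var/log/syslog"))`. A high retry count together with failure codes points at cluster communication rather than storage.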
 
then check your network & firewall settings
 
there aren't any fw restrictions.
i have nodes in different subnets in the same cluster.
would this cause any issue?
 
without more info & logs to your setup it's hard to say...
 
For corosync, yes. But if you use the web UI/API, the nodes must be able to connect to each other via HTTPS (port 8006); SSH is required for some things, and if you use SPICE, port 3128 is required.
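
A quick way to verify the TCP ports mentioned here from one node to another is a small Python check like the following. This is only a sketch: the helper names `port_open` and `check_node` and the hostname `pve-node2` are placeholders, port 22 is assumed as the SSH port, and corosync's own traffic is UDP, so this check does not cover it.

```python
import socket

# Ports named in the reply above; 22 is assumed to be the SSH port.
PORTS = {"https (web UI/API)": 8006, "ssh": 22, "spice proxy": 3128}


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


def check_node(host, ports=PORTS, timeout=3.0):
    """Map each service label to whether its TCP port is reachable on host."""
    return {label: port_open(host, port, timeout) for label, port in ports.items()}


if __name__ == "__main__":
    # "pve-node2" is a hypothetical hostname; replace it with a real cluster node.
    for label, ok in check_node("pve-node2").items():
        print(f"{label}: {'reachable' if ok else 'NOT reachable'}")
```

Running it from each node against every other node shows whether routing or a firewall between the subnets blocks any of these ports.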

That may not be the problem, though; throughput and latency can also be a problem.
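
To get a rough feel for latency between nodes in different subnets, one option is to time a few TCP connects, as in this Python sketch. The names `connect_times_ms` and `summarize_latency` and the hostname `pve-node2` are invented for illustration. TCP connect time is only a proxy for what corosync sees over UDP, so treat the numbers as a hint, not a measurement of corosync itself.

```python
import socket
import statistics
import time


def connect_times_ms(host, port, samples=5, timeout=2.0):
    """Time several TCP connects to host:port; failed attempts are skipped."""
    times = []
    for _ in range(samples):
        start = time.perf_counter()
        try:
            with socket.create_connection((host, port), timeout=timeout):
                pass
        except OSError:
            continue  # unreachable or timed out; not counted as a sample
        times.append((time.perf_counter() - start) * 1000.0)
    return times


def summarize_latency(times):
    """Return sample count, median and max in milliseconds, or None if empty."""
    if not times:
        return None
    return {
        "samples": len(times),
        "median_ms": statistics.median(times),
        "max_ms": max(times),
    }


if __name__ == "__main__":
    # "pve-node2" is a hypothetical hostname; 8006 is the web UI/API port.
    print(summarize_latency(connect_times_ms("pve-node2", 8006)))
```

A large gap between the median and the maximum suggests jitter on the link, which is worth checking before blaming the package update.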

"i have nodes in different subnets in the same cluster.
would this cause any issue?"

possibly, never tried that to be honest
 
