how to debug proxmox not being responsive

hengly

New Member
Oct 30, 2024
2
0
1
hello, I have been running proxmox on-premises for my company for just little over a year and I have this issue happens after I move the servers to another office location. Everything is working fine for few days after office moving then proxmox starting to become un-responsive, first I tried rebooting, then It just resolve itself eventually then the rebooting trick stopped working as well(It still works I think but seems to be requiring luck, but now 90% of the time does not work), I tried checking `pve-cluster`, `pvedaemon`, `pveproxy` and `corosync` those services look fine on all node sometime, sometime `pveproxy` shows `client problem, connection reset by peer` but networking seems fine and no firewall seems to be blocking, I can reach the proxmox web ui but not able to login, it just hanging there after enter the credential and timeout eventually and running `pvecm updatecerts -f` always result in `got timeout`.

The new office location already have networking setup, so the proxmox servers just connecting to another vlan with same ip cidr, ssh between each proxmox nodes is working fine, when proxmox is working, communication between nodes and each vm itself also fine, I can also access to services within vm from outside using public ip fine, except I notice an issue, all vm created within promox before now has wrong default route, eventhou I check the config from `/etc/network/interfaces` it looks like it is correct but vm gets the wrong one but I add the default route in manually afterward and it solve the issues but anyway those are issue that I notice after moving to new office which before does not happened.

so currently I am not sure how to debug and fix this issue so I any advice would be appreciated, and thanks for your time reading this.
 

Attachments

  • pve0-logs.txt
    16 KB · Views: 0
  • pve1-logs.txt
    22.7 KB · Views: 0
  • pve2-logs.txt
    24 KB · Views: 0
so I managed to pinpoint the problem, althou pinging ip address of each proxmox node works, but the hostname does not, so I added each host to each proxmox /etc/hosts and now it is all working fine, except that the default gateway within vm is still wrong, it is showing differently from what has configured on the bridge network why is that?

Code:
# one of proxmox host
# cat /etc/network/interfaces
auto lo
iface lo inet loopback

iface eno2 inet manual

auto vmbr0
iface vmbr0 inet static
        address 192.168.88.61/24
        gateway 192.168.88.1
        bridge-ports eno1
        bridge-stp off
        bridge-fd 0

# one of VM
# ip route
default via 192.168.1.14 dev ens18 proto dhcp src 192.168.88.20 metric 100
192.168.1.14 dev ens18 proto dhcp scope link src 192.168.88.20 metric 100
192.168.88.0/24 dev ens18 proto kernel scope link src 192.168.88.20 metric 100
 
Last edited:
so I managed to pinpoint the problem, althou pinging ip address of each proxmox node works, but the hostname does not, so I added each host to each proxmox /etc/hosts and now it is all working fine
I don't have much experience with this but I thought this was a requirement but I could be wrong.
, except that the default gateway within vm is still wrong, it is showing differently from what has configured on the bridge network why is that?
Only network configurations of containers are (partially) managed by Proxmox. The network settings inside any VM are not under the control of Proxmox. Maybe you need to change them manually or change a setting on your DHCP server?
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!