HELP! One Host of cluster not reachable after reboot

Dec 19, 2020
67
4
13
48
Hi!

I have a Proxmox cluster with 3 hosts and CEPH running, I did all the recent updates, but did not reboot them for a while now.
However, I did a reboot of the first host right now, the reboot ran fine, but after the host is back up, it is not reachable in the network?!
Any idea, what this could be?

I did a second reboot, hoping that this will solve it, but this did not help.
In the Proxmox WebGUI I see that the host is back online (green tick), but when trying to access it I get "Connection error 595: No route to host".
When going on the console and execute "ifconfig" I see the interface all setup correctly...
But from here, I can also not ping any machine on the network?!

Also the CEPH network is down and says "Degraded".

PLEASE HELP!
Thank you.
 
After searching the forum now, I read something about network problems with new kernels...
So I did another reboot and choose the kernel to version 6.5.13-1-pve while booting, the version that is also running on the other two hosts.
Finally the host is back to network and running fine! :cool:

But how to deal with this now?
Do I have to stay with this old kernel now forever?
Could I add support for my network cards to the new kernels?

Regards,
Prahn
 
Here is the output of the network cards of lspci -v:

Code:
b3:00.0 Ethernet controller: Intel Corporation Ethernet Controller 10-Gigabit X540-AT2 (rev 01)
        Subsystem: Intel Corporation Ethernet Converged Network Adapter X540-T1
        Flags: bus master, fast devsel, latency 0, IRQ 43, NUMA node 0
        Memory at fb800000 (64-bit, prefetchable) [size=2M]
        Memory at fba00000 (64-bit, prefetchable) [size=16K]
        Expansion ROM at e1000000 [disabled] [size=512K]
        Capabilities: [40] Power Management version 3
        Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
        Capabilities: [70] MSI-X: Enable+ Count=64 Masked-
        Capabilities: [a0] Express Endpoint, MSI 00
        Capabilities: [100] Advanced Error Reporting
        Capabilities: [140] Device Serial Number a0-36-9f-ff-ff-1d-65-ca
        Capabilities: [150] Alternative Routing-ID Interpretation (ARI)
        Capabilities: [160] Single Root I/O Virtualization (SR-IOV)
        Capabilities: [1d0] Access Control Services
        Kernel driver in use: ixgbe
        Kernel modules: ixgbe

b4:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
        Subsystem: Hewlett-Packard Company Ethernet I210-T1 GbE NIC
        Flags: bus master, fast devsel, latency 0, IRQ 37, NUMA node 0
        Memory at fbc00000 (32-bit, non-prefetchable) [size=1M]
        Memory at fbd00000 (32-bit, non-prefetchable) [size=16K]
        Expansion ROM at fbb00000 [disabled] [size=1M]
        Capabilities: [40] Power Management version 3
        Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
        Capabilities: [70] MSI-X: Enable+ Count=5 Masked-
        Capabilities: [a0] Express Endpoint, MSI 00
        Capabilities: [100] Advanced Error Reporting
        Capabilities: [140] Device Serial Number 68-05-ca-ff-ff-bd-65-08
        Capabilities: [1a0] Transaction Processing Hints
        Kernel driver in use: igb
        Kernel modules: igb
 
Last edited:

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!