Thanks for the quick response. I thought that increasing one of the nodes to two votes would make it "stable". To confirm: Ceph needing three nodes is a separate issue from the cluster quorum issue?
Thanks
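To make the vote question concrete, here is a rough sketch of the majority rule that quorum-based clusters use (more than half of all votes must be present). The node names `node-a` and `node-b` and the helper functions are made up for illustration; this is plain arithmetic, not a call into corosync or Ceph.

```python
def quorum_threshold(total_votes: int) -> int:
    """Smallest number of votes that is a strict majority."""
    return total_votes // 2 + 1


def is_quorate(votes_by_node: dict, online: set) -> bool:
    """True if the online nodes together hold a strict majority of all votes."""
    total = sum(votes_by_node.values())
    present = sum(v for node, v in votes_by_node.items() if node in online)
    return present >= quorum_threshold(total)


# Hypothetical two-node cluster where one node was given two votes.
cluster_votes = {"node-a": 2, "node-b": 1}
print(is_quorate(cluster_votes, {"node-a"}))  # 2 of 3 votes present
print(is_quorate(cluster_votes, {"node-b"}))  # 1 of 3 votes present

# Two Ceph monitors, one vote each: the majority of 2 is 2.
monitor_votes = {"node-a": 1, "node-b": 1}
print(is_quorate(monitor_votes, {"node-a"}))  # 1 of 2 votes present
```

Under these assumptions the extra vote only helps when the one-vote node is the one that reboots. With two monitors, losing either one drops below a majority, which is why the Ceph side is its own problem.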
For my homelab, I have 2 nodes running PVE 8.1.10 with Ceph installed. Both have monitors, MDS, mgrs, and OSDs. When everything is up it works well. When I do an update and reboot either of the nodes, the other node soon becomes unresponsive for a few minutes and the VMs running on that node...
I have a couple of the new MS-01 mini computers that should be great for homelabs. I've been following a few guides to enable SR-IOV with the i915, which seems to work for others. I can confirm that the DKMS module builds properly and is active, but the driver never loads the right driver for...
Thanks, I just checked and the firmware is up to date. I have not seen any thermal messages, and it's in a cool room. There's no case fan, though, and these cards feel like they run hot.
So I was able to fix this by completely rebuilding my Proxmox node, even though the original was only weeks old and I didn't do anything to it besides regular updates.
Funnily enough, after several hours the node's interfaces dropped offline and now aren't detected. I just saw a message of both...
I followed the instructions to delnode, which worked, but I still see parts of the old node in the GUI. I see the server and the VMs with a ? under it, but I can't do anything with those. Right- or left-clicking on them breaks the interface and I need to refresh to fix it.
Thoughts?
The card never properly loads... well, it loads and then unloads itself. I live-booted this computer with Ubuntu and the card works as expected, so I don't think it's hardware. I'm also using the default 6.5.11-7 kernel from a new 8.1.3 install.
[ 1.455990] mlx5_core 0000:01:00.0: firmware...
I have a couple of Supermicro AOC-S25G-B2S cards and while they show up in the dmesg, they never enumerate in the OS.
Proxmox 8.1.3 (kernel 6.2.16-20-pve; this prevents bnxt_en kernel panics, which seem to be a known kernel/driver issue)
[ 1.286682] bnxt_en 0000:01:00.0 eth0: Broadcom BCM57414...
I updated the firmware and have checked dmesg. The card stays up for about 30 seconds after boot, and then both interfaces go into cleanup and disappear.
[ 26.907682] mlx5_core 0000:01:00.0: E-Switch: Unload vfs: mode(LEGACY), nvfs(0), necvfs(0), active vports(0)
[ 26.949676] mlx5_core...
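For anyone wanting to measure how long the card stays up, a small sketch like the one below pulls the timestamps out of dmesg lines and reports the gap between a driver's first message and its unload message. The regex, function names, and marker string are my own assumptions about the line layout. The two sample lines are copied from excerpts in different posts here (the first one truncated as posted), so pairing them is only for illustration.

```python
import re

# Assumed layout: "[ seconds.micros] driver rest-of-message"
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(\S+)\s+(.*)$")


def parse_dmesg_line(line: str):
    """Return (timestamp, driver, message), or None if the line does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    return float(m.group(1)), m.group(2), m.group(3)


def seconds_until(lines, driver: str, marker: str):
    """Seconds from the driver's first message to the first one containing marker."""
    first_seen = None
    for line in lines:
        parsed = parse_dmesg_line(line)
        if parsed is None or parsed[1] != driver:
            continue
        timestamp, _, message = parsed
        if first_seen is None:
            first_seen = timestamp
        if marker in message:
            return timestamp - first_seen
    return None


sample = [
    "[ 1.455990] mlx5_core 0000:01:00.0: firmware...",
    "[ 26.907682] mlx5_core 0000:01:00.0: E-Switch: Unload vfs: "
    "mode(LEGACY), nvfs(0), necvfs(0), active vports(0)",
]
gap = seconds_until(sample, "mlx5_core", "E-Switch: Unload vfs")
print(f"mlx5_core unloaded {gap:.2f} s after its first message")
```

On a live system the lines would come from saved `dmesg` output rather than a hard-coded list.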
OK, so this is a weird issue with the Mellanox ConnectX-4 card falling offline... I'm struggling to identify a fix. I didn't need to do anything for ESXi, XCP-ng, or Ubuntu.