I was playing around with Proxmox by virtualizing 2 nodes on my existing ESXi infrastructure, since it was taking forever for my new hardware to come through the mail and I was getting annoyed I started migrating VMs over anyway by converting the vmdks and creating new VMs. Things were going well, with live migrates etc too as I was testing stuff.
My hardware finally showed up. I stuck a 4 port nic in each one, for now I just need 2 ports, so using the built on one for management (and storage traffic for now) and the other port on the 4 port nic is a trunk port to the switch.
1st problem:
vlans are not working on the new nodes, it's set as a trunk, the vlans are allowed, all that stuff should be fine as far as the switch is concerned, I set vlan aware to yes when I created the bridge, and the name matches the other nodes. But minute I migrate a VM to it, there's no network. What further troubleshooting or logs could I look at to figure out what's going on?
2nd problem: one of the nodes is showing offline, no matter what I do. I can open a console and even migrate a VM to it, but there's always an X icon. Sometimes if I click on it, I get prompted for a password then get an "invalid tivcket" error and get kicked out of the entire cluster. At one point it even looked like the whole cluster was corrupted and as I was typing this, it magically started to work again. And now as I type this the node that was showing offline is now showing online, but the one that was showing online is now showing offline. (both new physical nodes). There is no network interfaces showing up at all either on the node that was previously showing offline and now magically is showing online.
Every now I keep getting kicked out of the entire cluster and need to login again and everything is just super unstable, like it won't show anything or just sits spinning etc. it seems adding these two physical nodes to the cluster completely broke everything.
Overall there seems to be lot of weird instability stuff going on right now. What would be the best way to even start troubleshooting this?
My hardware finally showed up. I stuck a 4 port nic in each one, for now I just need 2 ports, so using the built on one for management (and storage traffic for now) and the other port on the 4 port nic is a trunk port to the switch.
1st problem:
vlans are not working on the new nodes, it's set as a trunk, the vlans are allowed, all that stuff should be fine as far as the switch is concerned, I set vlan aware to yes when I created the bridge, and the name matches the other nodes. But minute I migrate a VM to it, there's no network. What further troubleshooting or logs could I look at to figure out what's going on?
2nd problem: one of the nodes is showing offline, no matter what I do. I can open a console and even migrate a VM to it, but there's always an X icon. Sometimes if I click on it, I get prompted for a password then get an "invalid tivcket" error and get kicked out of the entire cluster. At one point it even looked like the whole cluster was corrupted and as I was typing this, it magically started to work again. And now as I type this the node that was showing offline is now showing online, but the one that was showing online is now showing offline. (both new physical nodes). There is no network interfaces showing up at all either on the node that was previously showing offline and now magically is showing online.
Every now I keep getting kicked out of the entire cluster and need to login again and everything is just super unstable, like it won't show anything or just sits spinning etc. it seems adding these two physical nodes to the cluster completely broke everything.
Overall there seems to be lot of weird instability stuff going on right now. What would be the best way to even start troubleshooting this?
Last edited: