Hello Everyone,
I have been facing a rather unusual issue lately. I am not sure what is causing it. At first I thought the Netbird VM was responsible, but the issue persists even after shutting that VM down. Details below:
I have a 2-node Proxmox cluster at home.
Node 1
Type: Primary Node
Hosts: OPNSense Firewall, Home Assistant, Traefik and a few more things.
Memory: 16GB
Storage: 128GB, much of it is free
X------------------------------------------------X-------------------------------------------X
Node 2
Type: Secondary Node
Hosts: Nextcloud, Jellyfin, Netbird and a few more things.
Memory: 64GB
Storage: 128GB SSD, plus 2 x 12TB HDDs in a RAID 1 configuration set up within Proxmox. Nextcloud, Jellyfin, etc. are mostly installed on the HDDs, and all their data (pictures etc.) is stored there as well.
This started happening after I set up Netbird on Node 2. But even after shutting down the VM and not using Netbird for around 2 days, the issue occurred again. It now recurs every few hours.
I need help finding the root cause of this issue: which logs should I look at, and where? By the way, all workloads are assigned static IPs from OPNSense, and the issue only ever occurs on Node 2. Node 1 never fails or has any problems whatsoever.
Problem:
Every now and then, without any resource crunch or visible network issues, Node 2 gets disconnected from the cluster. Nothing can be done with the node at that point, and none of its workloads remain accessible. Once I turn it off (by pressing the shutdown button directly and letting it shut down gracefully) and start it up again, it works as if nothing happened.
I am not sure what to look for to resolve this problem. I have been running this setup for around 6 months, and only this month have things started to break like this.
Just today I hit this issue twice within about 8 hours. What could be going wrong?
Best Regards