I have been having issues with the clustering function on proxmox for a while now I hope someone can help.
I seem to be getting packet loss for what ever reason and I have no idea how to better diagnose this on our PROXMOX 5.3 system
As seen in this OMPING you can see there is loss (happens all the time)
Note: 10.50.0.10 Is that node IP address (the same node we are testing from within)
As you can see on the 10th packet it just stopps working. I can not seem to find much in my syslogs about this either.. and have a real hard issue because I can not use clustering becuase of this
System Info:
As you can see above we are running a lot of containers on this one machine too.
I seem to be getting packet loss for what ever reason and I have no idea how to better diagnose this on our PROXMOX 5.3 system
As seen in this OMPING you can see there is loss (happens all the time)
Note: 10.50.0.10 Is that node IP address (the same node we are testing from within)
Code:
root@matterhorn:~# omping 10.50.0.10 -c 20
10.50.0.10 : waiting for response msg
10.50.0.10 : joined (S,G) = (*, 232.43.211.234), pinging
10.50.0.10 : unicast, seq=1, size=69 bytes, dist=0, time=0.026ms
10.50.0.10 : multicast, seq=1, size=69 bytes, dist=0, time=0.038ms
10.50.0.10 : unicast, seq=2, size=69 bytes, dist=0, time=0.074ms
10.50.0.10 : multicast, seq=2, size=69 bytes, dist=0, time=0.089ms
...
10.50.0.10 : unicast, seq=8, size=69 bytes, dist=0, time=0.076ms
10.50.0.10 : multicast, seq=8, size=69 bytes, dist=0, time=0.092ms
10.50.0.10 : unicast, seq=9, size=69 bytes, dist=0, time=0.067ms
10.50.0.10 : multicast, seq=9, size=69 bytes, dist=0, time=0.080ms
10.50.0.10 : multicast, seq=10, size=69 bytes, dist=0, time=0.095ms
10.50.0.10 : given amount of query messages was sent
10.50.0.10 : unicast, xmt/rcv/%loss = 20/9/55%, min/avg/max/std-dev = 0.026/0.079/0.135/0.028
10.50.0.10 : multicast, xmt/rcv/%loss = 20/10/50%, min/avg/max/std-dev = 0.038/0.094/0.149/0.027
As you can see on the 10th packet it just stopps working. I can not seem to find much in my syslogs about this either.. and have a real hard issue because I can not use clustering becuase of this
System Info:
Code:
root@matterhorn:~# uname -r
4.15.18-8-pve
Code:
root@matterhorn:~# pveversion
pve-manager/5.3-5/97ae681d (running kernel: 4.15.18-8-pve)
Code:
root@matterhorn:~# pct list| wc -l
285
root@matterhorn:~# qm list|wc -l
11
As you can see above we are running a lot of containers on this one machine too.