I also have a similar issue with the cluster not responding properly. The VM's still work but proxmox is not communicating across the cluster.
I had a similar issue with v3.x but I could clear it by resetting services cmon and pve-cluster on each node in the cluster. This fixed the issue and...
The only other thing out side of normal installs of Proxmox and Ubuntu guests is one of the clusters uses ceph but I have another cluster that does not use ceph and uses local LVM on raid5 for block storage and I have the same issue on both clusters...
Are there others that experience this...
I have been having an issue with random VM crashes over the last few months. The problem happens with several different VMs running Ubuntu 12.x and 14.04 and on two different clusters running PVE v3.3-5 or v3.4-1.
Since the crashing is abrupt, there is nothing in the logs of the VMs...
Looks like I may have also run into this issue with grub. :(
I just finished installing v3.3 on a clean PowerEdge 415 with 2 drives in raid1 config.
All went well so I setup the "pve-no-subscription repository" and did a apt-get update && apt-get dist-upgrade.
At some point during the upgrade...
You have an interesting point. To test this, I went to a host that has a 11GB dump file on it that was written and untouched over a week ago so it should not be in any buffers on the OSDs or other. I did a read test and got the following:
/d1/dump# dd of=/dev/null bs=1M...
To test sequential read/write performance I use the following script:
rm foo? >/dev/null 2>&1
printf "Writing 10 files:\n"
for c in 0 1 2 3 4 5 6 7 8 9
dd if=/dev/zero bs=1M count=$cnt...
I think this may have something to do with the repositories.
I setup the following repository:
# PVE pve-no-subscription repository provided by proxmox.com, NOT recommended for production use
deb http://download.proxmox.com/debian wheezy pve-no-subscription
so maybe the original install was...
Looks like I found the issue, for some reason, the packages were uninstalled!
When I did the pveversion -v it said that the program was not installed so I looked into this and found a thread that said to install proxmox-ve-2.6.32 so I did apt-get install proxmox-ve-2.6.32. I was then able to...
I just completed building a new cluster of 12 nodes and I ran into a big problem where the ProxMox cluster fails!
This cluster consists of 12 servers with: 128gigs ram, 16 cores, 10gbit net, 2drives in raid1 for OS and 8 4tb drives for ceph.
On each server I installed...
I was able to install and test a small cluster using PVE v3.2 and set up a ceph cluster. I would also like to setup an MDS so I can start testing CephFS.
I know cephfs is not yet considered production ready but I need to do some testing to be ready for when it does become production...
I'm looking to build a larger cluster using PVE V3.2 with ceph and wanted to know what the max size is that I can go to.
We are currently looking at 18 nodes with:
- Two 8 core AMD CPUs
- 128Gig ram
- 2 smaller SAS disks for base OS
- 4 x 4gig sas drives as ceph OSDs...