I read full mesh article but I have tested for 1 year an improved solution that uses batman routing protocol.
The batman routing protocol is a level 2 mesh protocol for wireless networks but it works perfectly with ethernet (I am testing with 40g network).
So I can do a three servers...
I have a three node proxmox/ceph cluster. Each node has two nvme as ceph osd. Replica is 3. Network is 40gb.
OSD are 60% full.
Now one osd breaks. Considering I have followed all guidelines (replica 3, min 2, fast network and so on) I expected that I have no problems.
But ceph on the...
I have a proxmox 6.4 cluster with ceph 14.2.20. If I do long backups (for example full backups) the day after people have micro pauses using windows remote desktop. If I migrate the windows vm to another server the VM start working perfectly.
With fast backups (for example incremental...
I have a three node Proxmox/ceph cluster with an external nfs NAS.
I write backups on that nas. Unfortunately the nas has disconnected for an hardware problem.
Linux disconnected nfs (why??) and started to write on mount point in /mnt. So it has filled the root partition blocking the ceph...
I have a Proxmox 6.4 cluster wiith ceph 14.2.20 with three servers, each one with: 192gb ram, 48 core, 4 ssd, 10gb ethernet, light load (few vms)
One of the servers has filled root partition due to a failing nfs mount (another thread), so immediately ceph mon stopped working.
I am trying latest proxmox 6.4-6 on several clusters.
Situation is improved but there are still problems.
I start a snapshot type backup of a VM.
The VM starts to become not usable: it hangs several times during backup.
If backup is too long (big hdd) or if backup server is too fast ( I...
I have a PBS 1.0-11 with 1.5 tb of space. I am backupping three VMs with disks 800 800 and 500. Disks are almost empty if I do backups with zstd I get vma.gz of 390, 270, 59 gb. Now I use PBS and I fill 1.5gtb without backupping the third VM. It seems compression is not applied but I have...
I have several proxmox clusters on HP G8 and G9 servers, with Windows Server VMs.
Terminal servers users complains about "disk slowness" (I have ssd and nvme...), "micro interruptions" and so on.
It seemed an unsolvable problem until I discovered that changing in ilo4/power settings from...
I would like to monitor all my proxmox/ceph installations:
- check that backups start;
- check errors after backup run;
- check if ceph osd is down;
- check if some ha vm is in error state
I plan to use influxdb with telegraf or opendistro for elasticsearch or others.
I started with...
I have removed a proxmox node.
By mistake I forgot to migrate some VMs.
The disks for these machines are on ceph, so I have only lost their .conf.
I have backups so I try to write a xxx.conf (recovered with vma config) in /etc/pve/qemu-server of another node.
Unfortunately even if...
I have a proxmox cluster of three HP DL380 G9 with 192gb ram each, ssd disks, raid controller with bbu ,10g ethernet and so on.
I am using windows 10 and 2019 VMs on ceph with krbd, writeback, iothread and all options that can improve performance.
So I have installed NVMe disks and built...
I would like to add multiqueue support.
- number of queues must be the same of number of cpus: false, in proxmox I can put 8 multiqueues and not more
- in linux vm I must give the command ethtool -L ens1 combined X.... but unfortunately I am using windows VM
Can someone explain...
I have upgraded two proxmox ceph cluster to 6.0 and then 6.1
I had backups of type "snapshot" and when the backup was running the virtual machine backupped was slow but usable.
Now the virtual machine hangs without responding until the end of backup and then starts working again.
I have rebooted a node of proxmox 5.4 cluster.
Now it is always in wait_for_agent_lock state.
Some virtual machines are in fence state.
Corosync works, I have looked in many logs without finding errors.
I receive via mail every minute these messages:
FENCE: Try to fence node 'pvehp2'...
Proxmox VE is a great product and with last version is even bigger.
I have some big customers interested but only two important features are missing:
- load balancing
- maintenance mode
I have seen the second one is on the roadmap, but can someone tell me if load balancing of vm will be...
I have a VM with cloudinit drive that does not start anymore (if I remove cloudinit drive it starts).
It gives the following error:
rbd: create error: (17) File exists
TASK ERROR: error with cfs lock 'storage-raid2c128fast_vm': rbd create vm-167-cloudinit' error: rbd: create error: (17)...
Proxmox used to support hardware fencing devices and ilom/ipmi fencing.
Now it supports only watchdog fencing.
I have a server in a proxmox cluster that, sometimes, loses all hard disks: it is a partial failure.
The watchdog timer does not start, the server seems alive but ceph osds are...