Hello,
I'm having a weird issue that I can't seem to find an answer for and not sure what search terms to google for to try and find any solutions.
I have two HP Proliant Gen 8 machines with dual CPUs and 64GB of RAM each. I have set the onboard controller to configure the disks as raid 1+0. The SATA controller has one SSD for the Proxmox system and a spinning 2TB disk split into two for more local and slower storage. I have gone through and set the Proxmox servers up as dual primary running drbd over the SAS drives and also the two halves of the 2TB HDD (3 drbd resources in total).
After installing a mix of linux and Windows machines, I am seeing pauses/hangs from the virtual machines, especially after more activity from users. Essentially, I have three windows 7 VMs, one win server 2012 a debian-based OMV server and a zentyal machine. The pauses/hangs even affect rlogin sessions to the Proxmox boxes. I do not see high CPU usage on either box just after they un-freeze, nor excessive IO wait etc. No indication of anything wrong.
I have read all I can about best practices for the virtual machines, and am running DirectSync on the vm disks. All machines run virtio for the disks.
Today, I stopped all virtual machines, stopped the drbd service and re-enable barriers md and disk write flushes, then restarted the drbd service and the VMs - but there was no improvement - still seems as bad as ever.
I have two dedicated 10Gb NICs for the DRBD replication and they are hard wired and configured in a bond0 with "balance-rr" round-robin mode, and separate NIcs for the network access to the guest machines.
Any help would be gratefully received and thanks for reading,
YellowShed
I'm having a weird issue that I can't seem to find an answer for and not sure what search terms to google for to try and find any solutions.
I have two HP Proliant Gen 8 machines with dual CPUs and 64GB of RAM each. I have set the onboard controller to configure the disks as raid 1+0. The SATA controller has one SSD for the Proxmox system and a spinning 2TB disk split into two for more local and slower storage. I have gone through and set the Proxmox servers up as dual primary running drbd over the SAS drives and also the two halves of the 2TB HDD (3 drbd resources in total).
After installing a mix of linux and Windows machines, I am seeing pauses/hangs from the virtual machines, especially after more activity from users. Essentially, I have three windows 7 VMs, one win server 2012 a debian-based OMV server and a zentyal machine. The pauses/hangs even affect rlogin sessions to the Proxmox boxes. I do not see high CPU usage on either box just after they un-freeze, nor excessive IO wait etc. No indication of anything wrong.
I have read all I can about best practices for the virtual machines, and am running DirectSync on the vm disks. All machines run virtio for the disks.
Today, I stopped all virtual machines, stopped the drbd service and re-enable barriers md and disk write flushes, then restarted the drbd service and the VMs - but there was no improvement - still seems as bad as ever.
I have two dedicated 10Gb NICs for the DRBD replication and they are hard wired and configured in a bond0 with "balance-rr" round-robin mode, and separate NIcs for the network access to the guest machines.
Any help would be gratefully received and thanks for reading,
YellowShed
Last edited: