Search results

  1. B

    move disk on live VM causes cluster node to reboot

    Hi All, To keep with our timeline we're going to back up and restore from shared storage ... I'm not planning on troubleshooting this. Just an FYI to any posters trying to help. best, James
  2. B

    [SOLVED] Unable to properly remove node from cluster

    Just wanted to confirm that this is an overzealous error message ... same exact issue for our 5 node cluster (7.0) when shrinking it to 3 nodes.
  3. B

    move disk on live VM causes cluster node to reboot

    Hello all, I'm trying to figure out why using move disk on a live VM causes one of our cluster nodes to reboot. We are looking to migrate a live VM from CEPH to LVM storage. The reason being that this will then enable us to live migrate the VM to a non-CEPH attached node. When we do this...
  4. B

    SWAP usage

    Thanks avw, Another backup triggered last night and swap is holding at about 6.5 or 8GB used. There is a significantly larger amount of storage on the node that has the 6.5GB of swap usage vs other cluster nodes. My guess is that this is why SWAP is used ... based on what you are saying...
  5. B

    SWAP usage

    Here's what I see in Proxmox ... this is Max memory for a day with a dot on the time that the problem started: Zabbix: It looks a lot like RAM is being moved to disk for some reason. That dip two days ago is the first time the backup ran.
  6. B

    SWAP usage

    Hi All, We recently started using the built-in backup with version 7 of Proxmox. When the backup runs it pushes up swap usage. Should I be concerned? This is the result of the backup running through all of the VMs on this node (1 of 3) in our cluster. Well, to be clear, I'm correlating the...
  7. B

    scheduled backups not starting

    Morning from PST all, I'm not sure what's changed but backups are working. The new backup set fired off and worked correctly last night. best, James
  8. B

    scheduled backups not starting

    Hi Dominik, Thank you for replying. I just checked and the last backup did trigger. The first backup was missed but the second did work. The backups occur every 2 days. I've added another set of backups ... there are now backups for even days and backups for odd. I'll check again tomorrow...
  9. B

    scheduled backups not starting

    Hi All, Is there a checkbox or CLI task that I'm missing? This is the first time that I've tried to use scheduled backups. I scheduled them in the GUI and see an entry in /etc/cron.d/vzdump so they should be working. I must be missing something obvious. best, James
  10. B

    [TUTORIAL] Dell Openmanage on Proxmox 6.x

    Hi All, So grateful for this community. To install OMSA 10.1.0.0 on Proxmox 7 on a Dell R730xd... 1) substitute: echo "deb http://linux.dell.com/repo/community/openmanage/10100/focal/ focal main" > /etc/apt/sources.list.d/linux.dell.com.sources.list wget...
  11. B

    Proxmox vs Win Server / HyperV 2019

    @aaron, thank you for replying. I found this today: https://certification.ubuntu.com/server/models?query=&vendors=Dell+EMC&release=20.04+LTS to help with certified hardware.
  12. B

    Proxmox vs Win Server / HyperV 2019

    Hi All, I could use some advice / pointing in the correct direction for a potential new project using Proxmox. We've had great success running a 3 node cluster with Proxmox and Ceph (in a DC) for more than a year now and are starting to think about using this to replace HyperV at a client's...
  13. B

    3 node cluster loosing network connectivity randomly

    @wolfgang, thank you. Currently I'm pursuing this here as I believe that this is related to a pfSense 2.4.5 bug. https://forum.netgate.com/topic/153663/performance-impact-of-clicking-apply-changes Also, your point about support ending is noted. Thank you.
  14. B

    3 node cluster loosing network connectivity randomly

    Hi All, We've been chasing an issue where our 3 node cluster (see attached version info) seemingly randomly loses connectivity to the Internet. When this happens multiple VoIP PBXs, a cPanel server, a UniFi server, etc. all lose connectivity to the Internet. I'm not 100% sure that this is a...
  15. B

    CEPH in WARN state for 54 min

    Thanks ... I can't see bringing up a local NTP server outside of this cluster right now. I'll wait and watch for the time being. I appreciate the feedback.
  16. B

    CEPH in WARN state for 54 min

    I should have noted that those emails were from Zabbix at 9:03 and 9:57 PM local time ... about 30 minutes after the 7.1 quake that hit Southern California. These servers are about a 1.5 hour drive from the epicenter but there was still plenty of movement. I did check PVE / the CLI. It showed...
  17. B

    CEPH in WARN state for 54 min

    Hi All, So I noticed our 3 node CEPH / Proxmox cluster in a WARN state a few minutes ago. By the time I started investigating the issue had resolved itself. --------------------- Problem started at 04:03:42 on 2019.07.06 Problem name: Ceph cluster in WARN state Host: AIT2 (ProxMox Cluster Node...
  18. B

    redundant separate 10GBit network

    Bridges? No, I haven't used this before. That sounds interesting. Our current setup only uses STP for the physical switch trunks. Hmm ... even RSTP may take more time than you would like to detect a failure and converge. Also, if you add VLANs to the mix do your switches support PVST? If...
  19. B

    redundant separate 10GBit network

    We use active / backup bonding for everything except corosync networks. It works well and hasn't caused any issues. We've had reason to bring a switch offline (firmware updates, etc.) and used these reasons as a test. We still schedule these offline periods for after hours but have never...
  20. B

    PVE 5.x - Is HA possible with RRP?

    Hi Romsch, My understanding is that out of the box the network needs to be made redundant in hardware. I've read about other options like Pacemaker but the recommendation is to go with hardware. Put another way, the cluster will protect you from server failure but not network failure...