Search results

  1. G

    [SOLVED] Certain VMs from a cluster cannot be backed up and managed

    So it seems this issue does not just go away. I had one node where all VMs except one exhibited this issue. Only some of the qmp commands are working. Shutdown and some others work, but not migrate, backup, or even monitor (the console isn't working). And in time this seems to "get caught" by...
  2. G

    [SOLVED] Certain VMs from a cluster cannot be backed up and managed

    We have a 4-node cluster of Proxmox 6: proxmox-ve: 6.0-2 (running kernel: 5.0.18-1-pve) pve-manager: 6.0-5 (running version: 6.0-5/f8a710d7) pve-kernel-5.0: 6.0-6 pve-kernel-helper: 6.0-6 pve-kernel-4.15: 5.4-7 pve-kernel-5.0.18-1-pve: 5.0.18-3 pve-kernel-4.15.18-19-pve: 4.15.18-45...
  3. G

    Corosync fails randomly on PVE 6.0-5

    It seems that this was somehow related to some network errors (someone made a loop in another part of the network and we missed the notification). This is weird since - the access switches have BPDU guard activated so the loop is not propagating into the network and - the virtualization...
  4. G

    Corosync fails randomly on PVE 6.0-5

    It is happening again. I see that 2 of the nodes see each other, in pairs. They had different Ring IDs, 1 and 2. I stopped corosync on all nodes, started up and it is working again.
  5. G

    Corosync fails randomly on PVE 6.0-5

    Today we had an issue related to corosync, I suppose. The whole cluster disintegrated and nodes started rebooting. I even powered off all nodes and started them again, which seemed to fix the issue for the moment. There are 4 nodes and sometimes they just stopped seeing each other. Sometimes they...
  6. G

    ProxMox Freezing On AMD RYZEN Machines

    Try setting rcu_nocbs=0-N in the kernel command line, where N is the number of threads minus 1. For 8 threads it is rcu_nocbs=0-7. I had to set this on my 2200G; otherwise it would lock up randomly. With this setting it never hangs (it runs 24/7).
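
The rule in the post above (N is the thread count minus 1) can be sketched as a small helper. This is only an illustration: the function name `rcu_nocbs_param` is hypothetical, and using `os.cpu_count()` as the thread count is an assumption, not something the post specifies.

```python
import os


def rcu_nocbs_param(threads: int) -> str:
    """Build the rcu_nocbs kernel parameter covering CPU threads 0..threads-1."""
    if threads < 1:
        raise ValueError("threads must be at least 1")
    return f"rcu_nocbs=0-{threads - 1}"


if __name__ == "__main__":
    # Assumption: the logical CPU count reported by the OS is the thread count.
    # os.cpu_count() may return None, so fall back to 1 in that case.
    threads = os.cpu_count() or 1
    print(rcu_nocbs_param(threads))
```

The printed string is what would be appended to the kernel command line; how that is done depends on the bootloader in use.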
  7. G

    How to upgrade a Proxmox 5 cluster to Proxmox 6

    Oh well. I wanted to add a new node to the cluster before migration. I installed PVE 5.4 on the new node and joined it successfully. So I will just upgrade the cluster now.
  8. G

    How to upgrade a Proxmox 5 cluster to Proxmox 6

    Hi, We have a 3-node Proxmox 5 cluster and we need to upgrade it to 6. So, what is the procedure exactly? It seems that we cannot add a new Proxmox 6 node to a Proxmox 5 cluster, so I just want to know how we can go about this. Can we do the in-place upgrade of the nodes as we did before for...
  9. G

    SPICE not working with Let's Encrypt (wildcard) certificates

    Hi, We started using Let's Encrypt certificates on our Proxmox cluster. - The domain name used is different from the internal LAN domain name. The servers were installed with the internal hostnames. - The certificates are generated on a different machine. - We use wildcard certificates...
  10. G

    VM crash on console attempt

    Not sure, RDP never crashed for me. It seems that using qxl as the video adapter works around this issue? I had 2 VMs, both Server 2019, which crashed after a few days if I clicked on the integrated console. I replaced the video adapter of one of them with qxl and today I tested them. The one with...
  11. G

    VM crash on console attempt

    Nothing there... Edit: It happened on the Dell server too. Even on Windows 10. Unfortunately, it is not 100% reproducible.
  12. G

    VM crash on console attempt

    I found nothing interesting in dmesg or Windows logs. Maybe the kvm process crashes?
  13. G

    VM crash on console attempt

    Hi, I observed a weird issue on Proxmox related to the web-based VNC console. Sometimes when I click on a Windows 2016 VM's VNC console, the VM crashes. This happens on a server with fairly recent updates, but doesn't on another that is in the same cluster but a bit behind. I supposed it is maybe an...
  14. G

    How to find out if a multi node backup job still running?

    Hi, Yes, I am aware of that. Is there a method of programmatically determining it (this was actually the question, but it seems I did not formulate it right), as in a single command? I did hack in some scripting involving lock files placed/deleted by the hook script that also checks for other...
  15. G

    How to find out if a multi node backup job still running?

    Edit: The question relates to scripting; I forgot to explicitly specify this. Hi, Is there a way to find out programmatically (edited) if a backup task is running if the backups are taken from multiple nodes concurrently? I see that there is an option to run scripts when backups are running...
  16. G

    Migrate 7-node PVE 4.4 cluster to a new subnet

    The older "servers" are in fact Optiplex desktops with i7 CPUs, so reboot speed is not an issue :)
  17. G

    Migrate 7-node PVE 4.4 cluster to a new subnet

    We need to change all IP addresses. I was thinking maybe we could get away with it if we changed them all at once, but I have since decided it is not worth the risk of ending up with a non-functional cluster (we need HA, live migration, one-node management) and unscheduled VM downtime. So we will take the safer route...
  18. G

    Migrate 7-node PVE 4.4 cluster to a new subnet

    I was aware of the fact that you cannot change one node's config in a cluster. The question was: if we change the configs (changes will be applied after reboot) and then take the WHOLE cluster offline at once, then start it up in the new location, what would happen? I could mess with config files...
  19. G

    Migrate 7-node PVE 4.4 cluster to a new subnet

    Hello, We have a 7-node PVE 4.4 cluster and we have to move it into another subnet. Can this be done by just changing the IP addresses in the web interface, shutting down the machines, changing the cables for the management interface (the links to the data storage are not changed) and starting them up...
  20. G

    Questions about backup names and vm ids

    Hi, I am testing a deployment of PVE and I have a few questions. Question 1: I see that the backups are named vzdump-qemu-VMID-date-time.vma.gz (for gzip format). This naming scheme is problematic because PVE tends to "fill the gaps" when VMs are deleted. If we have a VM, back it up then delete it...