Recent content by hohl

  1. H

    Live-Migration almost freezes Targetnode

    Seeing the same issue with target node freezes while using live migrations with Proxmox 7.2 here (but also already happened with 7.1). All nodes use Enterprise NVMe 4.0 disks, ZFS RAID 10, 10G NICs, latest AMD EPYC CPUs and a mixture of Supermicro and Dell mainboard. Discard/trim is enabled for...
  2. H

    SEV Init Error 0x13 after reboot

    Never mind; giving up and just waiting on that INIT error screen made it pass. I just never waited over 15 minutes besides while I was writing this post.
  3. H

    SEV Init Error 0x13 after reboot

    After a reboot of an Supermicro AMD EPYC server, Proxmox 7.1 suddenly stopped to work. Reboots worked before, I don‘t see what changed, but can‘t even look much into the node since its not booting up at all. GRUB loads, it appears to start and then all I get is: ccp 0000:4a:00.1: SEV: failed...
  4. H

    Cloud-Init suddenly stopped working for CentOS/Fedora templates

    CentOS 8 images seem to work, so might be enough to get it working.
  5. H

    Cloud-Init suddenly stopped working for CentOS/Fedora templates

    Exactly, CentOS 7 only gets security updates and fixes for mission critical bugs since August 2020, so I wouldn't count on that either.
  6. H

    Cloud-Init suddenly stopped working for CentOS/Fedora templates

    As mentioned in the ticket, the 19.4-11 release does not yet support the "static6" keyword. You will need to upgrade to newer images that ship with CloudInit 21.4 by default or edit your existing images to include a newer version. In general, that shouldn't be an issue: most Linux distributions...
  7. H

    [SOLVED] One Proxmox node can't reach the other one via SSH

    Oh no, I found it. Recently introduced some SSH hardening policy which is enforced via automation on all Linux machine and that policy includes `PermitRootLogin no`. Since it's automatically applied to all Linux machines, it was also applied to the Proxmox hypervisors. If anybody else stumbles...
  8. H

    [SOLVED] One Proxmox node can't reach the other one via SSH

    It seems one Proxmox node can't reach the other one via SSH. That blocks features such as the VNC console (on the other node in the same cluster) and live migrations. For example: root@5.255.77.104: Permission denied (publickey). TASK ERROR: Failed to run vncproxy. I've found a few seemingly...
  9. H

    [SOLVED] How much resources does the Proxmox backup functionality use?

    I'd be especially interest in how Proxmox handles the compression for backups? Is this done in RAM (does it require the entire disk to be loaded into RAM)? How would that work out with large VMs? Let's say, a virtual server that runs on a 800 GB virtual disk. Would this cause extreme pressure to...
  10. H

    [SOLVED] LVM-Thin and reported disk usage: reserved or actual usage?

    When using LVM-Thin, the storage panel only reports one figure for used storage space. Is this the amount of storage reserved or the actually used one? As far as I understand thin provisioning there should be two values? Sorry, if that's mentioned somewhere. I've checked the docs but couldn't...
  11. H

    Cloud-Init suddenly stopped working for CentOS/Fedora templates

    In case anybody stumbles over this post in search for a solution: I do believe I have found the source of this issue. The latest Proxmox version relies on a CloudInit keyword (`static6`) that is only supported by the most recent CloudInit release, but CentOS, RHEL, Fedora, AlmaLinux, Oracle...
  12. H

    Cloud-Init suddenly stopped working for CentOS/Fedora templates

    As I probably left out some important information: below is the log of which packages where installed/upgraded. Full DPKG log is attached too. $ sudo grep -E " (install|upgrade) " /var/log/dpkg.log.1 2021-03-09 13:14:49 upgrade base-files:amd64 10.3+deb10u6 10.3+deb10u8 2021-03-09 13:14:49...
  13. H

    Cloud-Init suddenly stopped working for CentOS/Fedora templates

    Hi there, I installed a few minor updates on two (so far) Proxmox hypervisors and suddenly all RHEL-related templates used seemed to have stopped working. The templates get cloned, boot up and they learn username+password from cloud-init, but then they don't learn any of the provided static...
  14. H

    [SOLVED] Repeatedly getting "Garbage collection failed: Server shutdown requested - aborting task" every time

    Oh... yeah, that could be it. Very likely actually as it has to restart `proxmox-backup-proxy` to apply the SSL certificates. (Or, maybe I just need to adopt the renewal script to only reload if the certificate actually got renewed.) I just moved LetsEncrypt to 0:50Z, lets see if that fixes the...
  15. H

    [SOLVED] Repeatedly getting "Garbage collection failed: Server shutdown requested - aborting task" every time

    No, but it just happened again: got a mail for a failed garbage collection task at 0:00 UTC tonight. Syslog around this time says something about a restart of some service at that time. Maybe it's just some inconvenient timely overlap of some scheduled auto-restart/reload task? Mar 9 23:03:10...