Search results

  1. helojunkie

    How can a single CT take down an entire Node within a cluster?

    First, thank you for your help... So at the same time I upgraded to 8, I added two additional nodes. Both of those nodes were totally fresh, bare-metal installs, and then they were added to the cluster. I have tried the CT on every node in my cluster and regardless of which node I put it on...
  2. helojunkie

    How can a single CT take down an entire Node within a cluster?

    So, I am running a 6-node cluster, a 10GB dedicated backend Corosync network with no other traffic, and a 10GB frontend network dedicated to various VLANs and networks to run our VMs/CTs. Also running PBS for backups. 20GB bonded link to that system running 802.3ad. I have a business dedicated...
  3. helojunkie

    [SOLVED] One node in cluster going Grey in GUI after upgrading to 8.0.4

    In case anyone runs across this thread, I determined the problem. It was a single LXC container running Turnkey Core (16.1) as a media docker LXC. It had been running fine for several years, but I think when I updated Proxmox from 7.4 to 8.0.4 this caused an issue. It took some time to figure...
  4. helojunkie

    [SOLVED] One node in cluster going Grey in GUI after upgrading to 8.0.4

    I have a 6-node cluster that I recently (two nights ago) upgraded to 8.0.4. This cluster has been going strong with no issues at all until after the upgrade to 8.0.4. The upgrade went smoothly with zero issues at all. Now, one node in the cluster keeps 'greying out' on the GUI. I rebooted the...
  5. helojunkie

    Dell Server Hardware Advice

    Hello Everyone - We currently have a 4-node cluster running on Del R820s, each with 4 x E5-4650 2.7Ghz CPU, 256GB RAM, 10GB NICs, and mirrored SSD drives for VM/CT and NFS for other storage. We run Cisco Nexus 10G switching as our backend corosync separately from our 10G frontend. These...
  6. helojunkie

    [SOLVED] Replication failing in cluster but only for some CT/VMs

    Just a follow-up in case anyone runs across this, that was the issue, one of the systems had a different version on it somehow and after upgrading them all to 7, everything works again!
  7. helojunkie

    [SOLVED] Replication failing in cluster but only for some CT/VMs

    Thank you so much for your help, they were all installed at the same time but something obviously got upgraded on the one! I will work on upgrading the rest of the nodes one-at-a-time and let you know if that did the trick. Thank You again for your timely help and direction!
  8. helojunkie

    [SOLVED] Replication failing in cluster but only for some CT/VMs

    Thank you @Fabian_E OK, I have four systems, all of them were built and put into production the same day, all of them have been updated continually, and an apt update on all systems show them up to date. Of the four systems, replication is working as expected except that the 4th system (aptly...
  9. helojunkie

    [SOLVED] Replication failing in cluster but only for some CT/VMs

    I have a four-server cluster running HA and replication. I have multiple replication jobs running and most of them are replication just fine, but I have several that continue to fail with the following output and I cannot figure it out. I have deleted the replication jobs, readded them all to no...
  10. helojunkie

    DMESG errors on migrated VMs and CTs, then VMs crash

    Everything is the latest version, these are brand new installs. All OSs are affected across both cluster nodes but only on migrated VMs and CTs. If I create it from scratch on the node, the problem does not happen. Where is the journal, I will take a look.
  11. helojunkie

    DMESG errors on migrated VMs and CTs, then VMs crash

    The dmesg is from the host, not the proxserver, and yes, this happens to a host that is stopped, backed up on one Proxmox server, and then restored to the new Proxmox cluster and restarted. All proxmox servers store (and run) the images off locally attached NVMe or SSD ZFS storage. I backup the...
  12. helojunkie

    DMESG errors on migrated VMs and CTs, then VMs crash

    Hi Mira - I have a three-node cluster. Two of the nodes are Dell R820, 4 x CPU, 256GB RAM, 2 x HGST 12Gbs SAS SSD (OS) in a ZFS Mirror on an LSI 3008 IT mode HBA, 2 x Intel P4500 Series NVMe drives (4TB EACH) also in a ZFS mirror, these plug directly into the motherboard PCI slots as they are...
  13. helojunkie

    DMESG errors on migrated VMs and CTs, then VMs crash

    Hello Community - We built a new proxmox cluster (3 node) and started migrating VMs and CTs from an existing Proxmox single node unit to the cluster. I keep having issues with these nodes crashing with weird filesystem and mount errors. They run perfectly fine on the old proxmox unit and w are...
  14. helojunkie

    Multiple VMs Fails to Boot after restoring them to new cluster (but not all)

    Hello - I have a very weird issue I am trying to solve. I had a stand-alone promox server that has been running fantastic for quite some time. We decided to move to a more powerful proxmox custer. I utilize shard storage for all of our vzdump backups. After getting the new cluster up and running...
  15. helojunkie

    [SOLVED] ZFS RAID1 fails to boot after power failure

    We do, server going down wasn't the problem, it coming back up after I forgot to put grub back on it was :-). I had just figured that it copied 'everything' not just the ZFS. Lesson learned along with the fact that I should have rebooted anyway just to check! So two lessons learned. Thanks for...
  16. helojunkie

    [SOLVED] Installing From USB Hard/SSD Drive

    hummmm.......that is a good question. While I have not tried it personally, I would imagine that since you can install from a USB thumb drive and a USB CD/DVD drive that you should be able to install from a USB hard drive. If I were in your position I would throw Parted Magic or Hirens (or...
  17. helojunkie

    [SOLVED] Installing From USB Hard/SSD Drive

    ?? Are you trying to install from a USB CDROM drive or a USB thumb drive? It sounds like you are trying from a CD. As far as installing from a CDROM drive, I have done it in the past so I assume that it should still work. You may want to verify if your system itself supports booting from CD/DVD.
  18. helojunkie

    [SOLVED] ZFS RAID1 fails to boot after power failure

    Well as luck would have it, a while back I had to replace one of my RAID1 drives and they are hot-swappable. When I replaced the drive, it resilvered just fine and I went about my business. Fast forward more than a year later (without ever having to reboot the server) and the power goes out and...
  19. helojunkie

    [SOLVED] ZFS RAID1 fails to boot after power failure

    So the power company lost a transformer last night and our proxmox server went down after its large UPS died. This morning when trying to bring it back online it fails to boot. Controller sees all drives including the two SSD RAID1 drives that is the boot device. However when booting it gets...
  20. helojunkie

    ZFS Device Fault - Advice on moving device to new Port

    @Nemesiz - Thank You. The is the way it works on other ZFS systems, just wanted to make sure it was the same on Proxmox.

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!