We run a multi-node cluster on the latest Proxmox v4 release in production. The services hosted on this cluster cannot tolerate prolonged downtime; a few minutes (say, for a reboot) is the most we can allow.
We also have a Ceph cluster that provides the storage for our VPSes.
We would like to upgrade to Proxmox v5 and then Proxmox v6 but are extremely worried about the possible complications of the process.
It is our understanding that if things go wrong during the upgrade, we might have to restore our VPSes from backups, which would mean extensive and prolonged downtime.
Since it's not possible to upgrade from v4 to v6 directly, we would be doing it in two steps.
From your experience,
1. How risky is the upgrade? Are there high chances of things going wrong and extensive downtime happening?
2. What are the main things to keep in mind when upgrading to ensure it goes smoothly?
3. What is the most comprehensive and detailed guide you've seen for these upgrades?
4. What tests would you recommend we do before actually upgrading? Is it possible to somehow test the upgrade on 1 non-production node before doing it in production?
5. What are your recommendations to ensure that we have a solid contingency plan in case things don't go as planned?
Any and all help is appreciated!
Thank you.