Hello, all.
Happy New Year. I hope 2025 is wonderful for you.
I have been wrestling with how to properly deploy Proxmox and hope I can get some thoughts here, please.
I have included a simple diagram to help illustrate my environment and what I'd like to accomplish. I am asking for advice on the best way to accomplish my goals.
I have a main server (sol) running PVE which serves out ~30 LXC services. It also has a TrueNAS VM that I pass an HBA to. The TrueNAS VM manages a ZFS RAIDZ2 pool with datasets such as abyss/family (all family photos/videos), abyss/dox (all scanned documents), etc. The reason it is set up this way is that I migrated from ESXi 6.7 to PVE about 3 months ago, but having seen the power of PVE, I'd like to drop the TrueNAS VM if possible. This server has 2 zpools: rpool for PVE + LXCs/VMs, and abyss for all my file storage needs.
I have a secondary server (proxima) that I just built and populated with old drives to act as a backup in case the primary has an issue. It has no LXCs or VMs yet, just a ZFS RAIDZ1 pool that the primary server (sol)'s TrueNAS VM performs ZFS replication to. This server has 2 zpools: rpool for PVE + LXCs/VMs, and abyss as a ZFS replication target for backing up the primary server's abyss zpool.
I have clustered the primary and secondary servers, but they are not in HA (yet). I have a 10GbE link between these servers, with 2 more 10GbE ports available on my Brocade ICX6450-48P switch. Once I get this figured out, I would move these 2 servers into HA with a QDevice, or I can repurpose my daily driver as a 3rd node in the HA cluster.
What I'd like to have happen:
1 - Retire the TrueNAS VM
2 - Import the ZFS Pool into the primary server's PVE for control
3 - Create snapshots of the data stored in that ZFS Pool (abyss)
4 - Perform ZFS replication to the secondary server's ZFS Pool (also named abyss) for a subset of datasets (I don't have enough storage on the secondary to back up all the datasets on the primary)
5 - When the primary server goes down, fail over several services to the secondary (e.g. Jellyfin, Immich, DNS, NPM)
6 - Failed-over services use the local ZFS Pool of the server they are active on for their data (so, for example, Immich doesn't try to access the primary server's dataset while the primary is down for maintenance)
7 - When the primary is back up, fail back over to the primary and perform ZFS replication back to the primary from the secondary
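To make goals 3 and 4 a bit more concrete, here is a rough, untested sketch of what I picture for replicating only a subset of datasets. It only builds the command lines rather than running them. The snapshot names (auto-YYYY-MM-DD), the SSH target root@proxima, and the choice of which datasets go in the subset are my own assumptions for illustration, not something I have working.

```python
import shlex

# Assumptions for illustration only: the subset of datasets to replicate,
# the SSH target, and the snapshot naming scheme.
REPLICATED = ["abyss/family", "abyss/dox"]
TARGET = "root@proxima"


def replication_commands(dataset, prev_snap, new_snap, target=TARGET):
    """Build the commands to snapshot one dataset and send it incrementally.

    Returns two shell command strings: the local snapshot, then the
    incremental send piped over SSH into a receive on the target.
    """
    new_full = shlex.quote(f"{dataset}@{new_snap}")
    snapshot = f"zfs snapshot {new_full}"
    send = f"zfs send -i {shlex.quote('@' + prev_snap)} {new_full}"
    # -F rolls the target back to its latest snapshot before receiving,
    # which discards any changes made on the target since then.
    recv = f"ssh {shlex.quote(target)} zfs receive -F {shlex.quote(dataset)}"
    return [snapshot, f"{send} | {recv}"]


if __name__ == "__main__":
    for ds in REPLICATED:
        for cmd in replication_commands(ds, "auto-2025-01-01", "auto-2025-01-02"):
            print(cmd)
```

Failing back (goal 7) would presumably be the same thing in the opposite direction, from proxima to sol, which is part of what I am unsure about.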
End state would be a primary server that hosts several LXCs as well as a large ZFS pool as a file server (with ~2 weeks of snapshots), with a secondary server that has a ZFS pool that the primary sends ZFS replication to, and runs services in case the primary is down.
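For the ~2 weeks of snapshots, this is roughly the retention logic I have in mind, again as an untested sketch. It assumes daily snapshots named dataset@auto-YYYY-MM-DD (my own hypothetical naming) and only decides which ones are old enough to destroy; it does not call zfs itself.

```python
from datetime import date, timedelta


def expired_snapshots(snapshots, today, keep_days=14):
    """Return the snapshot names dated more than keep_days before today.

    Assumes every name ends in '@auto-YYYY-MM-DD' (hypothetical scheme).
    """
    cutoff = today - timedelta(days=keep_days)
    expired = []
    for name in snapshots:
        stamp = name.rsplit("@auto-", 1)[-1]
        if date.fromisoformat(stamp) < cutoff:
            expired.append(name)
    return expired


if __name__ == "__main__":
    snaps = [
        "abyss/family@auto-2025-01-01",
        "abyss/family@auto-2025-01-06",
        "abyss/family@auto-2025-01-10",
    ]
    # Each name returned here would be a candidate for `zfs destroy`.
    for name in expired_snapshots(snaps, today=date(2025, 1, 20)):
        print(name)
```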
Thank you for your help!