Topic: Proxmox 6.2 on ZFS (live migration included) on HPE Proliant servers. (Sorry for such a long post.)
Ceph is not really an option, because it requires (or at least strongly recommends) a network of at least 10 Gbit/s.
I have some older and some newer HP servers, such as:
- HP Proliant DL380 G6 / Smart Array P410i (old junk, good for a test lab or similar)
- HP Proliant DL360 Gen8 / Smart Array P420i
- HP Proliant DL360 Gen10 / HPE Smart Array P408i-a SR Gen10
- HP Proliant DL380 G6 / Smart Array P410i, storage options:
1. lvm-thin
- PROs: hot swap, officially supported
- CONs: live migration is painful (thin volumes become thick, at least during the migration); start a migration and go for coffee, lunch, maybe dinner; possible bit rot will remain undetected
2. JBOD or IT mode is not supported on the P410i (unless Itanium), so ... it's a no-no
3. zfs raid0 over hardware raid (i.e. raid6 onboard)
- PROs: hot swap for disks, the array gets rebuilt automatically
- CONs: probably the worst solution; a single case of bit rot and zfs will go crazy, because zfs has no redundancy of its own here and cannot correct the data (it can only detect the checksum mismatch)
4. zfs raidz2 over N x hardware raid0
- PROs: using "what you already have"
- CONs: no hot swap; you must reboot when changing a failed drive (or at least re-create the raid0 array with the new disk, remove the old raid0, and add the new raid0 device to zfs)
5. zfs raidz2 over disks connected to some extra HBA board
- PROs: best version for zfs (direct access to disks)
- CONs: need to buy new hardware and new data cables; if the new board is not HP branded, the server fans will go crazy
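For option 4, the zfs side of a drive swap could look roughly like the sketch below. It only builds and prints the commands, it does not run them. The pool name `tank` and the device paths are placeholders I made up, and re-creating the single-disk raid0 logical drive on the Smart Array controller is left as a manual step, since that depends on the controller tooling.

```python
def replacement_commands(pool, old_dev, new_dev):
    """Return the zpool commands for swapping one failed raidz2 member.

    pool, old_dev and new_dev are hypothetical placeholders; adjust them
    to whatever `zpool status` shows on the real system.
    """
    return [
        # 1. See which member is faulted.
        f"zpool status {pool}",
        # 2. Take the failed member offline before pulling the disk.
        f"zpool offline {pool} {old_dev}",
        # (manual step, not generated here: swap the disk and re-create
        #  the single-disk raid0 logical drive on the controller)
        # 3. Tell zfs to resilver onto the new logical drive.
        f"zpool replace {pool} {old_dev} {new_dev}",
        # 4. Watch the resilver progress.
        f"zpool status {pool}",
    ]


if __name__ == "__main__":
    for cmd in replacement_commands("tank", "/dev/sdc", "/dev/sdh"):
        print(cmd)
```

Whether the re-created raid0 volume comes back under the old device name or a new one is something I have not verified, which is why old and new device are separate arguments.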
Questions:
Frankly, I am inclined to go with option 4 (raidz2 over N x hardware raid0): as I said, not an optimal solution, but it needs zero investment in an already archaic server.
Has anyone tested this scenario and can answer these questions before I get to run some tests myself?
Q1) Can the server boot "as is" from such a configuration, or must I also add a raid1 for the system partition?
Q2) How painful is it for zfs when a hard drive malfunctions? As far as I have read, zfs will only see read errors from that drive and will eventually kick the drive out for that reason.
Q3) Has anyone had problems when changing failed hard drives, or at the very next boot (when disks were changed "online")?
Q4) Also, can anyone recommend an HBA board for option 5 (so the fans will not go mad and zfs recovery from failed drives will be easier)?
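Regarding Q2, one way to watch for a drive that is piling up read errors is to look at the READ/WRITE/CKSUM counters in `zpool status`. The sketch below parses that table and lists the rows with non-zero counters. The sample text is hand-written for illustration (pool `tank`, disks `sda` to `sdd`), not output from a real machine, and on a live system the text would come from running `zpool status` instead.

```python
# Hand-written sample in the layout `zpool status` uses; not real output.
SAMPLE_STATUS = """\
  pool: tank
 state: DEGRADED
config:

        NAME        STATE     READ WRITE CKSUM
        tank        DEGRADED     0     0     0
          raidz2-0  DEGRADED     0     0     0
            sda     ONLINE       0     0     0
            sdb     ONLINE       0     0     0
            sdc     FAULTED     37     0     0  too many errors
            sdd     ONLINE       0     0     0

errors: No known data errors
"""


def devices_with_errors(status_text):
    """Return (name, state, read, write, cksum) for rows with non-zero counters."""
    rows = []
    in_config = False
    for line in status_text.splitlines():
        fields = line.split()
        if fields[:5] == ["NAME", "STATE", "READ", "WRITE", "CKSUM"]:
            in_config = True  # the device table starts after this header
            continue
        if not in_config:
            continue
        if len(fields) < 5:
            in_config = False  # a blank or short line ends the table
            continue
        name, state, read, write, cksum = fields[:5]
        # Counters are compared as strings so suffixed values such as
        # "1.2K" are also treated as non-zero.
        if (read, write, cksum) != ("0", "0", "0"):
            rows.append((name, state, read, write, cksum))
    return rows


if __name__ == "__main__":
    for row in devices_with_errors(SAMPLE_STATUS):
        print(*row)
```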
- HP Proliant DL360 Gen8 / Smart Array P420i
1. lvm-thin: as above, not optimal for live migration
2. JBOD is supported, but AFAIK you cannot boot from a non-raid drive, so you need to install proxmox on some SSD board or use 2 bays to create a raid1 installation disk (assuming mixed mode, i.e. both raid and JBOD, is supported)
3. zfs raid0 over hardware raid (i.e. raid6 onboard); as I said before, the worst solution possible, silent errors will turn into a disaster
I've just read that mixed mode (JBOD and RAID) is not (yet) supported on the P420i, so the best solution seems to be installing or moving the system (or at least the /boot partition) to an onboard flash drive. Is there a better solution?
- HP Proliant DL360 Gen10 / HPE Smart Array P408i-a SR Gen10
You must disable both secure boot and UEFI (although debian 10 will boot with secure boot, the installed proxmox won't; it will surely not boot securely after a "magical" install either); so ... boot in legacy mode.
They say you can use at least 4 disks in IT mode to create a zfs raidz2 array and the system should boot; can anyone confirm?
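For sizing such a raidz2 array, a rough rule of thumb is that two disks' worth of space goes to parity. The sketch below is only that approximation: it ignores zfs metadata, padding and the TB vs TiB difference, and the 2 TB disk size is just an example value, not something from my servers.

```python
def raidz2_usable_tb(disk_count, disk_size_tb):
    """Approximate usable space of one raidz2 vdev: (N - 2) * disk size."""
    if disk_count <= 2:
        raise ValueError("raidz2 needs more disks than its two parity disks")
    return (disk_count - 2) * disk_size_tb


if __name__ == "__main__":
    for n in (4, 6, 8):
        print(n, "disks:", raidz2_usable_tb(n, 2.0), "TB usable (approx.)")
```

With only 4 disks, half of the raw space goes to parity, so the array tolerates two failed disks but is no more space-efficient than mirrors.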
Thanks! (and again sorry for such a long "story")