Recently one of my PVE machines had a degraded RAIDz2 pool where 2 disks just disappeared, since the redundancy level is 2 I did not notice this at all in day to day work.
The disks returned to being detected with a reboot and resilvered in a few hours while I am mildly curious if there was a bug in one of the recent kernel builds or maybe hardware didn't reinitialize properly during boot the real question I was left with is:
Why wasn't this very dangerous state my machine was in reported in a prominent way?
PVE summary and PDM summary pages all showed nice green checkmarks, I don't recall why I even ended up looking at the disks I think I was playing around with something completely unrelated and realized I was missing disks.
Obviously I should spend some time and setup mail alerts too however to me it seems that a system that has a degraded pool should show warning signs on the summary pages/PDM too and not just buried in the ZFS settings page or zpool status command.
The disks returned to being detected with a reboot and resilvered in a few hours while I am mildly curious if there was a bug in one of the recent kernel builds or maybe hardware didn't reinitialize properly during boot the real question I was left with is:
Why wasn't this very dangerous state my machine was in reported in a prominent way?
PVE summary and PDM summary pages all showed nice green checkmarks, I don't recall why I even ended up looking at the disks I think I was playing around with something completely unrelated and realized I was missing disks.
Obviously I should spend some time and setup mail alerts too however to me it seems that a system that has a degraded pool should show warning signs on the summary pages/PDM too and not just buried in the ZFS settings page or zpool status command.