Hi guys,
I know how to fix this, but I am wondering why it happens.
When I reboot a node (or several nodes) for an update, everything works perfectly.
However, once it is done I always end up with a few of these:
Possible data damage: 7 pgs recovery_unfound
pg 1.12 is active+recovery_unfound+degraded, acting [9,0], 1 unfound
pg 1.22 is active+recovery_unfound+undersized+degraded+remapped, acting [0], 1 unfound
...
If I leave the system alone, the errors are not fixed automatically and I have to fix them manually,
e.g. ceph pg 1.12 mark_unfound_lost revert
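To show what the manual step looks like for several PGs, here is a minimal Python sketch. It assumes the health output has exactly the line format pasted above (I believe this is what `ceph health detail` prints, but treat that as an assumption). The helper name `unfound_pgs` is made up for illustration. It only prints the revert commands and does not run them, since reverting discards the newer version of each unfound object.

```python
import re

# Matches lines such as:
#   pg 1.12 is active+recovery_unfound+degraded, acting [9,0], 1 unfound
# The format is assumed from the output pasted in this post.
PG_LINE = re.compile(r"^\s*pg (\S+) is (\S+), acting \[([\d,]*)\], (\d+) unfound")


def unfound_pgs(health_detail: str):
    """Return (pgid, state, unfound_count) for each PG line reporting unfound objects."""
    result = []
    for line in health_detail.splitlines():
        m = PG_LINE.match(line)
        if m and "recovery_unfound" in m.group(2):
            result.append((m.group(1), m.group(2), int(m.group(4))))
    return result


sample = """\
Possible data damage: 7 pgs recovery_unfound
pg 1.12 is active+recovery_unfound+degraded, acting [9,0], 1 unfound
pg 1.22 is active+recovery_unfound+undersized+degraded+remapped, acting [0], 1 unfound
"""

for pgid, state, count in unfound_pgs(sample):
    # Printed only, never executed: review each PG before reverting.
    print(f"ceph pg {pgid} mark_unfound_lost revert   # {count} unfound, {state}")
```

With the two sample lines above this prints one revert command for pg 1.12 and one for pg 1.22.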
I am wondering if there is something wrong with my configuration.
I use 1 SSD in each node as a cache and then 2 HDDs (on each node) for storage.
There are 4 nodes and 3 monitors (nodes 2, 3 and 4).
...
...
Lastly,
all the OSDs are green and running the current version (14 Feb 2020).
Any thoughts are greatly appreciated. Or is this expected behavior?
Thanks,
Damon