After upgrade to Proxmox 5.4.13 with have the following errors:
o2019-08-09 08:44:37.335346 osd.2 osd.2 10.10.3.153:6800/2479 22 : cluster [ERR] 1.5f shard 1 soid 1:fbdcc12c:::rbd_data.5cffec6b8b4567.0000000000003ac5:head : candidate had a read error
2019-08-09 08:44:42.872941 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16087 : cluster [DBG] pgmap v16091: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 60.1KiB/s rd, 204KiB/s wr, 25op/s
2019-08-09 08:44:44.893407 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16088 : cluster [DBG] pgmap v16092: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 83.2KiB/s rd, 291KiB/s wr, 34op/s
2019-08-09 08:44:46.913265 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16089 : cluster [DBG] pgmap v16093: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 78.9KiB/s rd, 269KiB/s wr, 30op/s
2019-08-09 08:44:48.932986 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16090 : cluster [DBG] pgmap v16094: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 77.2KiB/s rd, 249KiB/s wr, 27op/s
2019-08-09 08:44:50.953684 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16091 : cluster [DBG] pgmap v16095: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 91.7KiB/s rd, 308KiB/s wr, 38op/s
2019-08-09 08:45:01.104992 mon.prx01 mon.0 10.10.3.151:6789/0 3068 : cluster [INF] Health check cleared: OSD_SCRUB_ERRORS (was: 1 scrub errors)
2019-08-09 08:45:01.105055 mon.prx01 mon.0 10.10.3.151:6789/0 3069 : cluster [INF] Health check cleared: PG_DAMAGED (was: Possible data damage: 1 pg inconsistent)
2019-08-09 08:45:01.105103 mon.prx01 mon.0 10.10.3.151:6789/0 3070 : cluster [INF] Cluster is now healthy
2019-08-09 08:44:52.972941 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16092 : cluster [DBG] pgmap v16096: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 44.9KiB/s rd, 156KiB/s wr, 23op/s
2019-08-09 08:44:54.993444 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16093 : cluster [DBG] pgmap v16097: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 64.4KiB/s rd, 233KiB/s wr, 32op/s
2019-08-09 08:44:57.013227 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16094 : cluster [DBG] pgmap v16098: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 45.5KiB/s rd, 173KiB/s wr, 29op/s
2019-08-09 08:44:59.032939 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16095 : cluster [DBG] pgmap v16099: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 38.3KiB/s rd, 163KiB/s wr, 26op/s
2019-08-09 08:45:01.053997 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16096 : cluster [DBG] pgmap v16100: 128 pgs: 128 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 410KiB/s rd, 337KiB/s wr, 119op/s; 338KiB/s, 0objects/s recovering
2019-08-09 08:44:56.412409 osd.2 osd.2 10.10.3.153:6800/2479 23 : cluster [ERR] 1.5f repair 0 missing, 1 inconsistent objects
2019-08-09 08:44:56.412427 osd.2 osd.2 10.10.3.153:6800/2479 24 : cluster [ERR] 1.5f repair 1 errors, 1 fixed
We already repaired a lot of errors but new one are coutinously comming.
Any idea?
o2019-08-09 08:44:37.335346 osd.2 osd.2 10.10.3.153:6800/2479 22 : cluster [ERR] 1.5f shard 1 soid 1:fbdcc12c:::rbd_data.5cffec6b8b4567.0000000000003ac5:head : candidate had a read error
2019-08-09 08:44:42.872941 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16087 : cluster [DBG] pgmap v16091: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 60.1KiB/s rd, 204KiB/s wr, 25op/s
2019-08-09 08:44:44.893407 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16088 : cluster [DBG] pgmap v16092: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 83.2KiB/s rd, 291KiB/s wr, 34op/s
2019-08-09 08:44:46.913265 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16089 : cluster [DBG] pgmap v16093: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 78.9KiB/s rd, 269KiB/s wr, 30op/s
2019-08-09 08:44:48.932986 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16090 : cluster [DBG] pgmap v16094: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 77.2KiB/s rd, 249KiB/s wr, 27op/s
2019-08-09 08:44:50.953684 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16091 : cluster [DBG] pgmap v16095: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 91.7KiB/s rd, 308KiB/s wr, 38op/s
2019-08-09 08:45:01.104992 mon.prx01 mon.0 10.10.3.151:6789/0 3068 : cluster [INF] Health check cleared: OSD_SCRUB_ERRORS (was: 1 scrub errors)
2019-08-09 08:45:01.105055 mon.prx01 mon.0 10.10.3.151:6789/0 3069 : cluster [INF] Health check cleared: PG_DAMAGED (was: Possible data damage: 1 pg inconsistent)
2019-08-09 08:45:01.105103 mon.prx01 mon.0 10.10.3.151:6789/0 3070 : cluster [INF] Cluster is now healthy
2019-08-09 08:44:52.972941 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16092 : cluster [DBG] pgmap v16096: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 44.9KiB/s rd, 156KiB/s wr, 23op/s
2019-08-09 08:44:54.993444 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16093 : cluster [DBG] pgmap v16097: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 64.4KiB/s rd, 233KiB/s wr, 32op/s
2019-08-09 08:44:57.013227 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16094 : cluster [DBG] pgmap v16098: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 45.5KiB/s rd, 173KiB/s wr, 29op/s
2019-08-09 08:44:59.032939 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16095 : cluster [DBG] pgmap v16099: 128 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 127 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 38.3KiB/s rd, 163KiB/s wr, 26op/s
2019-08-09 08:45:01.053997 mgr.prx01 client.22276741 10.10.3.151:0/4173536751 16096 : cluster [DBG] pgmap v16100: 128 pgs: 128 active+clean; 1.82TiB data, 5.44TiB used, 14.2TiB / 19.6TiB avail; 410KiB/s rd, 337KiB/s wr, 119op/s; 338KiB/s, 0objects/s recovering
2019-08-09 08:44:56.412409 osd.2 osd.2 10.10.3.153:6800/2479 23 : cluster [ERR] 1.5f repair 0 missing, 1 inconsistent objects
2019-08-09 08:44:56.412427 osd.2 osd.2 10.10.3.153:6800/2479 24 : cluster [ERR] 1.5f repair 1 errors, 1 fixed
We already repaired a lot of errors but new one are coutinously comming.
Any idea?