my ceph.monitor is skewed !

Mar 20, 2020
47
15
8
51
France
Hi all,

following my installation (details here) : of Ceph on Proxmox for multi purpose (object bucket & HCI VM for K8s cluster), I'm facing well skewing :

clock skew detected on mon.srv158ilo151, mon.srv161ilo150mon.srv158ilo151 clock skew 0.79889s > max 0.05s (latency 0.000515921s)
mon.srv161ilo150 clock skew 0.637552s > max 0.05s (latency 0.00053284s)

on 2 of my 3 monitors. It's not a final installation just first POC iteration of a future product. But it's on a production environnement and it's not possible to use internet NTP server for security concern. So my NTP server a inside server, furnished by IT and... I expect it's an Active Directory server so drifting is the rule.

I've read quite a few post relative to problems relative to this time shifting on proxmox - Ceph and resolution is a bit confusing for a noob like me (noob at debian stuff not HCI). And some people I know that are working w/ NTP on production datacenter gave me other advices, powering up the confusion. So since I cannot directly connect to internet on this network, I have 2 choices as far as I understand :

moving to Chrony (but how ? should I put another NTP server try likne this w/ my AD server ?) or
moving to NTPD : using my 3 nodes to mount a proximity NTPD architecture with my AD alone as a "root" for those server. This way even if it's drifting on AD side, my cluster will be synchrone for itself which is the most important thing to achieve.

Did I miss someting ? Do you have some pros and cons on thoses scenarii ? should I give a try to chrony at first and then NTPD ?
 
The important part is, that the time for all nodes must be the same. If there is a skew between the nodes, that's when Ceph starts to complain.

If systemd-timesyncd is not enough for you, you can disable/mask that service and switch over to Chrony without much of a problem.
 
Last edited:
  • Like
Reactions: virttom
The important part is, that the time for all nodes must be the same. If there is a skew between the nodes, that's when Ceph starts to complain.

If systemd-timesyncd is not enough for you, you can disable/mask that service and switch over to Chrony with much of a problem.

ok, ! before doing that, i tryed to investigate this strange behaviour. and i tryed to investigate as stated in other thread :

root@srv158ilo151:~# timedatectl status
Local time: Thu 2021-03-04 11:59:10 CET
Universal time: Thu 2021-03-04 10:59:10 UTC
RTC time: Thu 2021-03-04 10:59:10
Time zone: Europe/Paris (CET, +0100)
System clock synchronized: no
NTP service: inactive
RTC in local TZ: no


how can I fix that ? didn't find answers on this.
 
ok, ! before doing that, i tryed to investigate this strange behaviour. and i tryed to investigate as stated in other thread :

root@srv158ilo151:~# timedatectl status
Local time: Thu 2021-03-04 11:59:10 CET
Universal time: Thu 2021-03-04 10:59:10 UTC
RTC time: Thu 2021-03-04 10:59:10
Time zone: Europe/Paris (CET, +0100)
System clock synchronized: no
NTP service: inactive
RTC in local TZ: no


how can I fix that ? didn't find answers on this.
Could you solve it? How did you get it?