my ceph.monitor is skewed !

Mar 20, 2020
47
14
8
50
France
Hi all,

following my installation (details here) : of Ceph on Proxmox for multi purpose (object bucket & HCI VM for K8s cluster), I'm facing well skewing :

clock skew detected on mon.srv158ilo151, mon.srv161ilo150mon.srv158ilo151 clock skew 0.79889s > max 0.05s (latency 0.000515921s)
mon.srv161ilo150 clock skew 0.637552s > max 0.05s (latency 0.00053284s)

on 2 of my 3 monitors. It's not a final installation just first POC iteration of a future product. But it's on a production environnement and it's not possible to use internet NTP server for security concern. So my NTP server a inside server, furnished by IT and... I expect it's an Active Directory server so drifting is the rule.

I've read quite a few post relative to problems relative to this time shifting on proxmox - Ceph and resolution is a bit confusing for a noob like me (noob at debian stuff not HCI). And some people I know that are working w/ NTP on production datacenter gave me other advices, powering up the confusion. So since I cannot directly connect to internet on this network, I have 2 choices as far as I understand :

moving to Chrony (but how ? should I put another NTP server try likne this w/ my AD server ?) or
moving to NTPD : using my 3 nodes to mount a proximity NTPD architecture with my AD alone as a "root" for those server. This way even if it's drifting on AD side, my cluster will be synchrone for itself which is the most important thing to achieve.

Did I miss someting ? Do you have some pros and cons on thoses scenarii ? should I give a try to chrony at first and then NTPD ?
 
The important part is, that the time for all nodes must be the same. If there is a skew between the nodes, that's when Ceph starts to complain.

If systemd-timesyncd is not enough for you, you can disable/mask that service and switch over to Chrony without much of a problem.
 
Last edited:
  • Like
Reactions: virttom
The important part is, that the time for all nodes must be the same. If there is a skew between the nodes, that's when Ceph starts to complain.

If systemd-timesyncd is not enough for you, you can disable/mask that service and switch over to Chrony with much of a problem.

ok, ! before doing that, i tryed to investigate this strange behaviour. and i tryed to investigate as stated in other thread :

root@srv158ilo151:~# timedatectl status
Local time: Thu 2021-03-04 11:59:10 CET
Universal time: Thu 2021-03-04 10:59:10 UTC
RTC time: Thu 2021-03-04 10:59:10
Time zone: Europe/Paris (CET, +0100)
System clock synchronized: no
NTP service: inactive
RTC in local TZ: no


how can I fix that ? didn't find answers on this.
 
ok, ! before doing that, i tryed to investigate this strange behaviour. and i tryed to investigate as stated in other thread :

root@srv158ilo151:~# timedatectl status
Local time: Thu 2021-03-04 11:59:10 CET
Universal time: Thu 2021-03-04 10:59:10 UTC
RTC time: Thu 2021-03-04 10:59:10
Time zone: Europe/Paris (CET, +0100)
System clock synchronized: no
NTP service: inactive
RTC in local TZ: no


how can I fix that ? didn't find answers on this.
Could you solve it? How did you get it?
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!