[SOLVED]ceph Module 'telemetry' has experienced an error

elmacus

Renowned Member
Mar 20, 2011
71
3
73
Hi.
I got Health_Err in gui for telemetry.ceph.com:
#ceph telemetry off

Error EIO: Module 'telemetry' has experienced an error and cannot handle commands: HTTPSConnectionPool(host='telemetry.ceph.com', port=443): Max retries exceeded with url: /report (Caused by NewConnectionError('<urllib3.connection.VerifiedHTTPSConnection object at 0x7fe3bd98ef10>: Failed to establish a new connection: [Errno 110] Connection timed out',))

# ping telemetry.ceph.com
PING telemetry.ceph.com (8.43.84.137) 56(84) bytes of data.
64 bytes from ip-8-43-84-137.rdu2c.ceph.com (8.43.84.137): icmp_seq=1 ttl=49 time=119 ms
64 bytes from ip-8-43-84-137.rdu2c.ceph.com (8.43.84.137): icmp_seq=2 ttl=49 time=119 ms

Started 4 Mars, same day as LetsEncrypt caos, connected problem ? (i dont use LetsEncypt for gui, yet)

Anyone had similar problem ?
Howto turn Telemetry off until problem is fixed ?
Proxmox 6.1-5
 
Sorry I've somehow missed the part with the off command.
I can establish a connection via browser without any certificate errors, so I guess this is either something specific to your network or already solved.
What does the following print:
# ceph mgr module disable telemetry
 
Well, this turned off telemetry and silenced the gui err: ceph mgr module disable telemetry

Still why it fails i dont know, i will not activate telemetry again until i understand why it failed.
 
They upgraded the telemetry server VM and then this error message popped up. A restart of the mgr daemon solves the issue.