Hmm, very strange...
I suggest configuring remote logging, so you have fresher information if the reboots occur again.
And maybe subscribing to official support from Proxmox.
Did it happen only once?
Ok.
Did you set any special parameters or install anything on the servers that doesn't come from the PVE ISO installer?
Has the server previously run another OS, and did it show any problems?
Did you check the hardware logs for over-temperature events or a manual hard reset?
I understand, but it is very difficult to make a diagnosis here. I saw no evidence in the logs. Maybe somebody else will have an idea?
What hardware are your servers running on?
Hello.
I think offline is not a concern. The installer uses the packages on the ISO.
Personally, I use the option to fetch the TOML configuration from a URL, which is very practical.
Did you read the whole doc? https://pve.proxmox.com/wiki/Automated_Installation
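To illustrate the URL approach, here is a minimal sketch of a server that hands out an answer file. It assumes, as I understand the wiki page, that the installer sends an HTTP POST with a JSON body describing the machine and expects the TOML answer file in the response. The names `make_handler` and `serve`, the file name `answer.toml` and the port are hypothetical, so check the details against the wiki before relying on it.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def make_handler(answer: bytes):
    """Build a handler class that returns `answer` for every POST request."""

    class AnswerHandler(BaseHTTPRequestHandler):
        def do_POST(self):
            # Read the request body (assumed to be JSON with system information).
            length = int(self.headers.get("Content-Length", 0))
            body = self.rfile.read(length)
            try:
                info = json.loads(body or b"{}")
            except json.JSONDecodeError:
                info = {}
            if not isinstance(info, dict):
                info = {}
            # Log who asked and which top-level fields were sent.
            print("request from", self.client_address[0], "fields:", sorted(info))

            # Send the same answer file back to every host.
            self.send_response(200)
            self.send_header("Content-Type", "application/toml")
            self.send_header("Content-Length", str(len(answer)))
            self.end_headers()
            self.wfile.write(answer)

        def log_message(self, fmt, *args):
            pass  # silence the default per-request log line

    return AnswerHandler


def serve(answer_path: str, port: int = 8000) -> None:
    """Serve the answer file at `answer_path` until interrupted."""
    with open(answer_path, "rb") as fh:
        answer = fh.read()
    HTTPServer(("0.0.0.0", port), make_handler(answer)).serve_forever()
```

Calling `serve("answer.toml")` on a machine reachable from the installer would be enough for a first test. A real setup could pick a different answer file per host from the posted JSON, and should use HTTPS as described in the documentation.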
Hello.
How many disks do you have per server?
Is there only one 10Gb network for Ceph?
Did you run a comparative test with another OS or on the host?
Hello Aaron.
Thank you very much for this explanation.
You saved us several hours of extensive tests.
Now, do you think there is a workaround if this case occurs, like shutting down each server in a datacenter?
Good job on the documentation...
Ah, do you test cluster fencing?
Me too, and without any problem.
But molly-guard is only enabled and effective in interactive SSH sessions.
Thus it shouldn't interfere with internal Proxmox mechanisms (non-interactive mode).
BUT ...
Awesome news! We will get it into our official testing pipeline ASAP!
Great work!
Blockbridge : Ultra low latency all-NVME shared storage for Proxmox - https://www.blockbridge.com/proxmox