Hi guys,
I have been working on Proxmox VE 5.1-41 for a couple of days.
It's been the second time I encoure this issue, I'm wondering if anybody have heard about it.
It's a 3 nodes cluster , classical setup.
Installation works well, network works well, corrum and everything works like a charm.
I installed few VM to do some testing , all works great, and from nowhere , the cluster start crashing.
The crash cause this :
- PVE binaries are deleted :
- PVE systemd files are removed
- PVE scripts in /usr/share/perl5/PVE/ are removed
10 files has been deleted
- PVE /usr/share/pve-manager/ directory is fully removed
This directory does not exist anymore
So this is terrible because it happen to ALL the nodes in the cluster.
I can't explain how PVE decide to kill himself and delete all those files /folder without human interaction .
It does not look like it has been hacked two times
Nobody updated the system or upgraded anything nor remove anything in the mind time
Have you ever heard about this ?
One time, i can tell myself, yes , i did a reboot who fucked everything , but two time, no way, the cluster was working grate.
Let me know if I can provide more log..
I was able to solve by taking files, folders and binaries from another working cluster , reboot and all is back but this is not ok for production unless we can find what really happen
Thanks a lot for you help.
Regards,
Ben
I have been working on Proxmox VE 5.1-41 for a couple of days.
It's been the second time I encoure this issue, I'm wondering if anybody have heard about it.
It's a 3 nodes cluster , classical setup.
Installation works well, network works well, corrum and everything works like a charm.
I installed few VM to do some testing , all works great, and from nowhere , the cluster start crashing.
The crash cause this :
- PVE binaries are deleted :
Code:
ls /usr/bin/|grep pve
pvecm
- PVE systemd files are removed
Code:
ls /lib/systemd/system|grep pve
pve-cluster.service
pve-container@.service
pve-firewall.service
pvefw-logger.service
pve-ha-crm.service
pve-ha-lrm.service
system-pve\x2dcontainer.slice
- PVE scripts in /usr/share/perl5/PVE/ are removed
10 files has been deleted
Code:
> API2.pm
> API2Tools.pm
> APLInfo.pm
> AutoBalloon.pm
> CephTools.pm
> HTTPServer.pm
> pvecfg.pm
> Report.pm
> Status
> VZDump.pm
- PVE /usr/share/pve-manager/ directory is fully removed
This directory does not exist anymore
So this is terrible because it happen to ALL the nodes in the cluster.
I can't explain how PVE decide to kill himself and delete all those files /folder without human interaction .
It does not look like it has been hacked two times
Nobody updated the system or upgraded anything nor remove anything in the mind time
Have you ever heard about this ?
One time, i can tell myself, yes , i did a reboot who fucked everything , but two time, no way, the cluster was working grate.
Let me know if I can provide more log..
I was able to solve by taking files, folders and binaries from another working cluster , reboot and all is back but this is not ok for production unless we can find what really happen
Thanks a lot for you help.
Regards,
Ben