Hi,
I had a power surge at home a couple days ago and since then my proxmox server has stopped working. I've been reading lots of posts and documentation to try to find the reason but I can't find anything conclusive.
My setup is 5 hard drives: 1 ssd, 2 individual 500 gb in lvm-thin and 2 1TB in a pool with zfs raid1. I boot proxmox from a 32GB USB pen drive and I have an external USB hard drive for isos and generic storage. The system has been working flawlessly for the last couple of months.
My problems and conclusions so far:
1º The server boots and I can log in through SSH but no web GUI. The port is listening as well as opened on the firewall. I've tried to curl into it and I get:
The error I get seemed a problem with the SSL certificate, so I tried to regenerate the certificates following different posts and tutorials I found like this post:
https://forum.proxmox.com/threads/proxmox-6-gui-problem-certificate-has-been-revoked.60357/
I've also tried to restart the proxy, daemon etc. Analysing proxmox logs, everytime I try to log in I get this msg:
Also calls my attention after booting:
2º VMs set up to autoboot don't boot (apart from one LXC that seems to be on and off inconsistently) I've also tried to boot them manually through SSH but I get the same error in both cases:
3º I've also tested for bad sectors as well as memtest++ to see if any of my dims were dead. Everything seemed okay.
4º The zfs pool with the 2 1TB hard drives is gone. The hard drives are loaded but the pool is not found. (zpool status -v shows "no pools available")
I've tried to import it manually as well as many other options like:
None worked. I've tried to restart the .cache zpool service, delete and create it again, restart the pool name service.... nothing seems to work. The pool is not present anywhere.
5º I've also tried to boot from a proxmox .iso in rescue mode, hoping for at least accessing all the data, but I face the same problems. No GUI, no zpool etc.
I'm honestly running out of options. I'm not the kind of person that asks for help on the first problem, but I really don't know what else to check.
Seeing all the posts here and what is usually requested, I attach the info that I think it will be useful (I'm running the last version of proxmox as well as all packages updated)
Thanks in advance for any help
I had a power surge at home a couple days ago and since then my proxmox server has stopped working. I've been reading lots of posts and documentation to try to find the reason but I can't find anything conclusive.
My setup is 5 hard drives: 1 ssd, 2 individual 500 gb in lvm-thin and 2 1TB in a pool with zfs raid1. I boot proxmox from a 32GB USB pen drive and I have an external USB hard drive for isos and generic storage. The system has been working flawlessly for the last couple of months.
My problems and conclusions so far:
1º The server boots and I can log in through SSH but no web GUI. The port is listening as well as opened on the firewall. I've tried to curl into it and I get:
curl: (35) OpenSSL SSL_connect: SSL_ERROR_SYSCALL in connection to 192.168.1.90:8006
The error I get seemed a problem with the SSL certificate, so I tried to regenerate the certificates following different posts and tutorials I found like this post:
https://forum.proxmox.com/threads/proxmox-6-gui-problem-certificate-has-been-revoked.60357/
I've also tried to restart the proxy, daemon etc. Analysing proxmox logs, everytime I try to log in I get this msg:
pveproxy worker[1234] general protection fault ip:7f7c3cd585f7 sp:7fffd13afd8f error:0 in libssl.so.1.1[7f7c3cd35000+4d000]
Also calls my attention after booting:
bug kernel: traps: dbus-daemon[1350] general protection fault ip:7f1baf1a5206 sp:7ffc34b7aff8 error:0 in libc-2.28.so[7f1baf12f000+148000]
2º VMs set up to autoboot don't boot (apart from one LXC that seems to be on and off inconsistently) I've also tried to boot them manually through SSH but I get the same error in both cases:
org.freedesktop.DBus.Error.Disconnected: Connection is closed
3º I've also tested for bad sectors as well as memtest++ to see if any of my dims were dead. Everything seemed okay.
4º The zfs pool with the 2 1TB hard drives is gone. The hard drives are loaded but the pool is not found. (zpool status -v shows "no pools available")
I've tried to import it manually as well as many other options like:
zpool import -D -f (poolname)
zpool import -a
None worked. I've tried to restart the .cache zpool service, delete and create it again, restart the pool name service.... nothing seems to work. The pool is not present anywhere.
5º I've also tried to boot from a proxmox .iso in rescue mode, hoping for at least accessing all the data, but I face the same problems. No GUI, no zpool etc.
I'm honestly running out of options. I'm not the kind of person that asks for help on the first problem, but I really don't know what else to check.
Seeing all the posts here and what is usually requested, I attach the info that I think it will be useful (I'm running the last version of proxmox as well as all packages updated)
Thanks in advance for any help
Attachments
Last edited: