Hi there. About two weeks ago, a host bus adapter on my single node PVE server died. That seemed to throw things into chaos and my system become unresponsive - I could login or do anything even at a local console connected directly to the server. I ended up having to power down and up again, at which point I was thrown into emergency mode. I then saw that a lot of my drives had not been mounted, and figured out the HBA had failed (an LSI 9207-8i).
In any event, I finally received the replacement today and replaced the failed HBA. All the drives in /etc/fstab mounted just fine. However, I did encounter two issues:
First, while I can login just fine at the console or through SSL as root, I can no longer login to the webgui, either as root@pam or another user I had set up using PVE authentication. I have 2FA turned on. I checked and rechecked my user ID, password, and 2FA number and tried to login multiple times either as root@pam or user@pve, to no avail. Because I'm using 2FA, thought it might be something to do with the clock on the server. However, I checked the system time on the server and that on my phone and they look to be within one second of each other.
shows the following:
I tried the suggestions in this post, but none of the suggestions there worked. I suspect it has something to do with 2FA, but don't know how exactly to diagnose or resolve the problem.
Second, I had manually created a RAID1 zpool through SSH on two of the drives that were attached through the replaced HBA using this command:
I can see the two drives in
. They show as unmounted. But I don't see the zpool when I run
. The mountpoint /zdata is still there, but not surprisingly it shows as being empty. I'm a bit nervous about this as the pool contained some rather important data. Most of it is backed up, but would strongly prefer some method of reinstating the pool with the existing drives if at all possible, as there's several terabytes of data and recovery from cloud backups will take a long, long time. If anyone has any suggestions as to how to properly restore the zpool, I'd be most grateful.
I'd perhaps also be interested if anyone has any comments on ZFS and why it did not automatically restore that zpool. The other drives I had which are mounted through /etc/fstab mounted just fine. It's just the ZFS pool that seems to have disappeared.
Any advice would be most appreciated. I'm running PVE 7.2-7.
In any event, I finally received the replacement today and replaced the failed HBA. All the drives in /etc/fstab mounted just fine. However, I did encounter two issues:
First, while I can login just fine at the console or through SSL as root, I can no longer login to the webgui, either as root@pam or another user I had set up using PVE authentication. I have 2FA turned on. I checked and rechecked my user ID, password, and 2FA number and tried to login multiple times either as root@pam or user@pve, to no avail. Because I'm using 2FA, thought it might be something to do with the clock on the server. However, I checked the system time on the server and that on my phone and they look to be within one second of each other.
Code:
systemctl status pvedaemon
Code:
● pvedaemon.service - PVE API Daemon
Loaded: loaded (/lib/systemd/system/pvedaemon.service; enabled; vendor preset: enabled)
Active: active (running) since Wed 2022-10-05 11:21:38 EDT; 5h 44min ago
Process: 6945 ExecStart=/usr/bin/pvedaemon start (code=exited, status=0/SUCCESS)
Main PID: 7049 (pvedaemon)
Tasks: 4 (limit: 154579)
Memory: 207.7M
CPU: 4.640s
CGroup: /system.slice/pvedaemon.service
├─7049 pvedaemon
├─7050 pvedaemon worker
├─7051 pvedaemon worker
└─7052 pvedaemon worker
Oct 05 16:23:17 fava2 pvedaemon[7051]: authentication failure; rhost=::ffff:10.0.10.1 user=root@pam msg=Authentication failure
Oct 05 16:23:45 fava2 pvedaemon[7052]: authentication failure; rhost=::ffff:10.0.10.1 user=user@pve msg=invalid credentials
I tried the suggestions in this post, but none of the suggestions there worked. I suspect it has something to do with 2FA, but don't know how exactly to diagnose or resolve the problem.
Second, I had manually created a RAID1 zpool through SSH on two of the drives that were attached through the replaced HBA using this command:
Code:
zpool create -f zdata mirror /dev/disk/by-id/[id of first drive] /dev/disk/by-id/[id of first drive]
I can see the two drives in
Code:
lsblk
Code:
zpool status
I'd perhaps also be interested if anyone has any comments on ZFS and why it did not automatically restore that zpool. The other drives I had which are mounted through /etc/fstab mounted just fine. It's just the ZFS pool that seems to have disappeared.
Any advice would be most appreciated. I'm running PVE 7.2-7.