I've had three disks go out on two different Proxmox servers in the past two days. Not sure what's up. Could be coincidence, maybe not. Just thought I'd check if anyone else is having problems due to kernel updates, patches, etc. Both boxes are fully patched running pve-manager/3.4-11/6502936f (running kernel: 2.6.32-43-pve)
On box #1 running Areca arc-1224:
I was restoring a VM from dump. About 50% through, Areca tossed one of the SSDs (Samsung EVO 1TB) from the RAID1 array. The box was completely locked up. Could not ssh in or login from console. Picture of the console messages is below. Weird that it failed that hard.... I rebooted (power button) and added a new hotspare.
The following day after the array was rebuilt, I tried the VM restore again. About 50% through and Areca tossed another disk from the same array. After adding *another* new disk, the array started rebuilding. I went home. We came in this morning and yet another disk was tossed from the array but the machine was responding. After the second disk went out yesterday, just for good measure, I updated the Areca firmware to latest 1.52 2014-12-26 and have rebooted.
Box #2 is running Areca 1882:
This morning it tossed a SATA disk from it's array. I've replaced that and things are quiet... for now.
This could all be wild coincidence, but generally speaking, these boxes have been running pretty much flawlessly for a couple years (with an occasional disk failure). Proxmox has been performing beyond expectation. Just wondering if anyone else is having problems too? I may boot to a previous kernel (and arcmsr driver) if it happens again.
On box #1 running Areca arc-1224:
I was restoring a VM from dump. About 50% through, Areca tossed one of the SSDs (Samsung EVO 1TB) from the RAID1 array. The box was completely locked up. Could not ssh in or login from console. Picture of the console messages is below. Weird that it failed that hard.... I rebooted (power button) and added a new hotspare.
The following day after the array was rebuilt, I tried the VM restore again. About 50% through and Areca tossed another disk from the same array. After adding *another* new disk, the array started rebuilding. I went home. We came in this morning and yet another disk was tossed from the array but the machine was responding. After the second disk went out yesterday, just for good measure, I updated the Areca firmware to latest 1.52 2014-12-26 and have rebooted.
Box #2 is running Areca 1882:
This morning it tossed a SATA disk from it's array. I've replaced that and things are quiet... for now.
This could all be wild coincidence, but generally speaking, these boxes have been running pretty much flawlessly for a couple years (with an occasional disk failure). Proxmox has been performing beyond expectation. Just wondering if anyone else is having problems too? I may boot to a previous kernel (and arcmsr driver) if it happens again.