I am running Proxmox on a Dell R210 II, which I recently upgraded to four Kingston A400 SSDs in RAID 10 on the H200 RAID controller. After the machine had run for a few hours, it encountered an error and fell back to a read-only root filesystem (I've seen some answers that suggest changing the mount options in /etc/fstab to
errors=continue
, but I'd rather identify and fix the source of the problem). I ran fsck and was able to fix the problems, but I want to prevent this from happening again when I am not around to fix it.
Potential Cause 1: Kingston Firmware. I bought the four SSDs off Amazon, and unfortunately the cheapest seller limited purchases to three units, so I had to buy the fourth from a different seller. When I set these up in RAID 10, I noticed that the first three drives are running the same firmware version (SA400S30107) but the fourth drive is running a different one (SA400S30009). See the screenshot here: https://imgur.com/a/Y5TI3tO. I'm not sure whether or how to run a firmware update on them, given that the disks are in a RAID configuration and, to the best of my knowledge, the H200 does not support disk passthrough.
Potential Cause 2: Power Saving Issues. I found a similar problem on Ubuntu (see https://askubuntu.com/a/1113288) that has to do with the power-saving features of NVMe drives. I don't think this is the problem, since it pertains to NVMe drives and my SSDs are SATA. I'm including it for completeness. The link recommends passing the following GRUB option at boot:
nvme_core.default_ps_max_latency_us=6000
.
Potential Cause 3: Time Synchronization Issues. Unfortunately, I cannot find the post I got this possible cause from. However, I vaguely remember that issues can stem from differences between the hardware clock and the time set by the NTP server. I'm putting this here in case someone has more thoughts on it.
I would appreciate any advice on identifying and fixing this problem.
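In case it helps with narrowing this down, here is a rough sketch of how the kernel messages around the read-only remount could be pulled out for posting. It assumes the root filesystem is ext4 and that journalctl is available on the host; the function names and the pattern list are my own illustration, not something taken from the linked answers, and other filesystems or drivers may word their messages differently.

```python
import re
import subprocess

# Message fragments the Linux kernel commonly logs around storage trouble.
# Assumption: ext4 root filesystem; this list is illustrative, not exhaustive.
PATTERNS = re.compile(
    r"EXT4-fs error|Remounting filesystem read-only|I/O error",
    re.IGNORECASE,
)


def find_storage_errors(log_text):
    """Return the log lines that match any of the patterns above."""
    return [line for line in log_text.splitlines() if PATTERNS.search(line)]


def kernel_log(boot="0"):
    """Return kernel messages for one boot as text.

    boot="0" is the current boot and boot="-1" is the previous one.
    Older boots are only available if the journal is stored persistently.
    """
    result = subprocess.run(
        ["journalctl", "-k", "-b", boot, "--no-pager"],
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout
```

Printing the result of find_storage_errors(kernel_log()) while the machine is still up in its read-only state is probably the most useful run, since a journal that lives on the affected filesystem may not have been written to disk once the root went read-only.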