Hello folks,
I have problems with high SSD usage.
It has been approx. 5 days since I installed 3 Samsung 850 Pro 512GB SSDs in RAIDZ with Proxmox 4.0. The SSDs are connected to the onboard SATA ports, no RAID card.
I use ZFS as the filesystem (not zvol). I use the full RAID array to store my VMs (qcow2, VirtIO SCSI, fixed memory). I store my backups on a separate spinning 1TB disk.
I have 48GB of memory in total and have assigned 36GB of it to VMs. I can upgrade the memory to 72GB if needed.
My local storage holds 75GB, which includes VMs, snapshots, and ISOs. I have already had to redo the server 2 times (change of RAID setup).
My problem is the high SSD usage that I am seeing. I copied some SMART values below.
If I estimate my own usage so far, I would say that I wrote 1TB or less in the 94 hours the drives have been powered on.
According to SMART 177 below, I wrote 4.5TB. That is 49GB/hour!!!
That leaves approx. 3.5TB of writes that did not come from me copying stuff onto the server, creating VMs, etc. They were generated by the system and the running VMs.
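For anyone who wants to check my math, here is a rough sketch of how I get from the SMART values to those numbers. It assumes that one step of Wear_Leveling_Count equals one full write of the 512GB drive and that 1TB = 1024GB. Both are my assumptions, and the function name is made up for illustration.

```python
def estimate_nand_writes(wear_leveling_raw, capacity_gb, power_on_hours):
    """Estimate total writes (TB) and write rate (GB/hour) for one drive.

    Assumption: each Wear_Leveling_Count step is one full drive write.
    """
    total_gb = wear_leveling_raw * capacity_gb
    return total_gb / 1024, total_gb / power_on_hours


# Raw values of SMART 177 and SMART 9 from the 850 Pro listing.
total_tb, gb_per_hour = estimate_nand_writes(9, 512, 94)
print(f"{total_tb:.1f} TB written, {gb_per_hour:.0f} GB/hour")
# prints "4.5 TB written, 49 GB/hour"
```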
Sorry for the formatting:
Code:
SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   099   099   000    Old_age   Always       -       94
 12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       4
177 Wear_Leveling_Count     0x0013   099   099   000    Pre-fail  Always       -       9
For reference ONLY, here are the values from the Samsung 840 Pro in my laptop:
Code:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   100   100   ---    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   099   099   ---    Old_age   Always       -       2607
 12 Power_Cycle_Count       0x0032   099   099   ---    Old_age   Always       -       721
177 Wear_Leveling_Count     0x0013   099   099   ---    Pre-fail  Always       -       17
The problem is that if this keeps up, I will reach the 300TBW warranty limit in only about 260 days!!! That is far short of the 10 years I could have (I am hoping to keep the drives in operation for at least 5 years and stay within the warranty period/TBW).
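The 260 days is just the TBW limit divided by the write rate. A minimal sketch, assuming the rate stays constant at 49GB/hour and 1TB = 1024GB (the function name is only for illustration):

```python
def days_until_tbw(tbw_limit_tb, gb_per_hour):
    """Days until the rated TBW is reached at a constant write rate."""
    gb_per_day = gb_per_hour * 24
    return tbw_limit_tb * 1024 / gb_per_day


days = days_until_tbw(300, 49)
print(f"{days:.0f} days")  # prints "261 days"
```

With 1TB counted as 1000GB instead, the result drops to about 255 days, so either way it is well under a year.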
My Proxmox hypervisor currently has 3 guest OSes installed on it: SBS2011, Win2k3Std, and Win7Pro. None of them is very active. I included a picture of the average daily IO; I hope that helps.
According to this article (http://www.anandtech.com/show/8239/update-on-samsung-850-pro-endurance-vnand-die-size), a Samsung 850 Pro can apparently do 6000 cycles (based on the author's test using SMART 177, the calculated result was 6000 cycles). That would give me a total of 6.3 years, based on a write rate of 50GB/hour.
The problem with that is that an SSD can die long before reaching the 6000-cycle count. I am monitoring SMART, but it is uncomfortable for me to gamble like this in a production environment.
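In case it helps anyone following along, this is roughly how the two values I am watching could be pulled out of the attribute table. It is only a sketch: it assumes the text comes from something like `smartctl -A /dev/sdX`, that the raw values are plain integers, and the helper name is made up. It also strips the forum's bold tags so it works on a pasted listing.

```python
import re


def parse_smart_attributes(text):
    """Map attribute ID -> (name, raw value) for rows of a SMART attribute table."""
    attrs = {}
    for line in text.splitlines():
        # Drop forum bold tags ([B] and [/B]) before splitting into columns.
        fields = re.sub(r"\[/?B\]", "", line).split()
        # Attribute rows have 10 columns, a numeric ID, and (assumed) an integer raw value.
        if len(fields) >= 10 and fields[0].isdigit() and fields[9].isdigit():
            attrs[int(fields[0])] = (fields[1], int(fields[9]))
    return attrs


sample = """\
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
[B] 9 Power_On_Hours 0x0032 099 099 000 Old_age Always - 94[/B]
177 Wear_Leveling_Count 0x0013 099 099 000 Pre-fail Always - 9
"""
attrs = parse_smart_attributes(sample)
print(attrs[9], attrs[177])
```

Logging these two numbers once a day would show whether the write rate stays near 49GB/hour or changes over time.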
The good thing is that I can still return the SSDs if this does not work out. However, I want to use them if that is possible. The Win2k3 VM hosts a database, so IO performance is needed.
How can I fix this? Where should I look? I do not find the wiki too helpful, unless I am looking in all the wrong places.
I am comfortable with the command line, but I do not have too much experience with Linux in general (I am more of a Windows guy).
Please help.
THANK YOU!!!