[SOLVED] Avoid Proxmox "cache" on NVME

kotakomputer

If Proxmox running on an NVMe drive hangs (all icons grayed out) and I do a hard restart, the VPS (in my case an LXC container) goes back to its state from several days ago.
So any websites in the VPS also go back to several days ago.

How to avoid this?

NB:
Proxmox 5.4.3
VPS created on a secondary NVMe disk with an ext4 file system
 
Code:
~# smartctl -a /dev/nvme0n1
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.18-12-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       V-GEN03SM19EG1TP3X4IT
Serial Number:                      AA000000000000000060
Firmware Version:                   R1115A0
PCI Vendor/Subsystem ID:            0x126f
IEEE OUI Identifier:                0x000000
Controller ID:                      1
Number of Namespaces:               1
Namespace 1 Size/Capacity:          1,024,209,543,168 [1.02 TB]
Namespace 1 Formatted LBA Size:     512
Local Time is:                      Fri Sep 13 23:58:22 2019 WIB
Firmware Updates (0x12):            1 Slot, no Reset required
Optional Admin Commands (0x0007):   Security Format Frmw_DL
Optional NVM Commands (0x001f):     Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat
Maximum Data Transfer Size:         64 Pages
Warning  Comp. Temp. Threshold:     70 Celsius
Critical Comp. Temp. Threshold:     80 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
0 +     9.00W       -        -    0  0  0  0        0       0

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
0 +     512       0         0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
- media has been placed in read only mode

SMART/Health Information (NVMe Log 0x02, NSID 0x1)
Critical Warning:                   0x08
Temperature:                        42 Celsius
Available Spare:                    64%
Available Spare Threshold:          10%
Percentage Used:                    0%
Data Units Read:                    2,659,664 [1.36 TB]
Data Units Written:                 8,028,053 [4.11 TB]
Host Read Commands:                 41,304,790
Host Write Commands:                187,204,171
Controller Busy Time:               49,043,852
Power Cycles:                       46
Power On Hours:                     1,400
Unsafe Shutdowns:                   28
Media and Data Integrity Errors:    0
Error Information Log Entries:      34
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0

Read Error Information Log failed: NVMe Status 0x02

#

Time to replace it with a new one :)
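
For anyone wondering why: the Critical Warning value of 0x08 in the SMART output is a bit field, and bit 3 is the flag that smartctl spells out as "media has been placed in read only mode". Below is a rough sketch of decoding that byte in shell, assuming the bit layout of the NVMe SMART / Health Information log. The function name decode_critical_warning is made up for illustration and is not part of smartmontools.

```shell
# Decode the NVMe "Critical Warning" byte printed by `smartctl -a`.
# Assumption: bits 0-4 as defined for the SMART / Health Information log
# (Log 0x02). decode_critical_warning is a hypothetical helper name.
decode_critical_warning() {
    cw=$(( $1 ))    # accepts hex (0x08) or decimal (8)
    if [ "$cw" -eq 0 ]; then
        echo "no critical warnings"
        return 0
    fi
    # Print one line per bit that is set.
    if [ $(( cw & 0x01 )) -ne 0 ]; then echo "available spare below threshold"; fi
    if [ $(( cw & 0x02 )) -ne 0 ]; then echo "temperature outside threshold"; fi
    if [ $(( cw & 0x04 )) -ne 0 ]; then echo "reliability degraded"; fi
    if [ $(( cw & 0x08 )) -ne 0 ]; then echo "media in read-only mode"; fi
    if [ $(( cw & 0x10 )) -ne 0 ]; then echo "volatile memory backup failed"; fi
    return 0
}
```

Calling `decode_critical_warning 0x08` with the value from this thread prints the read-only line only.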
 
Consider a used Intel data center grade NVMe drive. It needs to be one of the DC models.

Note: if you buy used, make sure it can be returned in case smartctl shows a failed health check or a high Percentage Used value.
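
A rough sketch of that check in shell, assuming the report has the same format as the smartctl output earlier in this thread. The function name check_used_nvme and the 20% default cutoff are my own placeholders, not a vendor recommendation.

```shell
# Read `smartctl -a` output on stdin and say whether a used NVMe drive
# looks acceptable. check_used_nvme is a hypothetical helper; the default
# cutoff of 20 (percent) is an assumed value, adjust to taste.
check_used_nvme() {
    max_used=${1:-20}
    report=$(cat)
    # Text after the overall-health label, e.g. "PASSED" or "FAILED!".
    health=$(printf '%s\n' "$report" |
        sed -n 's/^SMART overall-health self-assessment test result: *//p')
    # Number from the "Percentage Used:" line, without the % sign.
    used=$(printf '%s\n' "$report" |
        sed -n 's/^Percentage Used: *\([0-9][0-9]*\)%.*/\1/p')
    if [ "$health" != "PASSED" ]; then
        echo "REJECT: health is '${health:-unknown}'"
        return 0
    fi
    if [ -z "$used" ]; then
        echo "REJECT: no Percentage Used value found"
        return 0
    fi
    if [ "$used" -gt "$max_used" ]; then
        echo "REJECT: Percentage Used ${used}% exceeds ${max_used}%"
        return 0
    fi
    echo "OK: health PASSED, Percentage Used ${used}%"
    return 0
}
```

Usage would be something like `smartctl -a /dev/nvme0n1 | check_used_nvme 10`. The drive from the first post would be rejected on the health line even though its Percentage Used is 0%, which is why both values are checked.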