zfs-import status=1/FAILURE (kingston nvme errors)

Poet

New Member
Oct 19, 2022
9
0
1
Hello together,

I got a few ZFS and NVMe errors the last 3 days since I've build my server.

Here is my server config:

Bildschirmfoto 2022-10-20 um 10.21.27.png

Here are a few log files:

Todays boot log:

Bash:
[...]
Oct 20 10:02:04 proxmox systemd[1]: Finished Helper to synchronize boot up for ifupdown.
Oct 20 10:02:04 proxmox systemd[1]: Finished Wait for udev To Complete Device Initialization.
Oct 20 10:02:04 proxmox systemd[1]: Starting Import ZFS pools by cache file...
Oct 20 10:02:04 proxmox systemd[1]: Condition check resulted in Import ZFS pools by device scanning being skipped.
Oct 20 10:02:04 proxmox systemd[1]: Starting Import ZFS pool nvmepool...
Oct 20 10:02:04 proxmox zpool[1223]: cannot import 'nvmepool': no such pool available
Oct 20 10:02:04 proxmox systemd[1]: zfs-import@nvmepool.service: Main process exited, code=exited, status=1/FAILURE
Oct 20 10:02:04 proxmox systemd[1]: zfs-import@nvmepool.service: Failed with result 'exit-code'.
Oct 20 10:02:04 proxmox systemd[1]: Failed to start Import ZFS pool nvmepool.
Oct 20 10:02:04 proxmox kernel:  zd0: p1 p2 p3
Oct 20 10:02:04 proxmox kernel:  zd16: p1 p2
Oct 20 10:02:04 proxmox kernel:  zd32: p1 p2
Oct 20 10:02:04 proxmox kernel:  zd48: p1 p2 < p5 >
Oct 20 10:02:04 proxmox kernel:  zd64: p1 p2 < p5 >
Oct 20 10:02:04 proxmox systemd[1]: Created slice system-lvm2\x2dpvscan.slice.
Oct 20 10:02:04 proxmox systemd[1]: Starting LVM event activation on device 230:3...
Oct 20 10:02:04 proxmox systemd[1]: Finished Import ZFS pools by cache file.
Oct 20 10:02:04 proxmox lvm[1982]:   pvscan[1982] /dev/zd0p3 excluded by filters: device is rejected by filter config.
Oct 20 10:02:04 proxmox systemd[1]: Reached target ZFS pool import target.
Oct 20 10:02:04 proxmox systemd[1]: Starting LVM event activation on device 230:18...
Oct 20 10:02:04 proxmox systemd[1]: Starting LVM event activation on device 230:34...
Oct 20 10:02:04 proxmox systemd[1]: Starting Mount ZFS filesystems...
Oct 20 10:02:04 proxmox systemd[1]: Starting Wait for ZFS Volume (zvol) links in /dev...
Oct 20 10:02:04 proxmox lvm[1983]:   pvscan[1983] /dev/zd16p2 excluded by filters: device is rejected by filter config.
Oct 20 10:02:04 proxmox lvm[1984]:   pvscan[1984] /dev/zd32p2 excluded by filters: device is rejected by filter config.
Oct 20 10:02:04 proxmox zvol_wait[1986]: Testing 5 zvol links
Oct 20 10:02:04 proxmox zvol_wait[1986]: All zvol links are now present.
Oct 20 10:02:04 proxmox systemd[1]: Finished Wait for ZFS Volume (zvol) links in /dev.
Oct 20 10:02:04 proxmox systemd[1]: Reached target ZFS volumes are ready.
Oct 20 10:02:04 proxmox systemd[1]: Finished Mount ZFS filesystems.
Oct 20 10:02:04 proxmox systemd[1]: Reached target Local File Systems.
[...]


general info about nvme:

Bash:
root@proxmox:~# nvme list
Node             SN                   Model                                    Namespace Usage                      Format           FW Rev
---------------- -------------------- ---------------------------------------- --------- -------------------------- ---------------- --------
/dev/nvme0n1     50026B7686069248     KINGSTON SKC3000S1024G                   1           1.02  TB /   1.02  TB    512   B +  0 B   EIFK31.6
/dev/nvme1n1     50026B7685EFFF01     KINGSTON SKC3000S1024G                   1           1.02  TB /   1.02  TB    512   B +  0 B   EIFK31.6

some info about: nvme0:

Bash:
root@proxmox:~# smartctl -a /dev/nvme0
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.30-2-pve] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       KINGSTON SKC3000S1024G
Serial Number:                      50026B7686069248
Firmware Version:                   EIFK31.6
PCI Vendor/Subsystem ID:            0x2646
IEEE OUI Identifier:                0x0026b7
Total NVM Capacity:                 1,024,209,543,168 [1.02 TB]
Unallocated NVM Capacity:           0
Controller ID:                      1
NVMe Version:                       1.4
Number of Namespaces:               1
Namespace 1 Size/Capacity:          1,024,209,543,168 [1.02 TB]
Namespace 1 Formatted LBA Size:     512
Namespace 1 IEEE EUI-64:            0026b7 6860692485
Local Time is:                      Wed Oct 19 14:41:46 2022 CEST
Firmware Updates (0x12):            1 Slot, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL Self_Test
Optional NVM Commands (0x005d):     Comp DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp
Log Page Attributes (0x08):         Telmtry_Lg
Maximum Data Transfer Size:         512 Pages
Warning  Comp. Temp. Threshold:     84 Celsius
Critical Comp. Temp. Threshold:     89 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     8.80W       -        -    0  0  0  0        0       0
 1 +     7.10W       -        -    1  1  1  1        0       0
 2 +     5.20W       -        -    2  2  2  2        0       0
 3 -   0.0620W       -        -    3  3  3  3     2500    7500
 4 -   0.0620W       -        -    4  4  4  4     2500    7500

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 +     512       0         2
 1 -    4096       0         1

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning:                   0x00
Temperature:                        22 Celsius
Available Spare:                    100%
Available Spare Threshold:          10%
Percentage Used:                    0%
Data Units Read:                    15,285 [7.82 GB]
Data Units Written:                 348,449 [178 GB]
Host Read Commands:                 699,273
Host Write Commands:                2,315,277
Controller Busy Time:               6
Power Cycles:                       43
Power On Hours:                     32
Unsafe Shutdowns:                   36
Media and Data Integrity Errors:    0
Error Information Log Entries:      22
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0
Temperature Sensor 2:               51 Celsius

Error Information (NVMe Log 0x01, 16 of 63 entries)
Num   ErrCount  SQId   CmdId  Status  PELoc          LBA  NSID    VS
  0         22     0  0x2001  0x4004      -            0     0     -
  1         21     0  0x1001  0x4004  0x028            0     0     -

nvme0 errors (nvme1 has the same):

Bash:
root@proxmox:~# nvme error-log /dev/nvme0
Error Log Entries for device:nvme0 entries:63
.................
 Entry[ 0]
.................
error_count    : 24
sqid        : 0
cmdid        : 0x2011
status_field    : 0x4004(INVALID_FIELD: A reserved coded value or an unsupported value in a defined field)
parm_err_loc    : 0xffff
lba        : 0
nsid        : 0
vs        : 0
trtype        : The transport type is not indicated or the error is not transport related.
cs        : 0
trtype_spec_info: 0
.................

some info about nvme1:

Bash:
root@proxmox:~# smartctl -a /dev/nvme1
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.30-2-pve] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       KINGSTON SKC3000S1024G
Serial Number:                      50026B7685EFFF01
Firmware Version:                   EIFK31.6
PCI Vendor/Subsystem ID:            0x2646
IEEE OUI Identifier:                0x0026b7
Total NVM Capacity:                 1,024,209,543,168 [1.02 TB]
Unallocated NVM Capacity:           0
Controller ID:                      1
NVMe Version:                       1.4
Number of Namespaces:               1
Namespace 1 Size/Capacity:          1,024,209,543,168 [1.02 TB]
Namespace 1 Formatted LBA Size:     512
Namespace 1 IEEE EUI-64:            0026b7 685efff015
Local Time is:                      Wed Oct 19 14:42:53 2022 CEST
Firmware Updates (0x12):            1 Slot, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL Self_Test
Optional NVM Commands (0x005d):     Comp DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp
Log Page Attributes (0x08):         Telmtry_Lg
Maximum Data Transfer Size:         512 Pages
Warning  Comp. Temp. Threshold:     84 Celsius
Critical Comp. Temp. Threshold:     89 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     8.80W       -        -    0  0  0  0        0       0
 1 +     7.10W       -        -    1  1  1  1        0       0
 2 +     5.20W       -        -    2  2  2  2        0       0
 3 -   0.0620W       -        -    3  3  3  3     2500    7500
 4 -   0.0620W       -        -    4  4  4  4     2500    7500

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 +     512       0         2
 1 -    4096       0         1

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning:                   0x00
Temperature:                        22 Celsius
Available Spare:                    100%
Available Spare Threshold:          10%
Percentage Used:                    0%
Data Units Read:                    15,415 [7.89 GB]
Data Units Written:                 348,471 [178 GB]
Host Read Commands:                 706,784
Host Write Commands:                2,309,367
Controller Busy Time:               6
Power Cycles:                       20
Power On Hours:                     32
Unsafe Shutdowns:                   14
Media and Data Integrity Errors:    0
Error Information Log Entries:      21
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0
Temperature Sensor 2:               47 Celsius

Error Information (NVMe Log 0x01, 16 of 63 entries)
Num   ErrCount  SQId   CmdId  Status  PELoc          LBA  NSID    VS
  0         21     0  0x301d  0x4004      -            0     0     -
  1         20     0  0x101d  0x4004  0x028            0     0     -

Do you need more logs or infos?

I hope someone can help me out! :)

Thanks you!
 
What does "zpool list" show you?
Anything?

Are there later messages in syslog about the nvme based pool? E.g. failures?
 
Hello apoc!

Thanks for being around :)

Here are the logs:

zpool:

Bash:
root@proxmox:~# zpool list
NAME       SIZE  ALLOC   FREE  CKPOINT  EXPANDSZ   FRAG    CAP  DEDUP    HEALTH  ALTROOT
nvmepool   952G  42.9G   909G        -         -     0%     4%  1.00x    ONLINE  -
rpool      230G  20.5G   209G        -         -     0%     8%  1.00x    ONLINE  -

this are the newest syslog events:

Bash:
Oct 20 17:13:54 proxmox systemd[1]: Starting Import ZFS pools by cache file...
Oct 20 17:13:54 proxmox systemd[1]: Condition check resulted in Import ZFS pools by device scanning being skipped.
Oct 20 17:13:54 proxmox systemd[1]: Starting Import ZFS pool nvmepool...
Oct 20 17:13:54 proxmox zpool[1197]: cannot import 'nvmepool': no such pool available
Oct 20 17:13:54 proxmox systemd[1]: zfs-import@nvmepool.service: Main process exited, code=exited, status=1/FAILURE
Oct 20 17:13:54 proxmox systemd[1]: zfs-import@nvmepool.service: Failed with result 'exit-code'.
Oct 20 17:13:54 proxmox systemd[1]: Failed to start Import ZFS pool nvmepool.
Oct 20 17:13:54 proxmox systemd[1]: Created slice system-lvm2\x2dpvscan.slice.
Oct 20 17:13:54 proxmox systemd[1]: Starting LVM event activation on device 230:18...
Oct 20 17:13:54 proxmox systemd[1]: Starting LVM event activation on device 230:3...
Oct 20 17:13:54 proxmox systemd[1]: Finished Import ZFS pools by cache file.
Oct 20 17:13:54 proxmox lvm[1868]:   pvscan[1868] /dev/zd0p3 excluded by filters: device is rejected by filter config.
Oct 20 17:13:54 proxmox lvm[1867]:   pvscan[1867] /dev/zd16p2 excluded by filters: device is rejected by filter config.
Oct 20 17:13:54 proxmox systemd[1]: Reached target ZFS pool import target.
Oct 20 17:13:54 proxmox systemd[1]: Starting LVM event activation on device 230:34...
Oct 20 17:13:54 proxmox systemd[1]: Starting Mount ZFS filesystems...
Oct 20 17:13:54 proxmox systemd[1]: Starting Wait for ZFS Volume (zvol) links in /dev...
Oct 20 17:13:54 proxmox lvm[1869]:   pvscan[1869] /dev/zd32p2 excluded by filters: device is rejected by filter config.
Oct 20 17:13:54 proxmox zvol_wait[1871]: Testing 5 zvol links
Oct 20 17:13:54 proxmox zvol_wait[1871]: All zvol links are now present.
Oct 20 17:13:54 proxmox systemd[1]: Finished Wait for ZFS Volume (zvol) links in /dev.
Oct 20 17:13:54 proxmox systemd[1]: Reached target ZFS volumes are ready.
Oct 20 17:13:54 proxmox systemd[1]: Finished Mount ZFS filesystems.
Oct 20 17:13:54 proxmox systemd[1]: Reached target Local File Systems.
Oct 20 17:13:54 proxmox systemd[1]: Starting Load AppArmor profiles...
Oct 20 17:13:54 proxmox systemd[1]: Starting Set console font and keymap...
Oct 20 17:13:54 proxmox systemd[1]: Starting Network initialization...
Oct 20 17:13:54 proxmox systemd[1]: Starting Preprocess NFS configuration...
Oct 20 17:13:54 proxmox systemd[1]: Starting Proxmox VE Login Banner...
Oct 20 17:13:54 proxmox systemd[1]: Starting Proxmox VE firewall logger...
Oct 20 17:13:54 proxmox systemd[1]: Starting Commit Proxmox VE network changes...

.
.
.
.
and here
.
.
.
.

Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sda [SAT], not found in smartd database.
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sda [SAT], can't monitor Current_Pending_Sector count - no Attribute 197
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sda [SAT], can't monitor Offline_Uncorrectable count - no Attribute 198
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sda [SAT], is SMART capable. Adding to "monitor" list.
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sda [SAT], state read from /var/lib/smartmontools/smartd.LITEON_CV8_CE256_11_SATA_256GB-TW04G95MLOH0082100B1.ata.state
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb, type changed from 'scsi' to 'sat'
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], opened
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], Samsung SSD 850 EVO 250GB, S/N:S1YBNXAG604863Z, WWN:5-002538-d4021037f, FW:EMT01B6Q, 250 GB
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], found in smartd database: Samsung based SSDs
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], can't monitor Current_Pending_Sector count - no Attribute 197
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], can't monitor Offline_Uncorrectable count - no Attribute 198
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], is SMART capable. Adding to "monitor" list.
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], state read from /var/lib/smartmontools/smartd.Samsung_SSD_850_EVO_250GB-S1YBNXAG604863Z.ata.state
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme0, opened
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme0, KINGSTON SKC3000S1024G, S/N:50026B7686069248, FW:EIFK31.6, 1.02 TB
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme0, is SMART capable. Adding to "monitor" list.
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme0, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7686069248.nvme.state
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme1, opened
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme1, KINGSTON SKC3000S1024G, S/N:50026B7685EFFF01, FW:EIFK31.6, 1.02 TB
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme1, is SMART capable. Adding to "monitor" list.
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme1, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7685EFFF01.nvme.state
Oct 20 17:13:54 proxmox smartd[1972]: Monitoring 2 ATA/SATA, 0 SCSI/SAS and 2 NVMe devices
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 69 to 72
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme0, number of Error Log entries increased from 30 to 32
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme1, number of Error Log entries increased from 25 to 27
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sda [SAT], state written to /var/lib/smartmontools/smartd.LITEON_CV8_CE256_11_SATA_256GB-TW04G95MLOH0082100B1.ata.state
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/sdb [SAT], state written to /var/lib/smartmontools/smartd.Samsung_SSD_850_EVO_250GB-S1YBNXAG604863Z.ata.state
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme0, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7686069248.nvme.state
Oct 20 17:13:54 proxmox smartd[1972]: Device: /dev/nvme1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7685EFFF01.nvme.state
Oct 20 17:13:54 proxmox systemd[1]: Started Self Monitoring and Reporting Technology (SMART) Daemon.
Oct 20 17:13:55 proxmox systemd[1]: Finished Proxmox VE Login Banner.
Oct 20 17:13:55 proxmox systemd-udevd[1007]: ethtool: autonegotiation is unset or enabled, the speed and duplex are not writable.

here are a few later log events:

lxc error:

Bash:
Oct 20 17:09:48 proxmox systemd[1]: Stopped PVE Local HA Resource Manager Daemon.
Oct 20 17:09:48 proxmox systemd[1]: Stopping LXC Container Initialization and Autoboot Code...
Oct 20 17:09:48 proxmox systemd[1]: Stopping PVE Cluster HA Resource Manager Daemon...
Oct 20 17:09:48 proxmox systemd[1]: Stopping PVE API Proxy Server...
Oct 20 17:09:48 proxmox systemd[1]: Stopping PVE Qemu Event Daemon...
Oct 20 17:09:48 proxmox systemd[1]: qmeventd.service: Succeeded.
Oct 20 17:09:48 proxmox systemd[1]: Stopped PVE Qemu Event Daemon.
Oct 20 17:09:48 proxmox systemd[1]: lxc.service: Control process exited, code=exited, status=1/FAILURE
Oct 20 17:09:48 proxmox systemd[1]: lxc.service: Failed with result 'exit-code'.
Oct 20 17:09:48 proxmox systemd[1]: Stopped LXC Container Initialization and Autoboot Code.
Oct 20 17:09:48 proxmox systemd[1]: Stopping LXC network bridge setup...
Oct 20 17:09:48 proxmox systemd[1]: Stopping FUSE filesystem for LXC...
Oct 20 17:09:48 proxmox lxcfs[2058]: Running destructor lxcfs_exit
Oct 20 17:09:48 proxmox systemd[1]: var-lib-lxcfs.mount: Succeeded.
Oct 20 17:09:48 proxmox systemd[1]: Unmounted /var/lib/lxcfs.
Oct 20 17:09:48 proxmox systemd[1]: lxc-net.service: Succeeded.
Oct 20 17:09:48 proxmox systemd[1]: Stopped LXC network bridge setup.
Oct 20 17:09:48 proxmox fusermount[5181]: /bin/fusermount: failed to unmount /var/lib/lxcfs: Invalid argument
Oct 20 17:09:48 proxmox systemd[1]: lxcfs.service: Succeeded.
Oct 20 17:09:48 proxmox systemd[1]: Stopped FUSE filesystem for LXC.
Oct 20 17:09:48 proxmox systemd[1]: pvefw-logger.service: Succeeded.
Oct 20 17:09:48 proxmox systemd[1]: Stopped Proxmox VE firewall logger.

more errors:

Bash:
Oct 20 17:06:22 proxmox zed[2111]: ZFS Event Daemon 2.1.4-pve1 (PID 2111)
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda, type changed from 'scsi' to 'sat'
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda [SAT], opened
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda [SAT], LITEON CV8-CE256-11 SATA 256GB, S/N:TW04G95MLOH0082100B1, WWN:5-002303-101148d86, FW:C18110A, 256 GB
Oct 20 17:06:22 proxmox zed[2111]: Processing events since eid=0
Oct 20 17:06:22 proxmox systemd[1]: Started Kernel Samepage Merging (KSM) Tuning Daemon.
Oct 20 17:06:22 proxmox systemd[1]: Started PVE Qemu Event Daemon.
Oct 20 17:06:22 proxmox systemd[1]: Finished ZFS file system shares.
Oct 20 17:06:22 proxmox systemd[1]: Reached target ZFS startup target.
Oct 20 17:06:22 proxmox zed: eid=3 class=pool_import pool='rpool'
Oct 20 17:06:22 proxmox zed: eid=2 class=config_sync pool='rpool'
Oct 20 17:06:22 proxmox zed: eid=5 class=config_sync pool='rpool'
Oct 20 17:06:22 proxmox zed: eid=8 class=pool_import pool='nvmepool'
Oct 20 17:06:22 proxmox zed: eid=7 class=config_sync pool='nvmepool'
Oct 20 17:06:22 proxmox zed: eid=10 class=config_sync pool='nvmepool'
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda [SAT], not found in smartd database.
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda [SAT], can't monitor Current_Pending_Sector count - no Attribute 197
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda [SAT], can't monitor Offline_Uncorrectable count - no Attribute 198
Oct 20 17:06:22 proxmox dbus-daemon[2054]: [system] AppArmor D-Bus mediation is enabled
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda [SAT], is SMART capable. Adding to "monitor" list.
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda [SAT], state read from /var/lib/smartmontools/smartd.LITEON_CV8_CE256_11_SATA_256GB-TW04G95MLOH0082100B1.ata.state
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb, type changed from 'scsi' to 'sat'
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], opened
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], Samsung SSD 850 EVO 250GB, S/N:S1YBNXAG604863Z, WWN:5-002538-d4021037f, FW:EMT01B6Q, 250 GB
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], found in smartd database: Samsung based SSDs
Oct 20 17:06:22 proxmox systemd[1]: Finished LVM event activation on device 230:3.
Oct 20 17:06:22 proxmox systemd[1]: Finished LVM event activation on device 230:18.
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], can't monitor Current_Pending_Sector count - no Attribute 197
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], can't monitor Offline_Uncorrectable count - no Attribute 198
Oct 20 17:06:22 proxmox systemd[1]: Started User Login Management.
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], is SMART capable. Adding to "monitor" list.
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], state read from /var/lib/smartmontools/smartd.Samsung_SSD_850_EVO_250GB-S1YBNXAG604863Z.ata.state
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme0, opened
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme0, KINGSTON SKC3000S1024G, S/N:50026B7686069248, FW:EIFK31.6, 1.02 TB
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme0, is SMART capable. Adding to "monitor" list.
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme0, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7686069248.nvme.state
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme1, opened
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme1, KINGSTON SKC3000S1024G, S/N:50026B7685EFFF01, FW:EIFK31.6, 1.02 TB
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme1, is SMART capable. Adding to "monitor" list.
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme1, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7685EFFF01.nvme.state
Oct 20 17:06:22 proxmox smartd[2080]: Monitoring 2 ATA/SATA, 0 SCSI/SAS and 2 NVMe devices
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 71 to 69
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme0, number of Error Log entries increased from 28 to 30
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme1, number of Error Log entries increased from 23 to 25
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sda [SAT], state written to /var/lib/smartmontools/smartd.LITEON_CV8_CE256_11_SATA_256GB-TW04G95MLOH0082100B1.ata.state
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/sdb [SAT], state written to /var/lib/smartmontools/smartd.Samsung_SSD_850_EVO_250GB-S1YBNXAG604863Z.ata.state
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme0, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7686069248.nvme.state
Oct 20 17:06:22 proxmox smartd[2080]: Device: /dev/nvme1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7685EFFF01.nvme.state
Oct 20 17:06:22 proxmox systemd[1]: Finished LVM event activation on device 230:34.
Oct 20 17:06:22 proxmox systemd[1]: Started Self Monitoring and Reporting Technology (SMART) Daemon.
Oct 20 17:06:23 proxmox systemd[1]: Finished Proxmox VE Login Banner.

Post 1 / 2
 
Post 2 / 2

I always get ERST error:

Bash:
Oct 19 02:15:32 proxmox kernel: [    0.422846] ERST: Error Record Serialization Table (ERST) support is initialized.


This was the first time I had errors. This was 2 hours after I'd built the server and installed proxmox for the first time:

Bash:
Oct 18 12:13:51 proxmox smartd[1292]: smartd 7.2 2020-12-30 r5155 [x86_64-linux-5.15.30-2-pve] (local build)
Oct 18 12:13:51 proxmox smartd[1292]: Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org
Oct 18 12:13:51 proxmox smartd[1292]: Opened configuration file /etc/smartd.conf
Oct 18 12:13:51 proxmox smartd[1292]: Drive: DEVICESCAN, implied '-a' Directive on line 21 of file /etc/smartd.conf
Oct 18 12:13:51 proxmox smartd[1292]: Configuration file /etc/smartd.conf was parsed, found DEVICESCAN, scanning devices
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda, type changed from 'scsi' to 'sat'
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda [SAT], opened
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda [SAT], LITEON CV8-CE256-11 SATA 256GB, S/N:TW04G95MLOH0082100B1, WWN:5-002303-101148d86, FW:C18110A, 256 GB
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda [SAT], not found in smartd database.
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda [SAT], can't monitor Current_Pending_Sector count - no Attribute 197
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda [SAT], can't monitor Offline_Uncorrectable count - no Attribute 198
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda [SAT], is SMART capable. Adding to "monitor" list.
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda [SAT], state read from /var/lib/smartmontools/smartd.LITEON_CV8_CE256_11_SATA_256GB-TW04G95MLOH0082100B1.ata.state
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb, type changed from 'scsi' to 'sat'
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb [SAT], opened
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb [SAT], Samsung SSD 850 EVO 250GB, S/N:S1YBNXAG604863Z, WWN:5-002538-d4021037f, FW:EMT01B6Q, 250 GB
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb [SAT], found in smartd database: Samsung based SSDs
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb [SAT], can't monitor Current_Pending_Sector count - no Attribute 197
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb [SAT], can't monitor Offline_Uncorrectable count - no Attribute 198
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb [SAT], is SMART capable. Adding to "monitor" list.
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb [SAT], state read from /var/lib/smartmontools/smartd.Samsung_SSD_850_EVO_250GB-S1YBNXAG604863Z.ata.state
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme0, opened
Oct 18 12:13:51 proxmox systemd[1]: Started User Login Management.
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme0, KINGSTON SKC3000S1024G, S/N:50026B7686069248, FW:EIFK31.6, 1.02 TB
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme0, is SMART capable. Adding to "monitor" list.
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme0, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7686069248.nvme.state
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme1, opened
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme1, KINGSTON SKC3000S1024G, S/N:50026B7685EFFF01, FW:EIFK31.6, 1.02 TB
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme1, is SMART capable. Adding to "monitor" list.
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme1, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7685EFFF01.nvme.state
Oct 18 12:13:51 proxmox smartd[1292]: Monitoring 2 ATA/SATA, 0 SCSI/SAS and 2 NVMe devices
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme0, number of Error Log entries increased from 8 to 10
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme1, number of Error Log entries increased from 7 to 9
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sda [SAT], state written to /var/lib/smartmontools/smartd.LITEON_CV8_CE256_11_SATA_256GB-TW04G95MLOH0082100B1.ata.state
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/sdb [SAT], state written to /var/lib/smartmontools/smartd.Samsung_SSD_850_EVO_250GB-S1YBNXAG604863Z.ata.state
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme0, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7686069248.nvme.state
Oct 18 12:13:51 proxmox smartd[1292]: Device: /dev/nvme1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S1024G-50026B7685EFFF01.nvme.state
Oct 18 12:13:51 proxmox systemd[1]: Started Self Monitoring and Reporting Technology (SMART) Daemon.
Oct 18 12:13:51 proxmox systemd[1]: Finished Proxmox VE Login Banner.
Oct 18 12:13:51 proxmox systemd-udevd[950]: ethtool: autonegotiation is unset or enabled, the speed and duplex are not writable.
Oct 18 12:13:51 proxmox kernel: [    4.950176] vmbr0: port 1(enp3s0) entered blocking state
Oct 18 12:13:51 proxmox kernel: [    4.950530] vmbr0: port 1(enp3s0) entered disabled state
Oct 18 12:13:52 proxmox kernel: [    4.953513] device enp3s0 entered promiscuous mode

I hope this will help :)

Thanks a lot!
 
I don't get your problem mate.
Your nvme pool is online. Are you concerned about the smart values the system can't read?
To me this reads as if the devices are too new to be in the smart-database

You can contact Kingston support - these guys are very fast and helpful from my experience
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!