Is anyone else seeing the following ZFS crash? It has already happened on two of my PVE nodes.
Kernel log
Code:
Jun 10 15:14:40 pveLA01 kernel: [2306462.646839] Code: 75 0e 4d 89 f9 41 f6 47 0b 04 0f 84 f4 fe ff ff 4c 89 ff e8 1a f5 01 00 49 89 c1 e9 e4 fe ff ff 41 8b 41 20 49 8b 39 4c 01 d0 <48> 8b 18 48 89 c1 49 33 99 70 01 00 00 4c 89 d0 48 0f c9 48 31 cb
Jun 10 15:14:40 pveLA01 kernel: [2306462.650572] R10: dd65a112c2f52240 R11: ffff993269741560 R12: 0000000000042d00
Jun 10 15:14:40 pveLA01 kernel: [2306462.985224] RBP: ffffaac419b479f0 R08: ffff991aa08b0040 R09: ffff991aa0007b80
Jun 10 15:14:40 pveLA01 kernel: [2306462.988381] CR2: 000000c000e85000 CR3: 0000002eeaf26006 CR4: 00000000007626e0
Jun 10 15:14:40 pveLA01 kernel: [2306462.990328] PKRU: 55555554
Jun 10 15:14:40 pveLA01 kernel: [2306462.992162] spl_kmem_zalloc+0xe9/0x140 [spl]
Jun 10 15:14:40 pveLA01 kernel: [2306462.993985] dmu_write_uio_dnode+0x4c/0x140 [zfs]
Jun 10 15:14:40 pveLA01 kernel: [2306462.995688] zfs_write+0xa1b/0xed0 [zfs]
Jun 10 15:14:40 pveLA01 kernel: [2306462.997247] zpl_iter_write+0xee/0x130 [zfs]
Jun 10 15:14:40 pveLA01 kernel: [2306462.998627] vfs_write+0xab/0x1b0
Jun 10 15:14:40 pveLA01 kernel: [2306462.999925] do_syscall_64+0x57/0x190
Jun 10 15:14:40 pveLA01 kernel: [2306463.002056] RSP: 002b:00007ff6d1152840 EFLAGS: 00000293 ORIG_RAX: 0000000000000001
Jun 10 15:14:40 pveLA01 kernel: [2306463.003825] R10: 0000000000000000 R11: 0000000000000293 R12: 0000000000000072
Jun 10 15:14:40 pveLA01 kernel: [2306463.011854] ---[ end trace 5fcf6e5bb7cbecb4 ]---
Jun 10 15:14:41 pveLA01 kernel: [2306463.772578] RSP: 0018:ffffaac4a742f9b0 EFLAGS: 00010282
Jun 10 15:14:41 pveLA01 kernel: [2306464.099178] RSP: 0018:ffffaac4a742f9b0 EFLAGS: 00010282
Jun 10 15:14:41 pveLA01 kernel: [2306464.101652] R10: dd65a112c2f52240 R11: ffff993269741560 R12: 0000000000042d00
Jun 10 15:14:41 pveLA01 kernel: [2306464.103360] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 10 15:15:03 pveLA01 kernel: [2306486.413641] ---[ end trace 5fcf6e5bb7cbecb7 ]---
Jun 10 15:15:03 pveLA01 kernel: [2306486.433023] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jun 10 15:15:14 pveLA01 kernel: [2306496.459072] RAX: dd65a112c2f52240 RBX: 0000000000000000 RCX: 0000000000000000
Jun 10 15:15:14 pveLA01 kernel: [2306496.461582] R13: 0000000000000008 R14: 00000000ffffffff R15: ffff991aa0007b80
Jun 10 15:15:14 pveLA01 kernel: [2306496.463984] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jun 10 15:15:14 pveLA01 kernel: [2306496.466197] ? __vmalloc_node_range+0xd4/0x270
Jun 10 15:15:14 pveLA01 kernel: [2306496.468203] alloc_counters.isra.11+0x2b/0x130 [ip6_tables]
Jun 10 15:15:14 pveLA01 kernel: [2306496.470021] ipv6_getsockopt+0xa1/0xe0
Jun 10 15:15:14 pveLA01 kernel: [2306496.471695] __x64_sys_getsockopt+0x24/0x30
Jun 10 15:15:14 pveLA01 kernel: [2306496.473285] Code: 48 8b 0d c9 08 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 49 89 ca b8 37 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 96 08 0c 00 f7 d8 64 89 01 48
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.601639] general protection fault: 0000 [#9] SMP NOPTI
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.602247] CPU: 10 PID: 1567647 Comm: zvol Tainted: P D OE 5.4.34-1-pve #1
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.603444] RIP: 0010:__kmalloc_node+0x198/0x330
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.604020] Code: 75 0e 4d 89 f9 41 f6 47 0b 04 0f 84 f4 fe ff ff 4c 89 ff e8 1a f5 01 00 49 89 c1 e9 e4 fe ff ff 41 8b 41 20 49 8b 39 4c 01 d0 <48> 8b 18 48 89 c1 49 33 99 70 01 00 00 4c 89 d0 48 0f c9 48 31 cb
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.605084] RSP: 0018:ffffaac415e83ba0 EFLAGS: 00010282
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.605583] RAX: dd65a112c2f52240 RBX: 0000000000000000 RCX: 0000000000000000
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.606548] RBP: ffffaac415e83be0 R08: ffff991aa08b0040 R09: ffff991aa0007b80
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.607009] R10: dd65a112c2f52240 R11: 00000000842c7000 R12: 0000000000042d00
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.607914] FS: 0000000000000000(0000) GS:ffff991aa0880000(0000) knlGS:0000000000000000
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.608776] CR2: 0000011e159ad000 CR3: 0000002276a0a002 CR4: 00000000007626e0
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.609598] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.610753] ? spl_kmem_zalloc+0xe9/0x140 [spl]
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.612251] dmu_write_uio_dnode+0x4c/0x140 [zfs]
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.613671] kthread+0x120/0x140
Jun 10 15:15:44 gpu01-la3 kernel: [2306526.614967] Modules linked in: tcp_diag(E) inet_diag(E) rbd(E) libceph(E) veth(E) ebtable_filter(E) ebtables(E) ip_set(E) ip6table_raw(E) iptable_raw(E) ip6table_filter(E) ip6_tables(E) sctp(E) iptable_filter(E) bpfilter(E) binfmt_misc(E) ipmi_watchdog(E) bonding(E) nfnetlink_log(E) nfnetlink(E) ipmi_ssif(E) snd_hda_codec_hdmi(E) intel_rapl_msr(E) intel_rapl_common(E) isst_if_common(E) skx_edac(E) nfit(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) kvm_intel(E) kvm(E) crct10dif_pclmul(E) crc32_pclmul(E) ghash_clmulni_intel(E) aesni_intel(E) crypto_simd(E) snd_hda_intel(E) joydev(E) input_leds(E) cryptd(E) snd_intel_dspcfg(E) glue_helper(E) snd_hda_codec(E) snd_hda_core(E) ast(E) snd_hwdep(E) intel_cstate(E) drm_vram_helper(E) snd_pcm(E) ttm(E) snd_timer(E) drm_kms_helper(E) intel_rapl_perf(E) snd(E) drm(E) soundcore(E) i2c_algo_bit(E) fb_sys_fops(E) mei_me(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) mei(E) ioatdma(E) ipmi_si(E) ipmi_devintf(E) ipmi_msghandler(E)
My package versions
Code:
proxmox-ve: 6.2-1 (running kernel: 5.4.41-1-pve)
pve-manager: 6.2-6 (running version: 6.2-6/ee1d7754)
pve-kernel-5.4: 6.2-2
pve-kernel-helper: 6.2-2
pve-kernel-5.3: 6.1-6
pve-kernel-5.0: 6.0-11
pve-kernel-5.4.41-1-pve: 5.4.41-1
pve-kernel-5.4.34-1-pve: 5.4.34-2
pve-kernel-5.3.18-3-pve: 5.3.18-3
pve-kernel-5.3.13-1-pve: 5.3.13-1
pve-kernel-5.0.21-5-pve: 5.0.21-10
pve-kernel-5.0.15-1-pve: 5.0.15-1
ceph: 14.2.9-pve1
ceph-fuse: 14.2.9-pve1
corosync: 3.0.3-pve1
criu: 3.11-3
glusterfs-client: 7.6-1
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.15-pve1
libproxmox-acme-perl: 1.0.4
libpve-access-control: 6.1-1
libpve-apiclient-perl: 3.0-3
libpve-common-perl: 6.1-3
libpve-guest-common-perl: 3.0-10
libpve-http-server-perl: 3.0-5
libpve-storage-perl: 6.1-8
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve4
lxc-pve: 4.0.2-1
lxcfs: 4.0.3-pve2
novnc-pve: 1.1.0-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.2-7
pve-cluster: 6.1-8
pve-container: 3.1-8
pve-docs: 6.2-4
pve-edk2-firmware: 2.20200229-1
pve-firewall: 4.1-2
pve-firmware: 3.1-1
pve-ha-manager: 3.0-9
pve-i18n: 2.1-3
pve-qemu-kvm: 5.0.0-4
pve-xtermjs: 4.3.0-1
qemu-server: 6.2-3
smartmontools: 7.1-pve2
spiceterm: 3.1-1
vncterm: 1.6-1
zfsutils-linux: 0.8.4-pve1