Hi, one of our small clusters crashed and I can't explain why yet, even after reading the logs.
We are running 7.3.3.
I was preparing node 1 for maintenance, so I migrated a VM to node 2 (local ZFS to local ZFS).
Our HA was active, so it tried to move the VM back to node 1 right after the migration task finished.
That is where something happened: node 1 seems to have crashed. The last thing visible in the log is the VM's bridge ports going from blocking to forwarding.
Right after those log lines there is nothing; the server rebooted. I don't even think it was killed by HA.
Then node 3 sent around 1000 emails, apparently retrying the fence and the HA move over and over, until node 1 (I think) actually reset by itself.
What the hell happened here?
Task START:
Feb 7 11:04:08 node_01 pve-ha-lrm[2516888]: Task 'UPID:node_01:0026679B:08C737FB:63E27569:qmigrate:7013:root@pam:' still active, waiting
Feb 7 11:04:09 node_01 pve-ha-lrm[2516888]: <root@pam> end task UPID:node_01:0026679B:08C737FB:63E27569:qmigrate:7013:root@pam: OK
Feb 7 11:04:17 node_01 pmxcfs[2137]: [status] notice: received log
Feb 7 11:04:17 node_01 systemd[1]: Started Session 7015 of user root.
Feb 7 11:04:17 node_01 systemd[1]: session-7015.scope: Succeeded.
Feb 7 11:04:18 node_01 systemd[1]: Started Session 7016 of user root.
Feb 7 11:04:19 node_01 qm[2536688]: <root@pam> starting task UPID:node_01:0026B53B:08C7A5CD:63E27683:qmstart:7013:root@pam:
Feb 7 11:04:19 node_01 qm[2536763]: start VM 7013: UPID:node_01:0026B53B:08C7A5CD:63E27683:qmstart:7013:root@pam:
Feb 7 11:04:19 node_01 kernel: [1473010.930448] debugfs: Directory 'zd64' with parent 'block' already present!
Feb 7 11:04:20 node_01 systemd[1]: Started 7013.scope.
Feb 7 11:04:20 node_01 systemd-udevd[2536954]: Using default interface naming scheme 'v247'.
Feb 7 11:04:20 node_01 systemd-udevd[2536954]: ethtool: autonegotiation is unset or enabled, the speed and duplex are not writable.
Feb 7 11:04:21 node_01 kernel: [1473012.125572] device tap7013i0 entered promiscuous mode
Feb 7 11:04:21 node_01 systemd-udevd[2536954]: ethtool: autonegotiation is unset or enabled, the speed and duplex are not writable.
Feb 7 11:04:21 node_01 systemd-udevd[2536954]: ethtool: autonegotiation is unset or enabled, the speed and duplex are not writable.
Feb 7 11:04:21 node_01 systemd-udevd[2537045]: ethtool: autonegotiation is unset or enabled, the speed and duplex are not writable.
Feb 7 11:04:21 node_01 systemd-udevd[2537045]: Using default interface naming scheme 'v247'.
Feb 7 11:04:21 node_01 kernel: [1473012.186340] vmbr1: port 3(fwpr7013p0) entered blocking state
Feb 7 11:04:21 node_01 kernel: [1473012.186349] vmbr1: port 3(fwpr7013p0) entered disabled state
Feb 7 11:04:21 node_01 kernel: [1473012.186603] device fwpr7013p0 entered promiscuous mode
Feb 7 11:04:21 node_01 kernel: [1473012.187767] vmbr1: port 3(fwpr7013p0) entered blocking state
Feb 7 11:04:21 node_01 kernel: [1473012.187770] vmbr1: port 3(fwpr7013p0) entered forwarding state
Feb 7 11:04:21 node_01 kernel: [1473012.216419] fwbr7013i0: port 1(fwln7013i0) entered blocking state
Feb 7 11:04:21 node_01 kernel: [1473012.216425] fwbr7013i0: port 1(fwln7013i0) entered disabled state
Feb 7 11:04:21 node_01 kernel: [1473012.216514] device fwln7013i0 entered promiscuous mode
Feb 7 11:04:21 node_01 kernel: [1473012.216587] fwbr7013i0: port 1(fwln7013i0) entered blocking state
Feb 7 11:04:21 node_01 kernel: [1473012.216590] fwbr7013i0: port 1(fwln7013i0) entered forwarding state
Feb 7 11:04:21 node_01 kernel: [1473012.230343] fwbr7013i0: port 2(tap7013i0) entered blocking state
Feb 7 11:04:21 node_01 kernel: [1473012.230348] fwbr7013i0: port 2(tap7013i0) entered disabled state
Feb 7 11:04:21 node_01 kernel: [1473012.230480] fwbr7013i0: port 2(tap7013i0) entered blocking state
Feb 7 11:04:21 node_01 kernel: [1473012.230483] fwbr7013i0: port 2(tap7013i0) entered forwarding state
NODE 1 Crash
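To double-check what state each bridge port was in when the log stops, I put together a small script. It is only a rough sketch: the regex and the function name `final_port_states` are my own, it assumes the syslog line format shown above, and the sample lines are copied from the log in this post.

```python
import re

# Matches kernel bridge messages such as:
#   kernel: [1473012.187770] vmbr1: port 3(fwpr7013p0) entered forwarding state
# (pattern is an assumption based on the log lines pasted above)
BRIDGE_RE = re.compile(
    r"kernel: \[\s*(?P<uptime>\d+\.\d+)\] (?P<bridge>\S+): "
    r"port (?P<port>\d+)\((?P<iface>[^)]+)\) entered (?P<state>\w+) state"
)


def final_port_states(lines):
    """Return {(bridge, iface): last state seen} for the given syslog lines."""
    states = {}
    for line in lines:
        m = BRIDGE_RE.search(line)
        if m:
            # Later lines overwrite earlier ones, so the last state wins.
            states[(m["bridge"], m["iface"])] = m["state"]
    return states


sample = [
    "Feb 7 11:04:21 node_01 kernel: [1473012.186340] vmbr1: port 3(fwpr7013p0) entered blocking state",
    "Feb 7 11:04:21 node_01 kernel: [1473012.186349] vmbr1: port 3(fwpr7013p0) entered disabled state",
    "Feb 7 11:04:21 node_01 kernel: [1473012.187770] vmbr1: port 3(fwpr7013p0) entered forwarding state",
    "Feb 7 11:04:21 node_01 kernel: [1473012.230483] fwbr7013i0: port 2(tap7013i0) entered forwarding state",
]

if __name__ == "__main__":
    for (bridge, iface), state in final_port_states(sample).items():
        print(f"{bridge} / {iface}: {state}")
```

Running it over the full log should show whether any port of VM 7013 was left in a state other than forwarding before node 1 went down.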