[SOLVED] Problem after upgrade from 7 to 8

Dexter23

Member
Dec 23, 2021
195
15
23
35
Hi guys i have 2 node cluster with pbs as QDevice.

The 2nd node i upgrade without problem after that i migrate the 2 vm from pve01 to pve02 and then i start upgrade also the node master "pve01" but i lost the shell so i don't know if the apt dist-upgrade finish or not.

Now i have this state in Datacenter>HA:
1727179719262.png1727179734756.png
This line "watchdog...." is the line when i run "apt dist-upgrade" and is stuck there, now i have dpkg in "inconsistent state" so i run the command "dpkg --configure -a" but is still also in the same line.
1727180585000.png
 
Last edited:
UPDATE:
I fix the prbolem with running again "dpkg --configure -a" and after force reboot with "reboot -f" now the pve01 doesn't have joq in queque or service in "stop" state.
But in Datacenter>HA the two VM still in "deleting" state but if i do on pve01 "qm list" not show anything, and i noticed that also the directory "/etc/pve/qemu-server" not contain the configuration file for the two VM.
PVECM Status:

Code:
Cluster information
-------------------
Name:             CustomerName
Config Version:   3
Transport:        knet
Secure auth:      on

Quorum information
------------------
Date:             Wed Sep 25 09:11:43 2024
Quorum provider:  corosync_votequorum
Nodes:            2
Node ID:          0x00000001
Ring ID:          1.10c
Quorate:          Yes

Votequorum information
----------------------
Expected votes:   3
Highest expected: 3
Total votes:      3
Quorum:           2
Flags:            Quorate Qdevice

Membership information
----------------------
    Nodeid      Votes    Qdevice Name
0x00000001          1    A,V,NMW 10.0.1.236 (local)
0x00000002          1    A,V,NMW 10.0.1.237
0x00000000          1            Qdevice
root@pve01:~#

I wait your reply, thanks.

UPDATE2:
1727261641112.png
PVE01 "systemctl status pve-ha-lrm.service pve-ha-crm.service"

1727261702873.png
PVE02 "systemctl status pve-ha-lrm.service pve-ha-crm.service"
1727261767972.png
 
Last edited:
Fix by doing this command on each node:
Code:
systemctl status pve-ha-lrm.service pve-ha-crm.service