I noticed that two of the nodes in our cluster are greyed out now, after updating and restarting the servers.
I see this in logs:
Aug 4 18:10:57 pve-2 systemd[1]: pvestatd.service: Found left-over process 21897 (vgs) in control group while starting unit. Ignoring.
Aug 4 18:10:57 pve-2 systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Aug 4 18:10:57 pve-2 systemd[1]: pvestatd.service: Found left-over process 3643 (vgs) in control group while starting unit. Ignoring.
Aug 4 18:10:57 pve-2 systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Aug 4 18:10:57 pve-2 systemd[1]: pvestatd.service: Found left-over process 26649 (vgs) in control group while starting unit. Ignoring.
Aug 4 18:10:57 pve-2 systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Aug 4 18:10:57 pve-2 systemd[1]: pvestatd.service: Found left-over process 29105 (vgs) in control group while starting unit. Ignoring.
Aug 4 18:10:57 pve-2 systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Aug 4 18:10:57 pve-2 systemd[1]: pvestatd.service: Found left-over process 4118 (vgs) in control group while starting unit. Ignoring.
Aug 4 18:10:57 pve-2 systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
and
Aug 4 18:08:00 pve-2 systemd[1]: Starting Proxmox VE replication runner...
Aug 4 18:08:01 pve-2 systemd[1]: pvesr.service: Succeeded.
Aug 4 18:08:01 pve-2 systemd[1]: Started Proxmox VE replication runner.
Aug 4 18:09:00 pve-2 systemd[1]: Starting Proxmox VE replication runner...
Aug 4 18:09:01 pve-2 systemd[1]: pvesr.service: Succeeded.
Aug 4 18:09:01 pve-2 systemd[1]: Started Proxmox VE replication runner.
Aug 4 18:09:27 pve-2 systemd[1]: pvestatd.service: State 'stop-final-sigterm' timed out. Killing.
Aug 4 18:09:27 pve-2 systemd[1]: pvestatd.service: Killing process 21897 (vgs) with signal SIGKILL.
Aug 4 18:09:27 pve-2 systemd[1]: pvestatd.service: Killing process 3643 (vgs) with signal SIGKILL.
Aug 4 18:09:27 pve-2 systemd[1]: pvestatd.service: Killing process 26649 (vgs) with signal SIGKILL.
Aug 4 18:09:27 pve-2 systemd[1]: pvestatd.service: Killing process 29105 (vgs) with signal SIGKILL.
Aug 4 18:09:27 pve-2 systemd[1]: pvestatd.service: Killing process 4118 (vgs) with signal SIGKILL.
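For what it's worth, those left-over vgs processes look genuinely stuck. This is just a rough way I've been checking them (standard ps output, nothing Proxmox-specific); a "D" state means uninterruptible sleep, which usually points at hung I/O such as a dead NFS mount:

# list any vgs processes with their state and kernel wait channel
ps -eo pid,stat,wchan:30,etime,cmd | grep '[v]gs'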
It seems that after I restart pvestatd it works fine for a while, but the NFS servers are still not accessible for backups.
When I reboot the node, however, everything works for a while until the vgs check runs; then the command freezes and never completes.
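For context, this is roughly what I run when I say I "restart pvestatd"; treat it as a sketch of my environment rather than a fix:

# restart the stats daemon and confirm it comes back up
systemctl restart pvestatd
systemctl status pvestatd --no-pager

# check whether the storages (including the NFS ones) show as active again
pvesm status

# run the same vgs check pvestatd does, with a timeout so the shell doesn't hang too
timeout 15 vgs
echo $?   # 124 here means vgs itself timed out, i.e. it is hanging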
Is anyone else experiencing this on the new Proxmox kernels?
I even tried downgrading to the following kernel:
Kernel Version: Linux 5.4.41-1-pve #1 SMP PVE 5.4.41-1 (Fri, 15 May 2020 15:06:08 +0200)
PVE Manager Version: pve-manager/6.2-10/a20769ed
But it still happens.
Could it be that these two nodes are running pve-manager 6.2-10 while the others in the cluster are still on 6.2-4?
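In case the version spread matters, this is how I'm comparing the nodes (run on each one; the grep pattern is just the packages I happen to care about):

# compare manager, kernel and LVM package versions between nodes
pveversion -v | grep -E 'pve-manager|pve-kernel|lvm2'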
Thoughts?