Quorum Disk down causes odd GUI behaviors

Hi,

I don't think your problem is related to the quorum or the cluster state.

The problem is that pvestatd is hanging somewhere.
pvestatd reads stats from the host, then the VMs, then the storages, sequentially.
There are some timeouts implemented for storage (ping check, NFS stats checks, ...) to try to detect offline storage and bypass it.

Sometimes the timeout doesn't work and pvestatd hangs on that storage check.
That's why no more information is displayed in the GUI.
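
To make the idea concrete, here is a rough sketch of a sequential storage check with a per-storage timeout. This is not pvestatd's actual code: the function name check_storage, the 2 second limit, and the use of df as the probe are all assumptions made for illustration, and /mnt/pve is only assumed to be where the storages are mounted.

```shell
#!/bin/sh
# Hypothetical illustration only, not how pvestatd is implemented.
# check_storage PATH [SECONDS]
# Prints "online" if df can query PATH within the time limit, else "offline".
check_storage() {
    path=$1
    limit=${2:-2}
    # timeout(1) kills the probe if it runs longer than $limit seconds.
    if timeout "$limit" df "$path" >/dev/null 2>&1; then
        echo "online"
    else
        echo "offline"
    fi
}

# Walk the storages one after another, the way a sequential poller would.
# /mnt/pve is an assumed mount location; adjust it to your setup.
for dir in /mnt/pve/*; do
    [ -d "$dir" ] || continue
    printf '%s: %s\n' "$dir" "$(check_storage "$dir")"
done
```

Because the checks run one after another, a single probe that never returns holds up everything after it. A process blocked in uninterruptible I/O (for example on a hard-mounted NFS share) may not react to the signal from timeout, which is one way a guard like this can fail to help.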


So the question is: do you also use the storage on the machine that hosts the quorum disk (the one that is down) for other things (VM storage, backup storage, ...)?

Also check:

grep pvestatd /var/log/daemon.log

Maybe you'll find some logs there.

I do use some storage on the machine that provides the quorum disk, for some simple backups. However, this is happening on clusters both with and without storage located on the device providing the quorum disk. This was also dietmer's thought; I provided output showing that it did not seem to be hanging while the quorum disk machine was down. I am stumped at this point, but there is no doubt the cluster itself is 100% working as expected.
 
I am really starting to question why we even bother to purchase subscriptions. The "Enterprise Repos" have proven to be no more stable than the "Non-Enterprise Repos", and the devs respond when they feel like it. This is clearly a Proxmox issue which should be addressed. I do have the ability to create tickets, but IMO I shouldn't be wasting our tickets on bugs within Proxmox.
 
I have proven and stated time and time again that this is a Proxmox issue and not a configuration or quorum disk issue.
 
Hi adam,

When this problem occurs:

- What is the content of /var/log/cluster/corosync.log?
- Are you able to write to /etc/pve/? (For example, can you open /etc/pve/qemu-server/vmid.conf and save it?)
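
If you prefer a scripted check for the write test, something like the sketch below could be used. It is only an illustration: the function name can_write, the probe file name, and the 5 second limit are made up for this example, and it assumes that creating and removing a scratch file directly under /etc/pve is acceptable on your node.

```shell
#!/bin/sh
# Hypothetical helper, not part of Proxmox VE.
# can_write DIR
# Prints "writable" if a scratch file can be created and removed in DIR
# within 5 seconds, else "not writable".
can_write() {
    dir=$1
    probe="$dir/.write-probe.$$"
    # Run the probe under timeout(1) so a stuck filesystem does not block the shell forever.
    if timeout 5 sh -c 'touch "$1" && rm -f "$1"' sh "$probe" 2>/dev/null; then
        echo "writable"
    else
        echo "not writable"
    fi
}

can_write /etc/pve
```

A result of "not writable" only says that the write did not succeed in time; the corosync log is still needed to see why.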


Can you try restarting these services:

/etc/init.d/pve-cluster restart
/etc/init.d/pvedaemon restart
/etc/init.d/pvestatd restart
 
I am really starting to question why we even bother to purchase subscriptions. The "Enterprise Repos" have proven to be no more stable than the "Non-Enterprise Repos", and the devs respond when they feel like it. This is clearly a Proxmox issue which should be addressed. I do have the ability to create tickets, but IMO I shouldn't be wasting our tickets on bugs within Proxmox.

I doubt that you waste your time on the support. So far you have received a lot of information and hints for fixing your "not optimal" network and cluster setup. If you are unhappy with the support services, write your complaints to office@.. - the forum is the wrong place for this.
 
Yep, you guys have assisted me with a few issues and I do appreciate it, but let's face it, they were nothing that serious. I never said I waste my time; I simply said I wonder why we purchase subscriptions when the devs respond to issues when they feel like it. They asked for the output of a few things, which I provided, and then never responded.

I don't see how my setup is not optimal. Are we going to start playing that whole "quorum disk is clunky" game again, when it's a fully supported and certified setup per Red Hat engineers? I understand this is Proxmox and not Red Hat, but the majority of it is built on the same concepts.

I do agree this is not the place and I should have kept that remark to myself. At the same time, the majority of this thread is me trying to work through the issue and community members providing input.

I did take offence at the first remark from the Proxmox dev, because I feel that instead of really trying to understand the issue he wanted to blame it on the quorum disk.

Wouldn't it make more sense to provide input and assistance on my issue instead of telling me where the complaints go?
 
I am the forum moderator and I have to make sure that all posts are clean and on topic. As you made off-topic comments, I commented on this. If you need direct contact with the developers, use the commercial support. Here is a short explanation of what you can expect from the different support channels:

http://pve.proxmox.com/wiki/Get_support
 
Understood.

We get a set number of tickets per subscription. I don't feel I should be using our tickets on bugs within Proxmox, which this clearly is. Would I be better off simply filing a bug report? I already asked this in a previous post.
 
If you found a bug in the Proxmox VE code, just file it at https://bugzilla.proxmox.com - no support ticket needed.

Do not forget to add a test case so that our devs can easily reproduce the bug in their test lab.
 
Well, I went to test some of the suggestions and I am unable to reproduce the issue on any of my clusters now. Quite odd. I will update this thread again when it happens and I pin down exactly what is the cause. I appreciate everyone's input.
 
