Hi,
when using Ceph, the removal of an RBD volume might take a moment. That's natural, as Ceph will remove the data.
The problem is: when you perform multiple actions (no matter whether it's creating or removing RBD volumes), as you do when you create / destroy a VM, the different API processes will wait to acquire a lock to do their work.
Unfortunately it happens (reproducibly) quite easily that a process waits to acquire a cfs lock, receives a timeout for this lock request after a while, and gives up.
So far, the story is still fine (even if not nice). But the problem is that the status of this API task will actually become OK (even though it isn't).
So the API reports the action as a success, while this is actually not true. And that's a problem if you cannot rely on an OK from the API.
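Just to illustrate the pattern I mean, here is a rough Python sketch (this is not the actual Proxmox code, and all names in it are made up): if the lock-timeout error is swallowed inside the task, the caller still sees OK, whereas propagating the error would make the task status reflect the failure.

```
import threading

cfs_lock = threading.Lock()

class LockTimeout(Exception):
    pass

def remove_rbd_volume(volume):
    # Stand-in for the actual (slow) Ceph removal.
    print(f"removing {volume}")

def run_task_buggy(volume, timeout=1.0):
    """Swallows the lock timeout, so the caller still sees 'OK'."""
    try:
        if not cfs_lock.acquire(timeout=timeout):
            raise LockTimeout("got timeout on cfs lock request")
        try:
            remove_rbd_volume(volume)
        finally:
            cfs_lock.release()
    except LockTimeout:
        pass           # the error is lost here
    return "OK"        # reported even though nothing was removed

def run_task_fixed(volume, timeout=1.0):
    """Propagates the lock timeout, so the task status shows the failure."""
    if not cfs_lock.acquire(timeout=timeout):
        return "ERROR: got timeout on cfs lock request"
    try:
        remove_rbd_volume(volume)
    finally:
        cfs_lock.release()
    return "OK"

if __name__ == "__main__":
    # Hold the lock from the main thread so both variants hit the timeout.
    cfs_lock.acquire()
    print(run_task_buggy("vm-100-disk-0"))   # prints OK despite the failure
    print(run_task_fixed("vm-100-disk-0"))   # prints the error instead
    cfs_lock.release()
```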
It would be very nice if this could be improved.
Thank you & Greetings
Oliver