Help with TRIM on Virtio SCSI Single

Tmanok

Well-Known Member
Hi Everyone,

I'm wondering whether Virtio SCSI Single supports TRIM in guests. I have observed it not working automatically during disk migration with a hook script, but when the disk controller is set to Virtio SCSI, it works as expected with no other changes. Naturally, I'd prefer to use Virtio SCSI Single due to the performance gains, notably with iothread.

Is this issue known? Does anyone have any solution for this?
Thank you,


Tmanok
 
Hi,
it works on my setup. Please share the output of pveversion -v and qm config <ID> with the ID of your VM. What is the source and target storage of the move operation? What exactly are you doing in the hook script, how do you invoke it? FYI, you can just install the QEMU guest agent inside the VM and then enable the option to trim cloned disks.
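In case it helps, here's a minimal sketch of doing that from the CLI (VM ID 103 is a placeholder; the guest agent package must also be installed and running inside the guest):

```
# inside the guest (Debian/Ubuntu example)
apt install qemu-guest-agent

# on the PVE host: enable the agent and let PVE issue an fstrim
# inside the guest after a disk is moved or cloned
qm set 103 --agent enabled=1,fstrim_cloned_disks=1
```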
 
Hi,

attached is the output you requested. After some more testing, it seems that fstrim works only once on a freshly started VM (tested Debian 11 & Arch), regardless of `virtio scsi` or `virtio scsi single`. A second run of `fstrim -v /` yields 0 trimmed bytes. I guess that is expected, but then the underlying storage isn't trimmed. How do we get thin-provisioned storage after a disk move?

Cheers,
Alwin

EDIT: tested with RBD & LVM-thin storage, here I can reproduce it consistently when moving between these two storages.
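For anyone trying to reproduce, roughly this sequence (VM ID 103 and storage name local-lvm are placeholders; the exact move-disk subcommand name varies by PVE version):

```
# inside the guest: the first fstrim after boot works
fstrim -v /                     # e.g. "/: 12.3 GiB trimmed"

# on the PVE host: move the disk to another thin storage
qm move_disk 103 scsi0 local-lvm

# inside the guest again: nothing is trimmed, so the new
# volume stays fully allocated
fstrim -v /                     # "/: 0 B trimmed"
```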
 

Attachments

  • pveversion.txt
  • case.txt
  • UPID_vl-srv1_001E7AFB-00ED5BBE-6408662B_qmmove_103_root@pam.txt
Hi Alwin :)

Thanks, I understand the issue now and am able to reproduce it. To summarize (you already mentioned the first two points):
  • It only happens if an fstrim was already run before the disk move.
  • It doesn't matter whether the controller is VirtIO SCSI, VirtIO SCSI single, or SATA.
  • It's not a recent regression in QEMU; I get the same behavior with pve-qemu-kvm=6.0.0-4 too.
  • It also happens with Debian 9 with kernel 4.9.
  • If I write a large file of zeroes and remove it in the VM, trim will do something again (sketched below).
  • If I issue two fstrim calls right after one another, for the second one, the function scsi_disk_emulate_unmap in QEMU is not even reached.
  • A reboot of the guest makes fstrim work again.
From the last two points I'd guess that this is not a QEMU issue at all, but there is some kind of optimization inside the guest to avoid issuing duplicate trim requests. But the guest cannot know about the underlying disk move, so it cannot know that it shouldn't use that optimization.
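A rough sketch of that zero-file experiment inside the guest (file name and size are arbitrary):

```
# fill some free space with zeroes, then free it again; freeing the
# blocks clears ext4's in-memory "already trimmed" state for them
dd if=/dev/zero of=/zerofile bs=1M count=1024
sync
rm /zerofile
fstrim -v /    # reports trimmed bytes again
```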
 
Yup. I changed the filesystem from ext4 to xfs, and with xfs it works.

Do you have any good pointers how to get the old behavior back? :)

Cheers,
Alwin

EDIT: maybe I just never paid attention to it. :rolleyes: But the post in the link explains it.
https://serverfault.com/questions/1...return-same-value-unlike-ext4/1113129#1113129
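Not a real fix, but since the flag apparently only lives in memory, a full unmount/mount cycle should reset it for non-root filesystems (an assumption based on the linked explanation, not something I've verified); for the root filesystem a guest reboot is the confirmed route:

```
# sketch for a non-root ext4 filesystem; /mnt/data is a placeholder
umount /mnt/data
mount /mnt/data        # assumes an fstab entry; mounting rebuilds the
                       # in-memory block-group state
fstrim -v /mnt/data    # should trim again (untested assumption)
```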
 
Good find! So it really is a guest optimization (in particular ext4) and there's nothing we can do, except ask kernel devs to make it optional.
 
Given that EXT4 is the default and most supported file system for Linux servers, is this something PVE can talk "shop" with the kernel devs about?

Thanks, Fiona! I'd do it myself, but I feel hopelessly unsure about how to communicate the issue upstream.


Tmanok
 
Hi,
there is https://bugzilla.kernel.org/, but often you'll have more luck writing a mail to the relevant mailing list and maintainers. In this case that's linux-ext4@vger.kernel.org; see EXT4 FILE SYSTEM in https://www.kernel.org/doc/linux/MAINTAINERS.

There was already a brief discussion about this once: https://lore.kernel.org/all/20211025094227.yio3cjpboxumt5ml@work/; it probably doesn't hurt to mention that thread either.
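If you have a kernel tree checked out, the get_maintainer.pl script gives you the addresses to CC for a given file; fs/ext4/mballoc.c (where, as far as I can tell, the fstrim tracking lives) is a reasonable starting point:

```
# from the root of a Linux kernel source checkout
./scripts/get_maintainer.pl fs/ext4/mballoc.c
# prints the ext4 maintainers and linux-ext4@vger.kernel.org
```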
 
