Moving container on LVM thin volume to smaller storage

wbk · Mar 11, 2026

Hi all,

I have a container on a thinly provisioned LVM volume. I want to extract the disk that backs the thin volume out of the system running Proxmox, but run into the issue that thin volumes don't support shrinking.

After moving stuff out of the way, there is a lot of air in the container, but the thin volume is, of course, still the original size. Some (approximate) numbers :

PV / VG backing the thin pool : 7,3 TiB
- Thin pool : 4,5 TiB
- The containers thin volume : 3,6 TiB
Container :
- Size of the file system : 3,6 TiB
- Actual storage used: 0,5 TiB
Available target for moving : 3,5 TiB disk

It's quite close, but the target disk is a few tens of GB short.

I ran a short test to see whether Proxmox refuses to move the container to a target that is too small. It will diligently start moving.

What are my options, while also minimizing downtime?

I have backups, but I suspect restoring such a backup will run into the same problem eventually as moving the volume (to be tested, after I have a backup of the new smaller size)
The container used about 3,3 TiB before moving data elsewhere. The last 0,3 TiB are, I presume, never used or claimed by LVM. I would not lose extends that got data if the container got cut off at 3,5 TiB, but LVM will probably stress over losing integrity
File level move to a new container may work
- create a new 0,7 TiB container on target storage
- shutdown both containers
- mount the filesystems on the host
- rsync from the existing container to the new one
- comes with downtime
I don't really have an option at the moment to put larger disks in the system, nor is the container expected to grow to its old size anymore

My plan is to:

shutdown the container, resize2fs to shrink the filesystem
create a backup of the container with the large amount of unused space
restore to the smaller HDD on a regular logical volume
in case the restore does not fail, continue
fsck.ext4 the file system (it's probably confused about the missing tail of the filesystem)
check whether the container boots, shutdown
resize2fs the file system (shrink to some 0,7 TiB)
lvresize -r the volume to some 0,7 TiB
see whether the container still boots, shutdown
rsync the delta from the running original running container since creating the backup
shutdown both containers, rsync once more
boot the new container

Can be said upfront that this is not going to work? In case it might work, which part is highest risk? Suggestions or improvements?

wbk · Mar 11, 2026

Update, unexpected failure at step 1,

wbk said:
My plan is to:

shutdown the container, resize2fs to shrink the filesystem

While logged in to the host, with the container shut down, I resized the FS in the LV to 600 GB:

Bash:

# fsck.ext4 /dev/mapper/allerlei-vm--104--disk--0
e2fsck 1.47.0 (5-Feb-2023)
/dev/mapper/allerlei-vm--104--disk--0: clean, 5198085/244908032 files, 113790776/979632128 blocks
# resize2fs -p -z 20260311_resize2fs-device.e2undo /dev/mapper/allerlei-vm--104--disk--0 600G
resize2fs 1.47.0 (5-Feb-2023)
Overwriting existing filesystem; this can be undone using the command:
    e2undo 20260311_resize2fs-device.e2undo /dev/mapper/allerlei-vm--104--disk--0

Please run 'e2fsck -f /dev/mapper/allerlei-vm--104--disk--0' first.
# e2fsck -f /dev/mapper/allerlei-vm--104--disk--0
e2fsck 1.47.0 (5-Feb-2023)
Pass 1: Checking inodes, blocks, and sizes
Inode 46012422 extent tree (at level 2) could be narrower.  Optimize<y>? no
Inode 46039626 extent tree (at level 2) could be narrower.  Optimize<y>? no
Pass 2: Checking directory structure
Pass 3: Checking directory connectivity
Pass 4: Checking reference counts
Pass 5: Checking group summary information

Booting the container afterwards an checking the size of the filesystem, I was in for a surprise:

Code:

# # (in the container)
# df -h
Filesystem                             Size  Used Avail Use% Mounted on
/dev/mapper/allerlei-vm--104--disk--0  3.6T  374G  3.1T  11% /

It still is 3.6 T , with only 374 G used. I had expected the size of the of the filesystem to be 600 G.

I did not know what to expect as values for the LV, but it has not changed; the current value data% for `lvs` is the same as it was before resizing the filesystem

Code:

# lvs
  LV            VG       Attr       LSize  Pool      Origin Data%  Meta%  Move Log Cpy%Sync Convert
  vm-104-disk-0 allerlei Vwi-aot--- <3.65t dunnedata        79.30

wbk · Mar 12, 2026

The following step was unexpectedly successful,

wbk said:
create a backup of the container with the large amount of unused space

I had expected to wait a day for 3 TB to be backed up, but to my surprise the backup was already finished. It had actually only processed data for the size of the file system, not for the size of the partition. That's a bonus in my eyes ;-)

Step after that,

wbk said:
restore to the smaller HDD on a regular logical volume

not so much:

Code:

Recovering backed-up configuration from 'mt_pbs_localbaks_online:backup/ct/104/2026-03-11T17:09:27Z'
TASK ERROR: unable to restore CT 101 - lvcreate 'online/vm-101-disk-0' error:   Volume group "online" has insufficient free space (953861 extents): 956672 required.

Not _totally_ to my surprise, the backup still expects an LV with the correct size.

I'll try the `rsync` route.

Thanks for reading. As before: I'm open to thoughts and suggestions!

Impact · Mar 12, 2026

I didn't read all of it but you can shrink thin volumes, just not the thin pool itself. Here's some outputs from when I demonstrated this to someone

Bash:

# lvresize -r -L 8G pve/vm-101-disk-0
  File system ext4 found on pve/vm-101-disk-0.
  File system size (20.00 GiB) is larger than the requested size (8.00 GiB).
  File system reduce is required using resize2fs.
  File system fsck will be run before reduce.
  Reducing file system ext4 to 8.00 GiB (8589934592 bytes) on pve/vm-101-disk-0...
e2fsck /dev/pve/vm-101-disk-0
/dev/pve/vm-101-disk-0: Inode 1688 extent tree (at level 1) could be shorter.  IGNORED.
/dev/pve/vm-101-disk-0: Inode 15990 extent tree (at level 1) could be shorter.  IGNORED.
/dev/pve/vm-101-disk-0: 22805/1310720 files (0.3% non-contiguous), 297383/5242880 blocks
e2fsck done
resize2fs /dev/pve/vm-101-disk-0 8388608k
resize2fs 1.47.2 (1-Jan-2025)
Resizing the filesystem on /dev/pve/vm-101-disk-0 to 2097152 (4k) blocks.
The filesystem on /dev/pve/vm-101-disk-0 is now 2097152 (4k) blocks long.

resize2fs done
  Reduced file system ext4 on pve/vm-101-disk-0.
  Size of logical volume pve/vm-101-disk-0 changed from 20.00 GiB (5120 extents) to 8.00 GiB (2048 extents).
  Logical volume pve/vm-101-disk-0 successfully resized.

# pct fsck 101
fsck from util-linux 2.41
/dev/mapper/pve-vm--101--disk--0: clean, 22805/524288 files, 245989/2097152 blocks

Before doing this make a backup, run pct fstrim CTID and pct fsck CTID, then shut the CT down. After resizing fsck again and issue a pct rescan.
This depends a lot on the storage so the recommended/universal way is to restore a backup while setting the size.
I also described how to do this with ZFS here.

wbk · Mar 12, 2026

Hi Impact,

Great! Thank you for mentioning, I was under the impression (as you probably guessed) that the LV itself was grow-only as well.

I'm at the moment not able to test it right away, but I'll post feedback after I tried!

wbk · Mar 12, 2026

Shortly after I had the opportunity to log in and kick off the fstrim, than realized it would be handy to chain fsck:

Code:

# pct fstrim 104
^C
# pct fstrim 104 && pct fsck 104
CT is locked (fstrim)

It's nine hours later now. I realize that fsck wouldn't have worked out on the running container anyway, but I'd expected the container to be unlocked by now.

Belatedly, from

Code:

man pct/code] for fstrim
[QUOTE]
pct fstrim <vmid> [OPTIONS]

Run fstrim on a chosen CT and its mountpoints, except bind or read-only mountpoints.
[/QUOTE]

Apart from the root volume, there are two NFS mounts. Supposedly root on the host does not have anything to say about the FS underlying these mountpoints on another server (they're not on SSD or thinly provisioned volumes, so they wouldn't have an effect either way). 

Reading the man description though, it may have got stuck on that: they're mountpoints, not bind- or read-only mountpoints. 

There's not any visible disk activity. With SSDs I would be less surprised, as the command could have been kicked down the line until it is executed by the SSD itself. On my thin pool, it's all in the open and involves (I suppose) direct disk actions with visible I/O and at least some CPU impact. 

There was an ever so slight rise in server load around the time I executed the command. As a check, I temporarily mounted the container volume directly on the host, and am running [code]fstrim /mnt/container

on that directly.
That sees a similar but lower and shorter moderate increase in load. The process is ready within a minute. It leads me to conclude that the lock on the container is a stale lock, perhaps caused by me starting the trim twice and loading an fsck task on top.

Impact said:
This depends a lot on the storage so the recommended/universal way is to restore a backup while setting the size.

I'm not parsing this sentence, could you elaborate? Nice set of hints and tips by the way!

Impact · Mar 13, 2026

One way is to restore the backup via the CLi and give a specific size for the rootfs like this. Another way is to lie to the system by editing the size in the config file before creating the backup and then restoring it.
As for the lock I'd try something like pkill fstrim and pct unlock 104. I don't use bind mounts but I wouldn't expect them to cause an issue with fstrim/fsck but maybe it makes sense to remove them until this process is done.

wbk · Mar 13, 2026

Thanks again!

pkill fstrim runtime less than a second, no output
pct unlock 104 slight pauze, ok with no output
PVE had a as stale mount of the volume in /var/lib/lxc/104/roofs, manually unmounted
pve fsck 104 ran less than a minute, states "clean"
lvresize -tvr -L 400G /dev/mapper/allerlei-vm--104--disk--0 , returns fsadm: Filesystem has not been checked after the last mount, using fsck -f
After a few tries, I figured it needed fsck.ext4 -f to force check of clean filesystem
now lvresize goes through. The command writes quite heavily to the HDD. As I had moved (mostly recent) data off disk, and had earlier shrunk the filesystem, I had supposed there would not be a large amount of extents at the end of the LV. By implication, the shrinking takes quite a bit longer than I had anticipated.

So far everything looks fine, but in retrospect I'd only resized by a few hundred of GB, in order to be able to move the volume without so much downime. As it is, I have no idea how long it's going to take

If it actually needs to write all emptied extents or something, with 3,6 TB --> 0,4 TB that's 3,2 TB of write actions on a single HDD VG, and though there are 'peaks' of 20 MB/s, mostly it has been hovering at 2-4 MB/s for the last six hours.

wbk · Mar 13, 2026

Slight worry is setting in.

The transaction is still running. It put an exclusive lock on everything LVM, and also prevents the volume from being mounted read-only.

I never had trouble shrinking an LV before, but this is the first time it involves thin pools.

Without any feedback from the system, I have no idea what it is doing, how much longer it is going to take, or whether it is safe to cancel the transaction.

Besides the transaction taking a long time and preventing me from bringing the container bringing back online, the PVE summary for the node also stopped reporting statistsics.

I've taken the screenshot while being logged in to the other on-line node, because the node itself does not display any graphs. It's not vital, but surprising.

Less convenient is that the status of all storage on the node is unknown (remote as well as local). As a result, the GUI does not allow me to restore a backup. CLI pvesm scan pbs hostname username --fingerprint <cert fingerprint from PBS> --password users-pass returns 'bad request'

Logging tells me pveproxy is dealing with something, but I don't know how to match it with the symptoms:

Code:

# journalctl -f
Mar 13 23:39:39 verjaardag pveproxy[1146625]: proxy detected vanished client connection
Mar 13 23:40:03 verjaardag pveproxy[1143582]: proxy detected vanished client connection
Mar 13 23:40:40 verjaardag pveproxy[1146625]: proxy detected vanished client connection

When I try mounting the volume read-only, I'm told that's not possible now. dmesg tells me:

Code:

[741222.704823] /dev/mapper/allerlei-vm--104--disk--0: Can't open blockdev

Any help or hints appreciated! Foremost: any way to forecast the duration of the transaction? Safe to cancel?

wbk · Mar 14, 2026

Wohoo!

The shrink-operation (crosspost for details) has completed, stats are returning, backup is running, and I'm winding down.

Impact · Mar 14, 2026

I never tested this with a volume this large or with HDDs. The daemon that updates the GUI's status and graphs doesn't handle slow/missing storage that well and can get stuck. I don't know for sure why it would take so long. All I can think of right now (besides SMR which your WD80EDAZ isn't) is that maybe the chunk size isn't optimal for a HDD. You can see it with lvs -o+chunksize.

wbk · Mar 14, 2026

Mostly things are fine, though a bit on the slower side, with HDDs. Migration to faster aggregated storage is the reason for the exercise in the first place.

Unfortunately, I woke up to a failed backup, and running fsck gives quite a few suggestions for repair. As to be expected, reported storage use does not match actual usage (it's off by a few 10s of GB over the current 400 GB filesystem).

The container does start; as I don't have time to further look into it this morning, I kicked off another attempt at backup while I let te container run for further investigation later today.

Code:

# fsck.ext4 /dev/mapper/allerlei-vm--104--disk--0                                                                                                              
e2fsck 1.47.0 (5-Feb-2023)                                                                                                                                                      
/dev/mapper/allerlei-vm--104--disk--0 contains a file system with errors, check forced.                                                                                        
Pass 1: Checking inodes, blocks, and sizes                                                                                                                                      
Inode 132733 extent tree (at level 2) could be narrower.  Optimize<y>? no                                                                                                      
Inode 132748 extent tree (at level 2) could be narrower.  Optimize<y>? no                                                                                                      
Inode 159169, end of extent exceeds allowed value                                                                                                                              
        (logical block 5888, physical block 71922477, len 1)                                                                                                                    
Clear<y>? no                                                                                                                                                                    
Inode 159169 has an invalid extent                                                                                                                                              
        (logical block 5888, invalid physical block 740668928, len 1)                                                                                                          
Clear<y>? no                                                                                                                                                                    
Inode 159169, end of extent exceeds allowed value                                                                                                                              
        (logical block 9216, physical block 71913749, len 1)                                                                                                                    
Clear<y>? no                                                                                                                                                                    
Inode 159169 has an invalid extent                                                                                                                                              
        (logical block 9216, invalid physical block 740660224, len 1)                                                                                                          
Clear<y>? no                                                                                                                                                                    
Inode 159169, end of extent exceeds allowed value                                                                                                                              
        (logical block 17508, physical block 71932283, len 1)                                                                                                                  
Clear<y>? no                                                                                                                                                                    
Inode 159169 has an invalid extent                                                                                                                                              
        (logical block 17508, invalid physical block 740680804, len 1)                                                                                                          
Clear<y>? no                                                                                                                                                                    
Inode 159169 extent tree (at level 2) could be narrower.  Optimize<y>? no                                                                                                      
Inode 159169, i_blocks is 2482320, should be 2482272.  Fix<y>? no                                                                                                              
Inode 462897 extent tree (at level 2) could be narrower.  Optimize<y>? no                                                                                                      
Inode 1667381 extent tree (at level 2) could be narrower.  Optimize<y>?                                                                                                        
/dev/mapper/allerlei-vm--104--disk--0: e2fsck canceled.                                                                                                                        
                                                                                                                                                                               
/dev/mapper/allerlei-vm--104--disk--0: ********** WARNING: Filesystem still has errors **********

Not correcting errors just yet until after a backup; a couple of years back an fsck gave me a similar pattern and after repair the (previously running fine) filesystem was all but empty.

Moving container on LVM thin volume to smaller storage

wbk

Renowned Member

wbk

Renowned Member

wbk

Renowned Member

Impact

Distinguished Member

wbk

Renowned Member

wbk

Renowned Member

Impact

Distinguished Member

wbk

Renowned Member

wbk

Renowned Member

wbk

Renowned Member

Impact

Distinguished Member

wbk

Renowned Member

We value your privacy