Are there any caveats on using "Metadata Detection Mode"?

Hi there,
I am running PBS to back up some CTs and VMs running on PVE. As the storage holds over 1.5 TB and sits on classical, rotating HDDs, a backup job runs nearly 6 hours. I used the default settings for the backup job, with "Default" as the "Change detection mode".
Now I tried "Metadata" as the change detection mode, and the backup was (from the second run onwards) much faster.

This brings me to the question of whether using "Metadata" as the change detection mode is safe, or whether there are any caveats that could kill my backups. As this seems much better from the perspective of speed, I wonder whether not reading all blocks could mean losing data.

Can you give me a hint on that? Thanks a lot!
 
Hi,
This brings me to the question of whether using "Metadata" as the change detection mode is safe, or whether there are any caveats that could kill my backups. As this seems much better from the perspective of speed, I wonder whether not reading all blocks could mean losing data.
there is no danger of "killing" your backup. But there are some caveats:
  • Files might be reused instead of re-encoded if the file data changed but the metadata did not. E.g. if a file was edited, but the file size remained the same and the mtime of the file was restored after the change, the change detection mode will see this as an unchanged file and reuse it (a minimal sketch follows this list). That is why we also provide the change detection mode data, which always reads all files again.
  • The change detection mode might reuse existing chunks partially, leading to some padding. E.g. a file contained within a single chunk vanished between backup runs, but the chunk is reused. This additional padding can become wasted space once the previous snapshot actually referencing that file is pruned. This also has implications for sensitive data; please see the notes in https://pbs.proxmox.com/docs/maintenance.html#pruning. The client tries to minimize this by aligning chunk boundaries with file boundaries when possible and by re-encoding smaller files in some situations, even though they might not have changed.
  • Restore times can be slower if there is additional padding, which the client has to download in any case because it is part of a chunk.
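
To make the first caveat concrete, here is a minimal sketch (the path and timestamps are made up for illustration): a file edited in place, whose size stays constant and whose mtime is restored afterwards, looks unchanged to metadata mode.

    # inside the CT: note the file's current mtime and size
    stat -c '%y %s' /data/app/config.bin

    # overwrite 16 bytes in place; the size does not change, but the mtime does
    dd if=/dev/urandom of=/data/app/config.bin bs=1 count=16 conv=notrunc

    # restore the previously noted mtime
    touch -d '2025-01-03 10:00:00' /data/app/config.bin

    # size and mtime now match the last snapshot, so metadata mode would
    # reuse the old file entry and the edit would not land in the new backup

If you suspect such in-place edits, running the job (or an occasional extra backup) with change detection mode data forces all file contents to be read again.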
 
One caveat I'm wondering about: my scheduled jobs run with metadata change detection mode, but when I hit "Backup now" for a manual backup, I believe that runs in default mode, which breaks the chain and causes the backups to take longer and use more space, correct? Is there some way to make manual backups on Proxmox use metadata change detection mode?
 
I do have that option when manually backing up a CT
Make sure you are using a recent PVE version and that the target storage is a PBS datastore.
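
For manual backups from the shell, a hedged sketch (assuming a recent PVE where vzdump supports the pbs-change-detection-mode option; check man vzdump on your version, and note the storage name here is made up):

    # one-off manual backup of CT 101 to a PBS storage with metadata mode
    vzdump 101 --storage my-pbs --pbs-change-detection-mode metadata

    # or make it the default for all vzdump runs by adding a line
    # to /etc/vzdump.conf:
    #   pbs-change-detection-mode: metadata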
 
Ah, I see it on containers but not on VMs. Is it just not implemented yet for VMs? And is there a workaround?
 
Exactly, this does not apply to VMs. VMs are backed up at the block level, not the filesystem level. For VMs there is the dirty bitmap maintained by the QEMU process, which indicates which blocks of the disk image have changed since the last backup. Therefore VM backups do not need this change detection mode; they are already pretty fast :)
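
You can see the dirty bitmap at work in the backup task log of a running VM: as long as the bitmap is still valid (the VM was not stopped or migrated in between), a recent PVE logs a line roughly like the following, and only the dirty blocks are read and uploaded.

    INFO: using fast incremental mode (dirty-bitmap), 1.2 GiB dirty of 32.0 GiB total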
 
Hi,

there is no danger of "killing" your backup. But there are some caveats:
  • Files might be reused instead of re-encoded if the file data changed but the metadata did not. E.g. if a file was edited, but the file size remained the same and the mtime of the file was restored after the change, the change detection mode will see this as an unchanged file and reuse it. That is why we also provide the change detection mode data, which always reads all files again.


Hi,
I can see this happening with regular files (editing a text file without changing its mtime), but is this even possible with backups of LXCs/VMs?
 
Hi,
I can see this happening with regular files (editing a text file without changing its mtime), but is this even possible with backups of LXCs/VMs?
Yes, this can happen if the mtime is deliberately restored to its previous value after changing a file's content (which updates the mtime), for a file located on the host or container filesystem being backed up. But this is not typically the case, unless done on purpose or by some scripting/tooling. Also, the size must be unchanged as well; differences in size are detected.

But to emphasize this again: the change detection mode only concerns host and LXC backups, not VM backups. VM backups are handled differently (at the block level), the client being completely agnostic to the contents of the block device. There might not even be a filesystem on the block device at all.
 
Ah, so when the backup job is set to metadata change detection in the advanced settings, that actually only applies to the containers in the job, not the VMs?