- data is deduplicated between snapshots of the same client (between "snapshot1 from client1" and "snapshot2 from client1" for example)
- data is deduplicated between snapshots of multiple clients (between "snapshot1 from client1" and "snapshot1 from client2" for example)
- data is deduplicated within a single snapshot (within "snapshot1 from client1" if the client has duplicate files on their drive for example)
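The three cases above can be illustrated with a toy content-addressed chunk store. This is only a minimal sketch and not PBS's actual implementation: the `ChunkStore` class, the tiny fixed chunk size and the client/snapshot names are all made up for illustration. The idea is that every chunk is stored under its SHA-256 digest, so identical chunks are kept once no matter which client or snapshot they come from.

```python
import hashlib


class ChunkStore:
    """Toy content-addressed store (hypothetical, for illustration only)."""

    def __init__(self, chunk_size=4):
        self.chunk_size = chunk_size  # unrealistically small, to keep the demo readable
        self.chunks = {}              # digest -> chunk bytes, each unique chunk stored once
        self.snapshots = {}           # (client, snapshot) -> ordered list of digests

    def backup(self, client, snapshot, data):
        """Split data into fixed-size chunks and store only chunks not seen before."""
        index = []
        for offset in range(0, len(data), self.chunk_size):
            chunk = data[offset:offset + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            self.chunks.setdefault(digest, chunk)  # no-op if the chunk already exists
            index.append(digest)
        self.snapshots[(client, snapshot)] = index

    def restore(self, client, snapshot):
        """Rebuild the original data from the snapshot's list of digests."""
        return b"".join(self.chunks[d] for d in self.snapshots[(client, snapshot)])


store = ChunkStore(chunk_size=4)
store.backup("client1", "snapshot1", b"AAAABBBBAAAA")  # "AAAA" repeats within one snapshot
store.backup("client1", "snapshot2", b"AAAABBBBCCCC")  # shares chunks with client1's snapshot1
store.backup("client2", "snapshot1", b"BBBBCCCCDDDD")  # shares chunks with another client

referenced = sum(len(index) for index in store.snapshots.values())
print(f"chunks referenced: {referenced}, chunks actually stored: {len(store.chunks)}")
```

Each snapshot only holds a list of digests, so restoring is a lookup per chunk, and a chunk that appears in several snapshots costs storage only once.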
Hmm, sad, now that you Proxmox guys got me reading about and exploring all this cool stuff around ZFS and its dedup feature.
> it needs to keep the dedup info in memory.
Yes, it needs to cache it in memory to be fast. But you can also store the dedup table on a special device (e.g. an SSD).
> Many setups do not use ZFS
Couldn't you just let PBS create one or more big files on any storage backend and create a zpool with ZFS on top of them (all done on the fly by PBS)?
> if you are not 100% sure it's a perfect fit for your hardware and workload combination, the recommendation is still to not enable ZFS deduplication anywhere.
Yes, I can agree with that after I experimented a bit with it... I just wonder what you'd do better in your implementation to not run into the same problems. Different block sizes / better block alignment?
Thx for the comprehensive answers!
Hi, please don't hijack threads. I would suggest this question should go in a thread of its own.
Is there any way to quickly recover a VM from backups in PBS?
With ZFS snapshots and replication working as backup, I can simply copy the snapshot on the backup server to a new dataset name and create a new VM from it. This whole process can finish within 10 minutes.
With PBS, I can map a disk backup to a loopback device using proxmox-backup-client, mount this device and access the files, but I can't boot a VM from it. I can restore the backup to a new VM, but all the data needs to be read and written, which costs a lot of time. Suppose I have a VM with 20TB of data and the read and restore speed from backup is 100MB/s; the restore then needs more than 58 hours, which is definitely not fast recovery.
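As a rough check of that estimate, here is a small sketch of the arithmetic. It assumes the restore is limited purely by a constant throughput, and it computes the figure for both binary units (20 TiB at 100 MiB/s) and decimal units (20 TB at 100 MB/s), since the post does not say which is meant. The helper name `restore_hours` is made up for illustration.

```python
def restore_hours(size_bytes, rate_bytes_per_s):
    """Hours needed to move size_bytes at a constant rate (no overhead modeled)."""
    return size_bytes / rate_bytes_per_s / 3600


TIB, MIB = 1024 ** 4, 1024 ** 2  # binary units
TB, MB = 10 ** 12, 10 ** 6       # decimal units

hours_binary = restore_hours(20 * TIB, 100 * MIB)
hours_decimal = restore_hours(20 * TB, 100 * MB)

print(f"binary units:  {hours_binary:.1f} hours")
print(f"decimal units: {hours_decimal:.1f} hours")
```

The "more than 58 hours" figure matches the binary-unit reading; with decimal units it comes out a bit under 56 hours. Either way it is more than two days.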
Is there any way to improve VM recovery speed?
Thanks.