Well im back, I guess when their is tech issues at hand, I just can't help myself. And that why my job is head Technical Analyst.
Anyway.
I've just done a test run using 7zip, and I set the format to gzip, fastest.
This uses just the one core, here on my 2500 mips laptop, and I get 42 mb/s
tar + gzip is a known standard
tar + 7zip is not a standard
I'm not against using some other compression format but using a standard format does have some benefits.
Fast is nice but it is not the only item to take into consideration.
I can take my Proxmox backup file over to any other virtualization platform and use readily available utilities to read the data and restore vms.
Sure it might be difficult and clunky, but the fact I CAN do it is a big benefit.
In my opinion, there should be two compression options
1) Fast: Potential to actually improve backup performance, because it reduces bandwidth required to the backup device. (Especially true if two backups were running at once, provided multiple cores are available.)
2) Good: For those more concerned about storage space on the backup device.
If you need speed there are currently other solutions such as pigz.
I've been using pigz for over a year now and my backups complete in a timely manner.
Yes more improvements can be made and the Proxmox team has been looking into other ideas for some time.
As for vmtar needing to scan twice: If you get compression going full speed, then you dont need to treat the file as sparse, because compression will simply wrap up all the zeroes.
The purpose of the scan in vmtar is to identify the sparse areas.
Once identified it creates the tar archive in a manner that allows the tar utility to restore the file including the sparse areas.
This way the restored file is the same size as the original with the sparse areas remaining intact.
Also, this reduces the size of the archive WITHOUT needing compression.
That being said I am not 100% sure if the Proxmox utilities used to restore actually use this feature.
And again, I really cant see what possibly is the technical need for vmtar to scan twice to handle sparse files. Perhaps this is needed for tape destinations? But I am not at all familar with the linux structure of tools used for vm backup.
Speculating as to why this is needed adds nothing to the conversation.
You could find the answer in about 2 minutes if you bothered to search google.
Scroll to the bottom of the page and read the last few paragraphs:
http://sunsite.ualberta.ca/Documentation/Gnu/tar-1.13/html_node/tar_125.html