There's a interessant post where a user compare xfs with full clearing filesystem cache vs. zfs with nearly emtpy arc by max it to 4GB:
https://www.reddit.com/r/DataHoarder/comments/1f4ghzr/testing_xfs_vs_zfs_speed_hashing_700k_files/
If you compare xfs filesystem cache vs. arc by :
"cd /usr ; tar cf /<xfs-mount>/<dir-anywhere>/os.tar * /etc ; time cat /<xfs-mount>/<dir-anywhere>/os.tar >/dev/null" and
"cd /usr ; tar cf /<zfs-pool>/<dir-anywhere>/os.tar * /etc ; time cat /<zfs-pool>/<dir-anywhere>/os.tar >/dev/null"
you will see nearly same performance difference as in nearly cache-less test from user above - intelligence of arc here or there, it's slower.
So I'm wondering myself why the user is wondering itself about the difference while a system which has more "to_do" by checksums would be any time faster.
More work has it's price which add latency in the I/O path, you could zfs mirror where the price is capacity, doing raidz/draid where it's performance again
or take more disks in more vdevs then the price are the additional disks needed but data checksums (even in cow) are not for free.
For sure checksums are really nice and the value could be decided by everyone itself.
https://www.reddit.com/r/DataHoarder/comments/1f4ghzr/testing_xfs_vs_zfs_speed_hashing_700k_files/
If you compare xfs filesystem cache vs. arc by :
"cd /usr ; tar cf /<xfs-mount>/<dir-anywhere>/os.tar * /etc ; time cat /<xfs-mount>/<dir-anywhere>/os.tar >/dev/null" and
"cd /usr ; tar cf /<zfs-pool>/<dir-anywhere>/os.tar * /etc ; time cat /<zfs-pool>/<dir-anywhere>/os.tar >/dev/null"
you will see nearly same performance difference as in nearly cache-less test from user above - intelligence of arc here or there, it's slower.
So I'm wondering myself why the user is wondering itself about the difference while a system which has more "to_do" by checksums would be any time faster.
More work has it's price which add latency in the I/O path, you could zfs mirror where the price is capacity, doing raidz/draid where it's performance again
or take more disks in more vdevs then the price are the additional disks needed but data checksums (even in cow) are not for free.
For sure checksums are really nice and the value could be decided by everyone itself.