Hi,
During our weekly verify one week ago on a PBS datastore "datastore2" on PBS 1.1.13-2 machine "backup2" we got 351 errors "blob too small (0 bytes)". The verify job renamed the zero sized blob to ".0.bad":
This datastore2 is a daily remote sync of "datastore1" on PBS 1.1.13-2 machine "backup1", on this machine the corresponding chunk seem to be there with a non zero size, here is the first one:
Last week-end verify on datastore2 was successfull, probably it didn't recheck those problematic zero sized chunks.
Is there a way to force resync of those zero sized chunks?
I can do it manually but I wonder if there's a better way to make sure a primary and remote are well in sync.
Note: datastore1 and datastore2 are both about 10 TB. The underlying ZFS have scrubbed with zero error.
During our weekly verify one week ago on a PBS datastore "datastore2" on PBS 1.1.13-2 machine "backup2" we got 351 errors "blob too small (0 bytes)". The verify job renamed the zero sized blob to ".0.bad":
Code:
# extract from LOGFILE=/var/log/proxmox-backup/tasks/...:
2021-11-07T13:33:39+01:00: verify datastore2:vm/10000/2021-06-18T21:30:01Z
2021-11-07T13:33:39+01:00: check qemu-server.conf.blob
2021-11-07T13:33:39+01:00: check drive-scsi0.img.fidx
2021-11-07T13:37:24+01:00: can't verify chunk, load failed - store 'datastore2', unable to load chunk '01dbac89d42d6d75d5d43878d5d3142d86941e082c41b7b2c268b3aef42c2264' - blob too small (0 bytes).
2021-11-07T13:37:24+01:00: corrupted chunk renamed to "/mnt/datastore/datastore2/.chunks/01db/01dbac89d42d6d75d5d43878d5d3142d86941e082c41b7b2c268b3aef42c2264.0.bad"
root@backup2:~# ls -l /mnt/datastore/datastore2/.chunks/01db/01dbac89d42d6d75d5d43878d5d3142d86941e082c41b7b2c268b3aef42c2264.0.bad
-rw-r--r-- 1 backup backup 0 Oct 13 17:51 /mnt/datastore/datastore2/.chunks/01db/01dbac89d42d6d75d5d43878d5d3142d86941e082c41b7b2c268b3aef42c2264.0.bad
root@backup2:~# ls -l /mnt/datastore/datastore2/.chunks/01db/01dbac89d42d6d75d5d43878d5d3142d86941e082c41b7b2c268b3aef42c2264
ls: cannot access '/mnt/datastore/datastore2/.chunks/01db/01dbac89d42d6d75d5d43878d5d3142d86941e082c41b7b2c268b3aef42c2264': No such file or directory
root@backup2:~# grep "blob too small" $LOGFILE|wc -l
351
This datastore2 is a daily remote sync of "datastore1" on PBS 1.1.13-2 machine "backup1", on this machine the corresponding chunk seem to be there with a non zero size, here is the first one:
Code:
root@backup1:~# ls -l /mnt/datastore/datastore1/.chunks/01db/01dbac89d42d6d75d5d43878d5d3142d86941e082c41b7b2c268b3aef42c2264*
-rw-r--r-- 1 backup backup 2948386 Jun 18 14:09 /mnt/datastore/datastore1/.chunks/01db/01dbac89d42d6d75d5d43878d5d3142d86941e082c41b7b2c268b3aef42c2264
Last week-end verify on datastore2 was successfull, probably it didn't recheck those problematic zero sized chunks.
Is there a way to force resync of those zero sized chunks?
I can do it manually but I wonder if there's a better way to make sure a primary and remote are well in sync.
Note: datastore1 and datastore2 are both about 10 TB. The underlying ZFS have scrubbed with zero error.