[SOLVED] Help with replication/migration problem

stefanm · Sep 2, 2020

Hi,
I am facing a problem after the replication between two nodes hung up. I am now seeing the following error when trying to migrate the two containers which were being replicated when the replication stopped working. Please bear with me, I am long time Linux user, but new to zfs. Can you please provide a hint how to safely remove the stale copy of the VM?

This is the error I get (IP and names removed)

Bash:

2020-09-02 21:36:37 starting migration of CT 310 to node 'XX' (xxx.xxx.xxx.xxx)
2020-09-02 21:36:37 found local volume 'local-zfs:subvol-310-disk-0' (in current VM config)
2020-09-02 21:36:38 full send of rpool/data/subvol-310-disk-0@__migration__ estimated size is 2.42G
2020-09-02 21:36:38 total estimated size is 2.42G
2020-09-02 21:36:38 rpool/data/subvol-310-disk-0    name    rpool/data/subvol-310-disk-0    -
2020-09-02 21:36:38 volume 'rpool/data/subvol-310-disk-0' already exists
2020-09-02 21:36:38 TIME        SENT   SNAPSHOT rpool/data/subvol-310-disk-0@__migration__
2020-09-02 21:36:38 command 'zfs send -Rpv -- rpool/data/subvol-310-disk-0@__migration__' failed: got signal 13
send/receive failed, cleaning up snapshot(s)..
2020-09-02 21:36:39 ERROR: storage migration for 'local-zfs:subvol-310-disk-0' to storage 'local-zfs' failed - command 'set -o pipefail && pvesm export local-zfs:subvol-310-disk-0 zfs - -with-snapshots 0 -snapshot __migration__ | /usr/bin/ssh -e none -o 'BatchMode=yes' -o 'HostKeyAlias=XX' root@xxx.xxx.xxx.xxx -- pvesm import local-zfs:subvol-310-disk-0 zfs - -with-snapshots 0 -delete-snapshot __migration__ -allow-rename 0' failed: exit code 255
2020-09-02 21:36:39 aborting phase 1 - cleanup resources
2020-09-02 21:36:39 ERROR: found stale volume copy 'local-zfs:subvol-310-disk-0' on node 'XX'
2020-09-02 21:36:39 start final cleanup
2020-09-02 21:36:39 ERROR: migration aborted (duration 00:00:02): storage migration for 'local-zfs:subvol-310-disk-0' to storage 'local-zfs' failed - command 'set -o pipefail && pvesm export local-zfs:subvol-310-disk-0 zfs - -with-snapshots 0 -snapshot __migration__ | /usr/bin/ssh -e none -o 'BatchMode=yes' -o 'HostKeyAlias=XX' root@xxx.xxx.xxx.xxx -- pvesm import local-zfs:subvol-310-disk-0 zfs - -with-snapshots 0 -delete-snapshot __migration__ -allow-rename 0' failed: exit code 255
TASK ERROR: migration aborted

Thanks a lot for your help, it is highly appreciated!
Best regards!

stefanm · Sep 2, 2020

Nevermind, I decided to be brave and try out what I thought should be the proper solutions and I seem to be lucky today

Replication and migration are working again.
Anyway, thanks for listening.

aaron · Sep 3, 2020

Good to hear that you solved the problem. Would you be so kind to quickly share what it was and how you solved it so other people who find this thread can get check if theirs is the same?

stefanm · Sep 3, 2020

Yes, of course. As the error message indicates there are leftovers from the broken replication process. I wasn't sure if I can delete those directly from the filesystem without messing Proxmox up, but as it seems, it works.
I deleted the remains on the target server with:

Code:

zfs unmount rpool/data/subvol-310-disk-0
rm -rf /zpool/data/subvol-310-disk-0
zfs destroy -r rpool/data/subvol-310-disk-0

After that replication and migration worked again.

greg · Mar 27, 2023

Is there a possibility to resume the replication without erasing all the data on the target side (and therefore re-sending a lot of data)?

Thanks in advance

Regards

fabian · Mar 27, 2023

greg said:
Is there a possibility to resume the replication without erasing all the data on the target side (and therefore re-sending a lot of data)?

Thanks in advance

Regards

please post output of "pveversion -v", the (failed) replication log and "zfs list -t all -r VOLUME" for all volumes on both source and target node (and indicate which is which

)

Search

Search

[SOLVED] Help with replication/migration problem

stefanm

Member

stefanm

Member

aaron

Proxmox Staff Member

stefanm

Member

greg

Renowned Member

fabian

Proxmox Staff Member

We value your privacy