[SOLVED] Help with replication/migration problem

Dec 10, 2019
10
0
6
Hi,
I am facing a problem after the replication between two nodes hung up. I am now seeing the following error when trying to migrate the two containers which were being replicated when the replication stopped working. Please bear with me, I am long time Linux user, but new to zfs. Can you please provide a hint how to safely remove the stale copy of the VM?

This is the error I get (IP and names removed)
Bash:
2020-09-02 21:36:37 starting migration of CT 310 to node 'XX' (xxx.xxx.xxx.xxx)
2020-09-02 21:36:37 found local volume 'local-zfs:subvol-310-disk-0' (in current VM config)
2020-09-02 21:36:38 full send of rpool/data/subvol-310-disk-0@__migration__ estimated size is 2.42G
2020-09-02 21:36:38 total estimated size is 2.42G
2020-09-02 21:36:38 rpool/data/subvol-310-disk-0    name    rpool/data/subvol-310-disk-0    -
2020-09-02 21:36:38 volume 'rpool/data/subvol-310-disk-0' already exists
2020-09-02 21:36:38 TIME        SENT   SNAPSHOT rpool/data/subvol-310-disk-0@__migration__
2020-09-02 21:36:38 command 'zfs send -Rpv -- rpool/data/subvol-310-disk-0@__migration__' failed: got signal 13
send/receive failed, cleaning up snapshot(s)..
2020-09-02 21:36:39 ERROR: storage migration for 'local-zfs:subvol-310-disk-0' to storage 'local-zfs' failed - command 'set -o pipefail && pvesm export local-zfs:subvol-310-disk-0 zfs - -with-snapshots 0 -snapshot __migration__ | /usr/bin/ssh -e none -o 'BatchMode=yes' -o 'HostKeyAlias=XX' root@xxx.xxx.xxx.xxx -- pvesm import local-zfs:subvol-310-disk-0 zfs - -with-snapshots 0 -delete-snapshot __migration__ -allow-rename 0' failed: exit code 255
2020-09-02 21:36:39 aborting phase 1 - cleanup resources
2020-09-02 21:36:39 ERROR: found stale volume copy 'local-zfs:subvol-310-disk-0' on node 'XX'
2020-09-02 21:36:39 start final cleanup
2020-09-02 21:36:39 ERROR: migration aborted (duration 00:00:02): storage migration for 'local-zfs:subvol-310-disk-0' to storage 'local-zfs' failed - command 'set -o pipefail && pvesm export local-zfs:subvol-310-disk-0 zfs - -with-snapshots 0 -snapshot __migration__ | /usr/bin/ssh -e none -o 'BatchMode=yes' -o 'HostKeyAlias=XX' root@xxx.xxx.xxx.xxx -- pvesm import local-zfs:subvol-310-disk-0 zfs - -with-snapshots 0 -delete-snapshot __migration__ -allow-rename 0' failed: exit code 255
TASK ERROR: migration aborted

Thanks a lot for your help, it is highly appreciated!
Best regards!
 
Nevermind, I decided to be brave and try out what I thought should be the proper solutions and I seem to be lucky today :)
Replication and migration are working again.
Anyway, thanks for listening.
 
Good to hear that you solved the problem. Would you be so kind to quickly share what it was and how you solved it so other people who find this thread can get check if theirs is the same?
 
Yes, of course. As the error message indicates there are leftovers from the broken replication process. I wasn't sure if I can delete those directly from the filesystem without messing Proxmox up, but as it seems, it works.
I deleted the remains on the target server with:
Code:
zfs unmount rpool/data/subvol-310-disk-0
rm -rf /zpool/data/subvol-310-disk-0
zfs destroy -r rpool/data/subvol-310-disk-0
After that replication and migration worked again.
 
Is there a possibility to resume the replication without erasing all the data on the target side (and therefore re-sending a lot of data)?

Thanks in advance

Regards
 
Is there a possibility to resume the replication without erasing all the data on the target side (and therefore re-sending a lot of data)?

Thanks in advance

Regards
please post output of "pveversion -v", the (failed) replication log and "zfs list -t all -r VOLUME" for all volumes on both source and target node (and indicate which is which ;))
 

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!