The VM shows a replication error (how can I solve it?)

Jul 21, 2022
Replication Log

2022-07-21 09:33:00 100-0: start replication job
2022-07-21 09:33:00 100-0: guest => VM 100, running => 3114591
2022-07-21 09:33:00 100-0: volumes => SCME-REP-DS01:vm-100-disk-0
2022-07-21 09:33:01 100-0: freeze guest filesystem
2022-07-21 09:33:04 100-0: create snapshot '__replicate_100-0_1658385180__' on SCME-REP-DS01:vm-100-disk-0
2022-07-21 09:33:04 100-0: thaw guest filesystem
2022-07-21 09:33:05 100-0: using secure transmission, rate limit: none
2022-07-21 09:33:05 100-0: full sync 'SCME-REP-DS01:vm-100-disk-0' (__replicate_100-0_1658385180__)
2022-07-21 09:33:06 100-0: full send of SCME-REP-DS01/vm-100-disk-0@__replicate_100-0_1655900100__ estimated size is 84.0G
2022-07-21 09:33:06 100-0: send from @__replicate_100-0_1655900100__ to SCME-REP-DS01/vm-100-disk-0@__replicate_100-0_1658385180__ estimated size is 17.1G
2022-07-21 09:33:06 100-0: total estimated size is 101G
2022-07-21 09:33:06 100-0: volume 'SCME-REP-DS01/vm-100-disk-0' already exists
2022-07-21 09:33:06 100-0: warning: cannot send 'SCME-REP-DS01/vm-100-disk-0@__replicate_100-0_1655900100__': signal received
2022-07-21 09:33:06 100-0: warning: cannot send 'SCME-REP-DS01/vm-100-disk-0@__replicate_100-0_1658385180__': Broken pipe
2022-07-21 09:33:06 100-0: cannot send 'SCME-REP-DS01/vm-100-disk-0': I/O error
2022-07-21 09:33:06 100-0: command 'zfs send -Rpv -- SCME-REP-DS01/vm-100-disk-0@__replicate_100-0_1658385180__' failed: exit code 1
2022-07-21 09:33:06 100-0: delete previous replication snapshot '__replicate_100-0_1658385180__' on SCME-REP-DS01:vm-100-disk-0
2022-07-21 09:33:06 100-0: end replication job with error: command 'set -o pipefail && pvesm export SCME-REP-DS01:vm-100-disk-0 zfs - -with-snapshots 1 -snapshot __replicate_100-0_1658385180__ | /usr/bin/ssh -e none -o 'BatchMode=yes' -o 'HostKeyAlias=pve02' root@10.10.154.2 -- pvesm import SCME-REP-DS01:vm-100-disk-0 zfs - -with-snapshots 1 -snapshot __replicate_100-0_1658385180__ -allow-rename 0' failed: exit code 255
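
The decisive message in the log is "volume 'SCME-REP-DS01/vm-100-disk-0' already exists": a full sync was attempted (so source and target apparently no longer share a common replication snapshot), but the volume is already present on the target, and a full receive cannot overwrite an existing dataset. A quick way to confirm the mismatch (a sketch, reusing the dataset name and target address from the log above):

# on the source node: which replication snapshots does the disk have?
zfs list -rt snapshot -o name SCME-REP-DS01/vm-100-disk-0

# on the target node (pve02): does the dataset exist there, and with which snapshots?
ssh root@10.10.154.2 zfs list -rt snapshot -o name SCME-REP-DS01/vm-100-disk-0

If the second command shows the dataset but no __replicate_100-0_* snapshot that also exists on the source, neither an incremental nor a full send can go through.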
 
Please check whether there were any storage or pool problems on the nodes at the time, e.g. with 'zpool status', 'dmesg', etc.
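
For example (a sketch; the pool name and time window are taken from the log above, run on both nodes):

# pool health, including read/write/checksum error counters
zpool status -v SCME-REP-DS01

# recent kernel warnings and errors (I/O problems, ZFS messages)
dmesg --level=err,warn | tail -n 50

# system journal around the time the job failed
journalctl --since "2022-07-21 09:30" --until "2022-07-21 09:40"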
 
2022-07-21 09:33:06 100-0: delete previous replication snapshot '__replicate_100-0_1658385180__' on SCME-REP-DS01:vm-100-disk-0
Was there some issue at an earlier point? Is there more than one replication job configured for this VM, for example two jobs to two different nodes?
In my experience, the snapshots can stop matching after HA has recovered such a VM. In that case, removing the failing replication job and recreating it should help. The first run is a full sync, so it will take some time again.
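
A minimal sketch of that recovery path on the CLI (assuming the job ID 100-0 and target node pve02 from the log; the same steps are available in the GUI on the VM's Replication tab, and the schedule below is only an example):

# list all configured replication jobs and their last result
pvesr status
# (the jobs are also listed in /etc/pve/replication.cfg if you want to check for duplicates)

# remove the failing job; this normally also schedules removal of the replicated data on the target
pvesr delete 100-0

# if a stale copy of the disk remains on the target, remove it there by hand
# WARNING: run this only against the replica on the target node (pve02), never against the source disk
ssh root@10.10.154.2 zfs destroy -r SCME-REP-DS01/vm-100-disk-0

# recreate the job; the first run will be a full sync again
pvesr create-local-job 100-0 pve02 --schedule '*/15'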
 
