Can't Stop or Remove Replication Job

forbin

Member
Dec 16, 2021
40
7
13
45
On my PVE 8.2.3 server, I have many replication jobs that run fine, but I have one that runs over and over and always fails with the following error in the logs.

2024-09-12T15:30:27.421629-07:00 vmhost52b pvescheduler[342902]: 4803-0: got unexpected replication job error - command 'set -o pipefail && pvesm export zpool0:vm-4803-disk-1 zfs - -with-snapshots 1 -snapshot replicate_4803-0_1726155122 | /usr/bin/ssh -e none -o 'BatchMode=yes' -o 'HostKeyAlias=vmhost52a' -o 'UserKnownHostsFile=/etc/pve/nodes/vmhost52a/ssh_known_hosts' -o 'GlobalKnownHostsFile=none' root@192.168.8.108 -- pvesm import zpool0:vm-4803-disk-1 zfs - -with-snapshots 1 -snapshot replicate_4803-0_1726155122 -allow-rename 0' failed: exit code 255

The GUI has no option to cancel the job, and the Remove button does nothing. The job runs continuously because it always restarts automatically when it fails.

How can I find and kill this job? Also, there is one snapshot for the same VM which has been stuck in a "Delete" status forever. I assume the issues may be related.
 
Hi,

Did you fix your issue? If not, you can find the job PID using `ps aux` command. if you not find please post the output of the following command:

Bash:
cat /etc/pve/replication.cfg