I woke up today to an odd cascade of events. First, one node of a cluster showed gray question marks for all objects. Interestingly, the node showing all gray was also the one whose web console I was connected to. I rebooted the node and that issue resolved itself.

Next, one LXC container showed as locked and would not boot. I ran `pct unlock ###`, which did not help. I then removed the vzdump lines from the lxc.conf, after which the container would at least try to start. At that point it started to complain that one of its bind mounts would not mount.

The container in question has two bind mounts. Removing the failing one lets the container start, so it is definitely an issue with that specific bind mount. The bind mount's source directory exists on the host, I can browse the files in it, and both zpool status and a scrub show no issues. I haven't run any updates for quite some time, so nothing has changed that I can think of. Any ideas how I can get this container to boot with both bind mounts?
Code:
DEBUG conf - ../src/lxc/conf.c:mount_entry:2444 - Mountflags already were 4096, skipping remount
DEBUG conf - ../src/lxc/conf.c:mount_entry:2479 - Mounted "/T340-HDD/bindmounts/downloader" on "/usr/lib/x86_64-linux-gnu/lxc/rootfs/downloader" with filesystem type "none"
ERROR utils - ../src/lxc/utils.c:safe_mount:1221 - Invalid argument - Failed to mount "/T340-HDD/bindmounts/transientdata" onto "/usr/lib/x86_64-linux-gnu/lxc/rootfs/transientdata"
ERROR conf - ../src/lxc/conf.c:mount_entry:2410 - Invalid argument - Failed to mount "/T340-HDD/bindmounts/transientdata" on "/usr/lib/x86_64-linux-gnu/lxc/rootfs/transientdata"
ERROR conf - ../src/lxc/conf.c:lxc_setup:4375 - Failed to setup mount entries
ERROR start - ../src/lxc/start.c:do_start:1275 - Failed to setup container "101"
ERROR sync - ../src/lxc/sync.c:sync_wait:34 - An error occurred in another process (expected sequence number 3)
DEBUG network - ../src/lxc/network.c:lxc_delete_network:4173 - Deleted network devices
ERROR start - ../src/lxc/start.c:__lxc_start:2074 - Failed to spawn container "101"
WARN start - ../src/lxc/start.c:lxc_abort:1039 - No such process - Failed to send SIGKILL via pidfd 16 for process 2744719
startup for container '101' failed
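In case it helps anyone reproduce this, here is a minimal sketch of how the failing bind-mount source could be pulled out of a debug log like the one above and then inspected on the host. The function name `failed_mount_sources` and the log path `/tmp/lxc-101.log` are placeholders of mine, not anything Proxmox provides, and the pattern only assumes the `Failed to mount "SRC" on/onto "DST"` wording seen in this log.

```shell
#!/bin/sh
# Sketch only: extract the host-side source paths of bind mounts that LXC
# reported as failed. Reads a debug log on stdin and matches lines shaped
# like the ERROR lines in the log above:
#   ... Failed to mount "SRC" onto "DST"
#   ... Failed to mount "SRC" on "DST"
# failed_mount_sources is a hypothetical helper name, not a Proxmox/LXC tool.
failed_mount_sources() {
    # Keep only the first quoted path (the source) and drop duplicates,
    # since the same failure is logged by both utils.c and conf.c.
    sed -n 's/.*Failed to mount "\([^"]*\)" on.*/\1/p' | sort -u
}

# Hypothetical usage, assuming the debug log was saved to /tmp/lxc-101.log:
#   failed_mount_sources < /tmp/lxc-101.log | while read -r src; do
#       ls -ld "$src"       # does the source directory exist on the host?
#       findmnt -T "$src"   # which filesystem is actually backing that path?
#   done
```

For the log above this should print only `/T340-HDD/bindmounts/transientdata`, which is the path I would then compare against the working `downloader` bind mount.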