I've got a small Proxmox 5.1 server running 8 LXC containers (Debian).
Randomly, when the server reboots (usually after a power outage), 1 of the 8 containers (always the same) doesn't start. I can start it manually without any problem.
Only difference between this container and the 7 others : it has a second disk mounted.
Looking at the logs, I guess it has something to do with multi mount protection :
(...)
mars 07 19:49:22 saturn pve-guests[5241]: starting CT 1011: UPID:saturn:00001479:000018EF:5C8167B2:vzstart:1011:root@pam:
mars 07 19:49:22 saturn pve-guests[3857]: <root@pam> starting task UPID:saturn:00001479:000018EF:5C8167B2:vzstart:1011:root@pam:
mars 07 19:49:22 saturn pvestatd[3732]: status update time (38.181 seconds)
mars 07 19:49:22 saturn systemd[1]: Starting PVE LXC Container: 1011...
(...)
mars 07 19:49:23 saturn kernel: EXT4-fs warning (device loop1): ext4_multi_mount_protect:324: MMP interval 42 higher than expected, please wait.
(...)
mars 07 19:50:09 saturn kernel: EXT4-fs (loop1): 1 orphan inode deleted
mars 07 19:50:09 saturn kernel: EXT4-fs (loop1): recovery complete
mars 07 19:50:09 saturn kernel: EXT4-fs (loop1): mounted filesystem with ordered data mode. Opts: (null)
mars 07 19:50:09 saturn kernel: EXT4-fs warning (device loop2): ext4_multi_mount_protect:324: MMP interval 42 higher than expected, please wait.
(...)
mars 07 19:50:52 saturn systemd[1]: pve-container@1011.service: Start operation timed out. Terminating.
mars 07 19:50:52 saturn systemd[1]: Failed to start PVE LXC Container: 1011.
mars 07 19:50:52 saturn systemd[1]: pve-container@1011.service: Unit entered failed state.
mars 07 19:50:52 saturn systemd[1]: pve-container@1011.service: Failed with result 'timeout'.
mars 07 19:50:52 saturn pve-guests[5241]: command 'systemctl start pve-container@1011' failed: exit code 1
(...)
mars 07 19:50:54 saturn kernel: EXT4-fs (loop2): 1 orphan inode deleted
mars 07 19:50:54 saturn kernel: EXT4-fs (loop2): recovery complete
mars 07 19:50:54 saturn kernel: EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: noacl
(...)
I have a hunch that if the container 1011 would have waited about 5 seconds more before timing out, then loop2 would have been mounted and the container would have started.
1°/ Is there a way to increase the startup timeout of an LXC container ?
2°/ Any other idea how to solve this ?
Randomly, when the server reboots (usually after a power outage), 1 of the 8 containers (always the same) doesn't start. I can start it manually without any problem.
Only difference between this container and the 7 others : it has a second disk mounted.
Looking at the logs, I guess it has something to do with multi mount protection :
(...)
mars 07 19:49:22 saturn pve-guests[5241]: starting CT 1011: UPID:saturn:00001479:000018EF:5C8167B2:vzstart:1011:root@pam:
mars 07 19:49:22 saturn pve-guests[3857]: <root@pam> starting task UPID:saturn:00001479:000018EF:5C8167B2:vzstart:1011:root@pam:
mars 07 19:49:22 saturn pvestatd[3732]: status update time (38.181 seconds)
mars 07 19:49:22 saturn systemd[1]: Starting PVE LXC Container: 1011...
(...)
mars 07 19:49:23 saturn kernel: EXT4-fs warning (device loop1): ext4_multi_mount_protect:324: MMP interval 42 higher than expected, please wait.
(...)
mars 07 19:50:09 saturn kernel: EXT4-fs (loop1): 1 orphan inode deleted
mars 07 19:50:09 saturn kernel: EXT4-fs (loop1): recovery complete
mars 07 19:50:09 saturn kernel: EXT4-fs (loop1): mounted filesystem with ordered data mode. Opts: (null)
mars 07 19:50:09 saturn kernel: EXT4-fs warning (device loop2): ext4_multi_mount_protect:324: MMP interval 42 higher than expected, please wait.
(...)
mars 07 19:50:52 saturn systemd[1]: pve-container@1011.service: Start operation timed out. Terminating.
mars 07 19:50:52 saturn systemd[1]: Failed to start PVE LXC Container: 1011.
mars 07 19:50:52 saturn systemd[1]: pve-container@1011.service: Unit entered failed state.
mars 07 19:50:52 saturn systemd[1]: pve-container@1011.service: Failed with result 'timeout'.
mars 07 19:50:52 saturn pve-guests[5241]: command 'systemctl start pve-container@1011' failed: exit code 1
(...)
mars 07 19:50:54 saturn kernel: EXT4-fs (loop2): 1 orphan inode deleted
mars 07 19:50:54 saturn kernel: EXT4-fs (loop2): recovery complete
mars 07 19:50:54 saturn kernel: EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: noacl
(...)
I have a hunch that if the container 1011 would have waited about 5 seconds more before timing out, then loop2 would have been mounted and the container would have started.
1°/ Is there a way to increase the startup timeout of an LXC container ?
2°/ Any other idea how to solve this ?
Last edited: