Lost container

SIVedecom

New Member
Jan 30, 2023
3
0
1
Hi,
I have a container that worked perfectly fine until last week. Is is now impossible to do anything with it, from console or terminal.
It seems like it cant reach the storage, when i go into the GUI and try to see the content section of this storage point it just stay loading forever :
1714139073213.png
Here are the informations about the conf:

pveversion :
Code:
proxmox-ve: 6.1-2 (running kernel: 5.3.13-1-pve)
pve-manager: 6.1-5 (running version: 6.1-5/9bf06119)
pve-kernel-5.3: 6.1-1
pve-kernel-helper: 6.1-1
pve-kernel-5.0: 6.0-11
pve-kernel-5.3.13-1-pve: 5.3.13-1
pve-kernel-5.0.21-5-pve: 5.0.21-10
pve-kernel-5.0.21-2-pve: 5.0.21-7
pve-kernel-5.0.15-1-pve: 5.0.15-1
ceph: 14.2.6-pve1
ceph-fuse: 14.2.6-pve1
corosync: 3.0.2-pve4
criu: 3.11-3
glusterfs-client: 5.5-3
ifupdown: 0.8.35+pve1
ksm-control-daemon: 1.3-1
libjs-extjs: 6.0.1-10
libknet1: 1.13-pve1
libpve-access-control: 6.0-5
libpve-apiclient-perl: 3.0-2
libpve-common-perl: 6.0-9
libpve-guest-common-perl: 3.0-3
libpve-http-server-perl: 3.0-3
libpve-storage-perl: 6.1-3
libqb0: 1.0.5-1
libspice-server1: 0.14.2-4~pve6+1
lvm2: 2.03.02-pve3
lxc-pve: 3.2.1-1
lxcfs: 3.0.3-pve60
novnc-pve: 1.1.0-1
proxmox-mini-journalreader: 1.1-1
proxmox-widget-toolkit: 2.1-1
pve-cluster: 6.1-2
pve-container: 3.0-15
pve-docs: 6.1-3
pve-edk2-firmware: 2.20191127-1
pve-firewall: 4.0-9
pve-firmware: 3.0-4
pve-ha-manager: 3.0-8
pve-i18n: 2.0-3
pve-qemu-kvm: 4.1.1-2
pve-xtermjs: 3.13.2-1
qemu-server: 6.1-4
smartmontools: 7.0-pve2
spiceterm: 3.1-1
vncterm: 1.6-1
zfsutils-linux: 0.8.2-pve2

the pct conf file :

Code:
arch: amd64
cores: 2
hostname: d1os4
memory: 6144
net0: name=eth0,bridge=vmbr0,gw=example,hwaddr=example,ip=example,tag=5,type=veth
onboot: 1
ostype: ubuntu
rootfs: SATORI:vm-104-disk-0,size=50G
swap: 2048
lxc.apparmor.profile: unconfined
lxc.cgroup.devices.allow: a
lxc.cap.drop:

the storage.cfg file :
Code:
dir: local
        path /var/lib/vz
        content backup,iso,vztmpl

lvmthin: local-lvm
        thinpool data
        vgname pve
        content images,rootdir

rbd: SATORI
        content rootdir,images
        krbd 0
        pool SATORI


i tried to get more infos with these commands that i found online ;

root@pmx12:/var/log/lxc# lxc-start -lDEBUG -o lxc104start.log -F -n 104
lxc-start: 104: tools/lxc_start.c: main: 279 Container is already running


root@pmx12:/var/log/lxc# cat lxc104start.log
lxc-start 104 20240426134912.241 DEBUG commands - commands.c:lxc_cmd_rsp_recv:168 - Response data length for command "get_init_pid" is 0
lxc-start 104 20240426134912.241 DEBUG commands - commands.c:lxc_cmd_rsp_recv:168 - Response data length for command "get_state" is 0
lxc-start 104 20240426134912.241 DEBUG commands - commands.c:lxc_cmd_get_state:630 - Container "104" is in "RUNNING" state
lxc-start 104 20240426134912.241 ERROR lxc_start - tools/lxc_start.c:main:279 - Container is already running


root@pmx12:/var/log/lxc# df -T
Filesystem Type 1K-blocks Used Available Use% Mounted on
udev devtmpfs 65805112 0 65805112 0% /dev
tmpfs tmpfs 13165924 1465052 11700872 12% /run
/dev/mapper/pve-root ext4 98559220 18670776 74838896 20% /
tmpfs tmpfs 65829608 67488 65762120 1% /dev/shm
tmpfs tmpfs 5120 0 5120 0% /run/lock
tmpfs tmpfs 65829608 0 65829608 0% /sys/fs/cgroup
/dev/sdi2 vfat 523248 312 522936 1% /boot/efi
tmpfs tmpfs 65829608 24 65829584 1% /var/lib/ceph/osd/ceph-7
tmpfs tmpfs 65829608 24 65829584 1% /var/lib/ceph/osd/ceph-6
tmpfs tmpfs 65829608 24 65829584 1% /var/lib/ceph/osd/ceph-5
tmpfs tmpfs 65829608 24 65829584 1% /var/lib/ceph/osd/ceph-4
/dev/fuse fuse 30720 56 30664 1% /etc/pve
exampleip:/volume2/VM-Backup nfs 52477854592 30518762752 21959091840 59% /mnt/pve/VM-Backup
exampleip:/volume2/VM-Backup-Incre-S nfs 52477854592 30518762752 21959091840 59% /mnt/pve/Quotidienne
exampleip:/volume1/Logitheque nfs 52477854592 30518762752 21959091840 59% /mnt/pve/NFS-MIGRATION
exampleip:/volume1/cluster-backup nfs 52477854592 30518762752 21959091840 59% /mnt/pve/cluster-backup
tmpfs tmpfs 13165920 0 13165920 0% /run/user/0
exampleip:/volume1/cluster-backup/dump test nfs 52477854592 30518762752 21959091840 59% /mnt/pve/NFS-MIGRATION
exampleip:/volume1/cluster-backup/dump test nfs 52477854592 30518762752 21959091840 59% /mnt/pve/Quotidienne
exampleip:/volume1/cluster-backup/dump test nfs 52477854592 30518762752 21959091840 59% /mnt/pve/VM-Backup
exampleip:/volume1/cluster-backup/dump test nfs 52477854592 30518762752 21959091840 59% /mnt/pve/BACKUPS-CLUSTER


root@pmx12:/var/log/lxc# pct enter 104
lxc-attach: 104: attach.c: lxc_attach: 1136 No such file or directory - Failed to attach to mnt namespace of 580059

As you can see there is probably a storage problem, but i really dont know how to solve this, if someone could help it would be awesome.
Do you guys know how to get it back, or how to create a new ct from this one ?

PS : i know not upgrading pve sucks, but it is what it is, and i have to deal with it unfortunately :(

Thanks,
 
Hi,

Do you have anyother CT on the same `SATORI` storage?

I would mount the CT using this command `pct mount <CTID>` This may give you direct access to inspect its storage.


You can also check the syslog to see if there is any interesting error message.
 
  • Like
Reactions: SIVedecom
Hi, thanks for the reply,

We can see that the storage is online here :

1714384047716.png

I tried to mount it, but it seems like it is already in mounted status :
1714381938385.png

When trying to stop the container i get this result :
1714382840987.png


Should i delete or modify the /run/lock/lxc/pve-config-104.lock file (it is empty) ?

I dont have any error in the /var/log/syslog file.

Here is the content of the /var/log/lxc/lxc104start.log :
Code:
lxc-start 104 20240426134912.241 DEBUG    commands - commands.c:lxc_cmd_rsp_recv:168 - Response data length for command "get_init_pid" is 0
lxc-start 104 20240426134912.241 DEBUG    commands - commands.c:lxc_cmd_rsp_recv:168 - Response data length for command "get_state" is 0
lxc-start 104 20240426134912.241 DEBUG    commands - commands.c:lxc_cmd_get_state:630 - Container "104" is in "RUNNING" state
lxc-start 104 20240426134912.241 ERROR    lxc_start - tools/lxc_start.c:main:279 - Container is already running
lxc104start.log (END)

the /var/log/lxc/lxc-monitord.log :

Code:
root@pmx12:/var/log/lxc# less lxc-monitord.log
lxc-monitord 20190930194546.519 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2129 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20191001072351.810 INFO     monitor - monitor.c:lxc_monitor_sock_name:212 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20191001072351.821 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2145 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20191001134833.842 INFO     monitor - monitor.c:lxc_monitor_sock_name:212 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20191001134833.849 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2100 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20191001154151.768 INFO     monitor - monitor.c:lxc_monitor_sock_name:212 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20191001154151.772 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2104 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20191002092548.509 INFO     monitor - monitor.c:lxc_monitor_sock_name:212 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20191002092548.179 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2154 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20191003081400.207 INFO     monitor - monitor.c:lxc_monitor_sock_name:212 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20191003081400.216 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2133 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20191123135844.976 INFO     monitor - monitor.c:lxc_monitor_sock_name:212 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20191123135844.992 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2168 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20200121122018.524 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20200121122018.524 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 1487800 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20200123200123.798 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20200123200123.904 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2226 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20201121120319.175 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20201121120319.192 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2188 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20211108074148.450 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20211108074148.450 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2208 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20220223102719.162 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20220223102719.166 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2178 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20220610132210.337 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20220610132210.370 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2155 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20220721073756.611 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20220721073756.620 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2358 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20221121084605.559 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20221121084605.566 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2376 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20221220101324.983 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20221220101325.101 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2174 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230404084222.775 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230404084222.797 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2114 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230411095211.654 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230411095211.662 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2138 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230411102442.652 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230411102442.668 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2136 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230411103405.837 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230411103405.860 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2172 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230411104406.698 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230411104406.707 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2160 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230904131331.530 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230904131331.601 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2119 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230905075917.846 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230905075917.861 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2301 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230911093850.913 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230911093850.928 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2352 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230911123703.104 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230911123703.122 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2119 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20230912083236.835 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20230912083236.841 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2176 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20231106085207.585 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20231106085207.602 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 1928 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20231218094908.612 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20231218094908.633 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2227 is now monitoring lxcpath /var/lib/lxc
lxc-monitord 20240118075356.459 INFO     monitor - monitor.c:lxc_monitor_sock_name:213 - Using monitor socket name "lxc/ad055575fe28ddd5//var/lib/lxc" (length of socket name 33 must be <= 105)
lxc-monitord 20240118075356.467 NOTICE   lxc_monitord - cmd/lxc_monitord.c:main:472 - lxc-monitord with pid 2160 is now monitoring lxcpath /var/lib/lxc
(END)


I was thinking about rebooting the whole cluster but am unsure. Do you think i should do so ?

Do you have an idea about wich log files i could check to get more info ?

Is there something i can do to restore the container from snapshots ?


Thanks, have a nice day !
 

Attachments

  • 1714382469797.png
    1714382469797.png
    14.6 KB · Views: 3
Last edited: