Hi all,
Going through the motions to get CephFS as a shared mount point for a set of VMs.
- Created a new CephFS called 'docker' using Proxmox UI
- Added a ceph user called 'docker' with rw access to fs 'docker'
- set up VM with /etc/ceph/ containing basic ceph.conf and keyring for 'docker' user
- create /mnt/ceph folder for the new mount
- add mount details using fuse.ceph for fstab and run mount -a
mount point is listed successfully, and I can go to the mount, create a file, ls contents, everything is great!
Then an intermittent time later I'll try to use the client mount point and it:
- takes a really long time and then throws the error 'Cannot send after transport endpoint shutdown'
- using 'ceph osd blocklist ls' on one of the hosts shows the client as blocked.
- remove it from the blocklist and the client immediately comes back.
The time can literally be 1-2 days or as short as 30 seconds.
I've tried a few different logs to try and find a cause and everything looks like normal operations.
Can anyone suggest a next step for me to look at?
Going through the motions to get CephFS as a shared mount point for a set of VMs.
- Created a new CephFS called 'docker' using Proxmox UI
- Added a ceph user called 'docker' with rw access to fs 'docker'
- set up VM with /etc/ceph/ containing basic ceph.conf and keyring for 'docker' user
- create /mnt/ceph folder for the new mount
- add mount details using fuse.ceph for fstab and run mount -a
mount point is listed successfully, and I can go to the mount, create a file, ls contents, everything is great!
Then an intermittent time later I'll try to use the client mount point and it:
- takes a really long time and then throws the error 'Cannot send after transport endpoint shutdown'
- using 'ceph osd blocklist ls' on one of the hosts shows the client as blocked.
- remove it from the blocklist and the client immediately comes back.
The time can literally be 1-2 days or as short as 30 seconds.
I've tried a few different logs to try and find a cause and everything looks like normal operations.
Can anyone suggest a next step for me to look at?