Hey, I had the same problem as you. This is how I fixed it. My cluster has 39 nodes and was never stable: nodes kept showing a question mark or rebooting.
1. Shut down all of the nodes.
2. Start 3 nodes first, then after a few minutes start the rest one by one.
pvecm status
root@g8kvm03:~# pvecm status
Cluster information
-------------------
Name: AW-G8-KVM
Config Version: 56
Transport: knet
Secure auth: on
Quorum information
------------------
Date: Mon Sep 14 09:03:37 2020
Quorum provider: corosync_votequorum...
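The "start 3 nodes, wait, then add the rest" step can be backed by a small check that waits for quorum before the next node is powered on. This is only a sketch: it assumes the `pvecm status` output contains a `Quorate: Yes` line (the output above is cut off before that point), and `STATUS_CMD` is a hypothetical override added here so the loop can be tried away from a cluster.

```shell
#!/bin/sh
# Returns 0 if the status text on stdin reports a quorate cluster.
# Assumption: pvecm status prints a line starting with "Quorate:" followed by Yes/No.
is_quorate() {
    grep -Eq '^Quorate:[[:space:]]*Yes'
}

# Poll the cluster status until it is quorate or the attempts run out.
# $1 = number of attempts (default 30), $2 = seconds between attempts (default 10).
# STATUS_CMD is a hypothetical override; by default "pvecm status" is used.
wait_for_quorum() {
    attempts="${1:-30}"
    delay="${2:-10}"
    i=0
    while [ "$i" -lt "$attempts" ]; do
        if ${STATUS_CMD:-pvecm status} 2>/dev/null | is_quorate; then
            echo "quorate after $i retries"
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    echo "no quorum after $attempts attempts" >&2
    return 1
}
```

Running `wait_for_quorum` on one of the first three nodes before starting node four, and again before each later node, gives a simple gate for the one-by-one startup.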
I've changed /etc/pve/ceph.conf. In the beginning everything was in the same network, 10.0.141.x/24; now I have changed the Ceph cluster network to 10.98.141.0/24.
[global]
auth_client_required = cephx
auth_cluster_required = cephx
auth_service_required = cephx
cluster_network = 10.98.141.0/24...
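After editing the file it can help to print just the network settings and compare them on every node. The helper below is a sketch, not a Proxmox or Ceph tool: `show_ceph_networks` is a hypothetical name, and it assumes plain `key = value` lines. It also looks for `public_network`, which is a standard Ceph option but is not visible in the truncated config above.

```shell
#!/bin/sh
# Print the cluster/public network settings from a ceph.conf-style file.
# $1 = path to the config file (defaults to /etc/pve/ceph.conf).
show_ceph_networks() {
    conf="${1:-/etc/pve/ceph.conf}"
    awk -F'=' '
        # Match "cluster_network", "cluster network", "public_network", "public network".
        /^[ \t]*(cluster|public)[_ ]network[ \t]*=/ {
            key = $1; val = $2
            # Trim surrounding blanks from key and value.
            gsub(/^[ \t]+|[ \t]+$/, "", key)
            gsub(/^[ \t]+|[ \t]+$/, "", val)
            print key "=" val
        }
    ' "$conf"
}
```

Called without arguments on a node, it prints one `key=value` line per network setting found in `/etc/pve/ceph.conf`.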
Hi, I have added the new Ceph cluster network. Do I need to restart Ceph on all nodes? Can I just run this command on all 39 nodes?
systemctl restart ceph.target
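If the restart does have to happen everywhere, doing it one node at a time is gentler than hitting all 39 nodes at once. The following is a minimal sketch under a few assumptions: root SSH access between nodes, a 60 second pause that is only a guess, and a hypothetical `DRY_RUN` switch that defaults to printing the commands instead of running them.

```shell
#!/bin/sh
# Restart ceph.target on each given node in turn, stopping at the first failure.
# DRY_RUN=1 (the default) only prints the commands; DRY_RUN=0 executes them.
# PAUSE is the number of seconds to wait between nodes (assumed default: 60).
rolling_restart() {
    if [ "${DRY_RUN:-1}" = "1" ]; then run=echo; else run=; fi
    pause="${PAUSE:-60}"
    for node in "$@"; do
        $run ssh "root@$node" systemctl restart ceph.target || {
            echo "restart failed on $node, stopping" >&2
            return 1
        }
        # Give the daemons time to rejoin before touching the next node.
        $run sleep "$pause"
    done
}
```

For example, `rolling_restart g8kvm03 g8kvm13 g8kvm37` prints the planned commands, and `DRY_RUN=0 rolling_restart g8kvm03 g8kvm13 g8kvm37` runs them. A real run would probably also check `ceph -s` between nodes rather than rely on a fixed pause.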
I cannot enter the folder. Even running chmod 755 cephfs does not help.
root@g8kvm37:/mnt/pve#
root@g8kvm37:/mnt/pve# cd cephfs
-bash: cd: cephfs: Permission denied
root@g8kvm37:/mnt/pve# ls -al
ls: cannot access 'cephfs': Permission denied
total 8
drwxr-xr-x 3 root root 4096 Sep 4 17:27 .
drwxr-xr-x...
There seem to be no errors. The log shows this message frequently and Ceph reports health OK, but running "df -h" hangs.
root@g8kvm37:~# ras-mc-ctl --summary
No Memory errors.
No PCIe AER errors.
No Extlog errors.
No MCE errors.
root@g8kvm37:~#
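Since a plain `df -h` hangs, a probe wrapped in `timeout` makes it possible to test one mount point without losing the shell. This is a sketch: `probe_mount` is a hypothetical helper, it relies on GNU coreutils `timeout` returning exit code 124 when the limit is hit, and if the process is stuck in uninterruptible I/O even `timeout` may not return.

```shell
#!/bin/sh
# Probe a path with a time limit so a stale mount does not block the shell.
# $1 = path to check, $2 = time limit in seconds (default 5).
# Prints "ok", "hung" or "error"; returns 0 only for "ok".
probe_mount() {
    path="$1"
    limit="${2:-5}"
    rc=0
    timeout "$limit" stat "$path" >/dev/null 2>&1 || rc=$?
    case "$rc" in
        0)   echo "ok"; return 0 ;;
        124) echo "hung"; return 2 ;;   # timeout killed the stat call
        *)   echo "error"; return 1 ;;  # e.g. missing path or permission denied
    esac
}
```

On the affected node this would be called as `probe_mount /mnt/pve/cephfs`.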
My cluster shows the errors below. Can anyone help?
Sep 08 18:18:12 g8kvm13 pvestatd[1873]: unable to activate storage 'cephfs' - directory '/mnt/pve/cephfs' does not exist or is unreachable
Sep 08 18:18:22 g8kvm13 pvestatd[1873]: got timeout
Sep 08 18:18:22 g8kvm13 pvestatd[1873]: unable to...
Is there a way to change the Ceph network? I want to put Ceph into another VLAN so that its traffic is no longer in the same VLAN, which might solve my problem.
Before applying this to the production environment I will deploy it to the test environment first. Is that right?
root@backupkvm05:~# pvecm status
Cluster information
-------------------
Name: BackupKvm
Config Version: 13
Transport: knet
Secure auth: on
Quorum information...