I have 3 nodes all same hardware and specs. I was able to install ceph on 2 of my nodes successfully but it only fails on third one and I can't figure why. Any pointers please ?
root@hp2:~# pveceph install
update available package list
start installation
Reading package lists... Done
Building...
I have a 9 node cluster - all of them have same repositories and sources list but only on 1 of the node I am getting this error.
Starting system upgrade: apt-get dist-upgrade
Reading package lists... Done
Building dependency tree
Reading state information... Done
Calculating upgrade...
I accidentally unplugged boot Hard Disk from one of the 11-node ceph cluster.
The hard disk is gone no way to recover.
Now my cluster shows 11 nodes and 1 node in gray.
I know I have to reinstall it from scratch. Just wanted to run this by to make sure I am on right path
1. Remove existing...
This error keeps popping in my Proxmox GUI and everytime I have to manually scrub pgs with "ceph pg deep-scrub 7.da" , why is this happening and how can I prevent/fix it.
pveversion --verbose
proxmox-ve: 6.1-2 (running kernel: 5.3.18-3-pve)
pve-manager: 6.1-8 (running version: 6.1-8/806edfe1)...
I happen to reboot my 11 node cluster yesterday all together, big mistake ! It took almost 24 hours to come back up and get out of reboot loop. Now all nodes are up but no cluster or quorum. Nodes can ping each other fine but still no cluster. Logs/Debugs attached.
root@dellr730-1:~#...
Hello Experts,
I have recently setup new proxmox 6 environment with ceph, I removed 3 osds from one of the node and after that ceph has stopped recovering even though its shows there is Degraded Data Redundancy, I can't figure out why it wouldn't recover these.
~# pveversion...
Hello Experts,
I have been struggling with this Intel X-520-T2 card, I have tried everything bit it just fails to show up, please if anyone can provide some direction or has seen similar issue using this card.
lshw -c network -businfo
Bus info Device Class Description...
Hello experts,
I am in process of setting a brand new cluster with 6.1, I am not able to compile the crushmap, the output file just does not show up, wondering if anyone else has seen it and whats the workaround
root@dellr730-1:~# ls -l
total 8
-rw-r--r-- 1 root root 352 Mar 14 03:18...
Hello Experts,
I have read tons of blogs, posts, forums but I still can't get my head around setting networking for my following setup. I have following physical nodes
4 X Super Micros
Each has 4X1G and 2X10G
4 X HP Proliant 360 G7
Each has 4X1G and 2X10G
3 X Dell R730
Each has 2X10G, 2X1G...
Hello Experts,
Pardon my ignorance.
I am in process of migrating my existing data centre physically to a different location and at the same time adding 3 more nodes. I have done a lot research but can't reach a conclusive answer.
This is what I have right now
4 X SuperMicro Server, 128G RAM...
Hello everyone,
Just wanted to get feedback on using this tune ups, all these valid for any proxmox version any configuration ? I do not understand most of them even though going to the links mentioned, wanted to get some insight if these are useful to do in a prod 4 node cluster with ceph...
I have a 4 node ceph cluster, on each node I see this consistently, The LVM under Disks shows its 97% used, should I be worried, should I increase this space, will it effect my data, can I resize it in production. I do not keep any disk or iso locally, all is stored on rbd and nfs.
~# pvs
PV...
Hello Experts,
I am looking for some advice.
Our VMs need had outgrown our current 4 node cluster setup. We just got 4 new servers different specs than original.
I would like a recommendation on either to add these 4 nodes to same cluster or create a separate cluster. Pros Vs Cons ...
Hello experts,
I am using 4 box cluster running 5.x, when I initially set it up I used single link for cluster communication. Now I intend to change that to a LAG to get more bandwidth for cluster communication.
From what I understand, all I need is to change single link to a LAG, use same IP...
Esteemed members,
I was wondering why can't I use "bridge-vids 1-4094" instead of default "bridge-vids 2-4094" when vlan aware option is enabled on network bridge in network options.
My use case is that my cluster communication is running untagged on same interface but I also want to provide...
To whom it may be useful.
I picked it up this script in the same forum and had modified it a little. It helped me move all my wal/db from HDD to SSD in my case and can be used to replace an older SSD with new SSD with wal/db.
Just pass all OSD names and have a good night sleep. The process is...
Hello all,
I recently decided to use SSD in order to improve performance of my cluster. Here is my cluster setup
4 Nodes
36 HDD X 465 GB / node
CPU(s) 8 x Intel(R) Xeon(R) CPU E5-2609 v2 @ 2.50GHz (2 Sockets) /node
RAM 128GB / node
I wanted to move all my WAL DB to new SSD in order to improve...
Hello All,
I need some recommendation on improving my cluster. I have a 4 node cluster with CEPH having 65 OSDs at 30 TB capacity. My cluster communication is currently at 1G NIC and I dont have any SSDs in my servers.
I recently had issue with disk I/O time with some web-servers applications...
Dear SME,
I had a machine freeze on master node in 4 node HA cluster. One of my new VM migrated to other node but it failed to start since it still had iso in cd rom and other node did not had that iso available locally. The VM went into error state. I took it out of HA but it still wont start...
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.