Search results

  1. P

    Ceph OSD disk replacement

    Hello, I am looking for updated documentation on the correct procedure for Ceph OSD disk replacement. The current Proxmox docs only cover OSD creation and lack management procedures using pveceph commands. I found this Ceph document, but some commands do not give the same output (ie...
  2. P

    smartd false positive SSD CurrentPendingSector?

    Hello, thank you. I updated the drive database and I have not received alerts for days. So should those alerts be considered false positives? Thanks, P.
  3. P

    Issue removing Ceph OSD

    It seems some automated job ran and fixed it: it was set as "OSD.8" again and I could put it back in.
  4. P

    Issue removing Ceph OSD

    Hello, I had to temporarily remove a drive (OSD) from Ceph, but I think I did it wrong. - I set noout - I marked the OSD out, but I saw data redundancy start anyway (I thought setting noout would avoid that). - I stopped the OSD.8 service. Then I pulled the hot-swap bay. When I inserted the disk back, the device...
  5. P

    CPU cores overcommit

    Thank you for the clear explanation. I suggest adding it to the documentation, as it is really useful. I have a Ceph hyper-converged setup. How can I reserve some cores for OSDs/Monitors and prevent VMs from using them? Any updates to this...
  6. P

    CPU cores overcommit

    Hello, what happens if we overcommit, so that the total number of CPU cores assigned to VMs is higher than the number of cores available on the host? Does Proxmox give any alert during assignment? Is the behaviour the same for LXC and KVM? In my case I have 40 cores on the host and 30 LXC containers with 1 core each. What happens if I...
  7. P

    [SOLVED] Adding a second public IP range

    I have just received the range assigned by the ISP and I confirm it works:
    echo 1 > /proc/sys/net/ipv4/ip_forward
    ip route add X.Y.Z.0/24 dev vmbr0
    I created a new LXC container with IP X.Y.Z.10/32 and a gateway in a different prefix, A.B.C.254 (the same as my vmbr0 default gateway). Simpler than I thought. Thank you! P.
  8. P

    [SOLVED] Adding a second public IP range

    Hello Stoiko, thank you for your reply. The Debian page has simple examples with IPs in the same subnet and without a bridge involved. My case involves a bridge and a new subnet with its own gateway address. I think adding the ip route and ip_forward lines to the bridge config could be enough to make it...
  9. P

    [SOLVED] Adding a second public IP range

    Hello, my cluster shares a public IP range assigned to a bridge in a very simple config:
    iface vmbr0 inet static
    # each node has its own IP from within the range
    address 5.X.X.1
    netmask 255.255.255.0
    gateway 5.X.X.254
    bridge_ports eno1
    bridge_stp off
    bridge_fd 0
    I need to add another new IP...
  10. P

    Proxmox cluster scaling best practices

    Thanks a lot! So I can consider scaling from 5 to 9 nodes (and from 10 to 18 OSDs) without any specific worries. Best regards, P.
  11. P

    Proxmox cluster scaling best practices

    Hi Wolfgang, thank you for your reply. My plan was to always use hyper-converged clusters and not to split Ceph from compute, also for easier and more gradual growth. On my 5 nodes I have seen peaks of 20 MB/s and 2000 IOPS on the Ceph network, and I think there is room to grow. One thing is not...
  12. P

    smartd false positive SSD CurrentPendingSector?

    Hello, I am still getting the same messages, but only for one of my Ceph OSDs. I ran smartd tests, which were successful:
    SMART Self-test log structure revision number 1
    Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
    # 1 Extended offline Completed...
  13. P

    Proxmox cluster scaling best practices

    Hello, I have a 5-node PVE cluster with Ceph (2 OSDs per node on Intel DC SSDs, on a dedicated 10G network). I am currently using it for LXC containers, and my idea is to always keep one node empty so containers can be moved there in case of a node failure. I will need to scale up in the near future, probably to a...
  14. P

    Stuck on Bad Kernel? (4.15.18-12)

    Hello, I don't know if it is related, but I see a difference between the kernel updates available in the CLI and in the GUI. The CLI shows:
    pve-kernel-4.15/stable 5.4-1 all [upgradable from: 5.3-3]
    pve-kernel-4.15.18-12-pve/stable 4.15.18-36 amd64 [upgradable from: 4.15.18-35]
    GUI shows...
  15. P

    smartd false positive SSD CurrentPendingSector?

    I have been getting the same error for a couple of days, and it seems to fix itself after an hour.
    Apr 23 14:27:14 proxmox smartd[1495]: Device: /dev/sda [SAT], 1 Currently unreadable (pending) sectors
    ...
    Apr 23 15:27:14 proxmox smartd[1495]: Device: /dev/sda [SAT], No more Currently unreadable (pending)...
  16. P

    Ceph Hyperconverged on Blade servers

    Hello, I was considering a Dell M1000E enclosure with blade servers, but I see that each blade server supports only 2 disks. If I start with a cluster of 4 nodes, I will have only 4 Ceph OSDs (the other 4 disks are for boot). Do you have experience or suggestions for hyper-convergence on blade servers? Thank you. P.
  17. P

    [SOLVED] Change ring address to node in cluster

    I thought so; maybe I missed one. Now it is fixed. Thank you again. Have a nice day ;-)
  18. P

    [SOLVED] Change ring address to node in cluster

    Thank you very much Chris, it worked. In the pvecm status output I see the name of the node I fixed still showing the old IP from the wrong network. Is it possible to fix this too? Also, in the web interface summary for cluster nodes, I see some nodes with "Server Address" set to the public IP and some to the local one, but in...
  19. P

    [SOLVED] Change ring address to node in cluster

    Hello, during cluster creation I forgot to add the ring address for one of the nodes, so it was added on the wrong network. How can I fix this? I was thinking of removing the node and adding it again, but the documentation says removed nodes can't be added back. The node is part of a Ceph hyper-converged...