Search results

  1.

    Unable to create new Ceph monitor - "Invalid IP string"

    After upgrading to 7.2-5, the error indeed disappeared! Have removed my manually created monitors and recreated the monitors via the GUI and can see in the Ceph config that the port numbers are no longer showing for the newly created monitors. (for good measure, have replaced all monitors to...
  2.

    Unable to create new Ceph monitor - "Invalid IP string"

    Thanks for the update. Will keep an eye out for it and at least good to know I haven't missed it somewhere ;)
  3.

    Unable to create new Ceph monitor - "Invalid IP string"

    Hi Fabian, do you know if the patch will be rolled out soon? I did go for the manual route and that did the trick for now, but it seems to cause some trouble when rebooting the box (monitor doesn't start automatically) - putting it back the way Proxmox likes it seems to be the most suitable...
  4.

    Nightly backup causes VM instability

    In case it is useful for anyone else: switching my backup mode from "Snapshot" to "Suspend" seems to have done the trick, and the affected VMs are now more stable.
  5.

    Nightly backup causes VM instability

    Not sure if this topic is still relevant, but I've seen similar issues over the last few months. I've got a few VM load balancers that have become unstable on a regular basis (once a week). After first suspecting the internet line, as well as many other things, I now seem to have also found a...
  6.

    Unable to create new Ceph monitor - "Invalid IP string"

    >> I sent a preliminary patch. Brilliant! Will keep an eye out for it. As long as nothing else goes wrong at the moment, should be fine running off 2 monitors for now. Will update this thread if that's gone to plan. >> You might want to switch your mon_host config line to use spaces as...
  7.

    Unable to create new Ceph monitor - "Invalid IP string"

    >> Thanks, I was now able to reproduce the issue! I'll see about fixing it, as old configurations should of course continue to work. Great! Having a quick look on Ceph's pages, I keep seeing the mon_addr in the config files WITH the port number, so I'll await the outcome of your assessment...
  8.

    Unable to create new Ceph monitor - "Invalid IP string"

    Hi Fabian, Thanks for your help! >> Was this configuration created via Proxmox VE? If yes, approximately what version? Yes, I think it was originally created on version 5.4-ish. I do seem to remember an upgrade of corosync from version 5 to 6 or so that was changing port numbers /...
  9.

    Unable to create new Ceph monitor - "Invalid IP string"

    Sure! /etc/pve/ceph.conf:
      [global]
      auth_client_required = cephx
      auth_cluster_required = cephx
      auth_service_required = cephx
      cluster_network = 192.168.1.0/24
      fsid = 1166d897-f578-49fc-a943-008a01b67dc1
      mon_allow_pool_delete = true...
  10.

    Unable to create new Ceph monitor - "Invalid IP string"

    Hi, I've recently added a couple of new nodes to my system and now want to move the Ceph monitors to the new nodes, but am struggling. When I go to Ceph - Monitor in the Proxmox GUI, click Create and select the new node, it almost instantly gives me the error message "Invalid IP string"...
  11.

    Unable to create Ceph OSD

    In case it's of any use to anyone: I had a similar problem and this thread helped me pinpoint the answer. In my case, a newly added node that wasn't completely new had the wrong Ceph key in a few files. Replacing the key value in the following two files with the key value of an existing cluster...
  12.

    [SOLVED] PVE 5.4-11 + Corosync 3.x: major issues

    Hi all, looks like I've got the same issue as you all - corosync failing randomly (but roughly every 12 - 48 hours), causing various management and connectivity issues. If there are any logs that I can provide to shed more light on the problem, please shout. Couple of things about my cluster...
  13.

    Ceph - OSD's crashing when trying to backfill a specific PG

    Hi Alwin, I've got a nice steaming pile of log files for you! https://www.dropbox.com/s/iykxek4hwqj3sj2/ceph-logs-extended.zip?dl=0 This contains: OSD logs (switched log/mem levels to 20/20; assumed that was the best to choose), Ceph log, Ceph Audit log, Ceph Mon/Mgr/Mds logs. Order of...
  14.

    Ceph - OSD's crashing when trying to backfill a specific PG

    Thanks. I'm just moving the remainder of my VM disks across to the new pool - probably should be finished by tonight/tomorrow morning, after which I'll rebalance the cluster and then enable the logging. My cluster setup; - 4 nodes, 24-32 CPU cores each, 128GB ram each (CPU load normally...
  15.

    Ceph - OSD's crashing when trying to backfill a specific PG

    I think I've tried that last week, but let me indeed try again once the cluster is rebalanced and see what happens. From memory, I think it ended up just crashing the osd's quicker ;) I've started to do that with a few disks - was hoping I could avoid it as there's a good 100 VM's or so, and...
  16.

    Ceph - OSD's crashing when trying to backfill a specific PG

    ...Saying that, I have rebalanced some of the OSD's yesterday to bring them back in (had removed a bunch of them to see if it made a difference last week). It is still rebalancing the cluster and has about 12% of objects misplaced which it is slowly putting back in the right place.
  17.

    Ceph - OSD's crashing when trying to backfill a specific PG

    No, at the moment, they're running only about 10-ish VM's (some websites and some Windows boxes) - all crucial VM's have been migrated to the backup cluster. Normally they're running about 50 to 100-ish VM's that are used for a training lab environment. root@prox7:~# ceph osd df tree ID CLASS...
  18.

    Ceph - OSD's crashing when trying to backfill a specific PG

    Fair suggestion, but already tried that last week. It still has a copy of the data on osd.16, and that one is then trying to replicate its data to the other ones, causing the same results. When I wasn't aware of the norecover/nobackfill flags, the only way I had to stop the OSD's flapping, was...
  19.

    Ceph - OSD's crashing when trying to backfill a specific PG

    Hmm, that did something alright ;) The moment I stopped OSD 14, it also stopped OSD's 23 and 26 - the two it's currently trying to replicate to. Surprisingly OSD 16, another acting drive for this PG and which should have the correct data for this PG, did NOT go down. I've uploaded the log files...
  20.

    Ceph - OSD's crashing when trying to backfill a specific PG

    Oops... :) For completeness, I've attached the full output of "ceph pg 1.3e4 query" Couple of things I've done yesterday evening and this morning; - Moved the VM disk to the different pool (using the Move Disk option in the Hardware section of the VM - Proxmox GUI), and as the different pool...