Search results

  1. I

    [SOLVED] ceph problem - Reduced data availability: 15 pgs inactive

    i can do (i have backup or able to regenerate everything , but it will take time around a week of work). but will it fix the cluster? because everything is marked gray, all lxc\vms are down. perhaps upgrade to latest version might help? 7.1-8 - 7.2 , but it minor dont think it will change...
  2. I

    [SOLVED] ceph problem - Reduced data availability: 15 pgs inactive

    this is one of the first things i did. i also rebooted all the servers in the cluster. what to do from the link. the delete is not looks as a safe option
  3. I

    [SOLVED] ceph problem - Reduced data availability: 15 pgs inactive

    ceph pg 2.b query Error ENOENT: i don't have pgid 2.b ceph pg 2.b list_unfound Error ENOENT: i don't have pgid 2.b
  4. I

    [SOLVED] ceph problem - Reduced data availability: 15 pgs inactive

    HEALTH_WARN 1 MDSs report slow metadata IOs; Reduced data availability: 15 pgs inactive; 504 slow ops, oldest one blocked for 11906 sec, daemons [osd.14,osd.23,osd.27,osd.30,osd.31,osd.7] have slow ops. [WRN] MDS_SLOW_METADATA_IO: 1 MDSs report slow metadata IOs mds.pve-srv3(mds.0): 1 slow...
  5. I

    [SOLVED] ceph problem - Reduced data availability: 15 pgs inactive

    deleted and recreated them. because they fail to start. how i can fix the PG warning? (i have backup for everything) but it don know what is deleted\corrupted. what can cause the cluster instability? all nodes are appear grayed out.
  6. I

    [SOLVED] ceph problem - Reduced data availability: 15 pgs inactive

    proxmox 7.1-8 yesterday i executed a large delete operation on the ceph-fs pool (around 2 TB of data) the operation ended withing few seconds successful (without any noticeable errors). and then the following problem occurred: 7 out of 32 osds went to down and out. trying to set them in and...
  7. I

    Any tips on balancing osd data usage?

    I have the same problem, lowest at 50% and highest at 89. running the command "ceph osd reweight-by-utilization" initiate some re balancing, running it few more times until it looks better. can it be automated ?
  8. I

    [SOLVED] lxc template for ubuntu 22.04

    works. i ll close the question. thanks
  9. I

    [SOLVED] lxc template for ubuntu 22.04

    according to history versions http://download.proxmox.com/images/system/ it can take around two weeks.
  10. I

    [SOLVED] lxc template for ubuntu 22.04

    do-release-upgrade -d . and do-release-upgrade -d -c returns: Checking for a new Ubuntu release New release '22.04' available. Run 'do-release-upgrade' to upgrade to it.
  11. I

    [SOLVED] lxc template for ubuntu 22.04

    not latest but relatively updated proxmox-ve: 7.1-1 (running kernel: 5.13.19-2-pve) pve-manager: 7.1-8 (running version: 7.1-8/5b267f33) pve-kernel-helper: 7.1-6 pve-kernel-5.13: 7.1-5 pve-kernel-5.11: 7.0-10 pve-kernel-5.4: 6.4-6 pve-kernel-5.13.19-2-pve: 5.13.19-4 pve-kernel-5.11.22-7-pve...
  12. I

    [SOLVED] lxc template for ubuntu 22.04

    from 20.04 i did upgrade and this is the output: Reading cache Checking package manager Reading package lists... Done Building dependency tree Reading state information... Done Hit http://archive.ubuntu.com/ubuntu focal InRelease Hit...
  13. I

    [SOLVED] lxc template for ubuntu 22.04

    I know this is a bit early and the version is not final, but i would like to start integrate our system and migration from older Ubuntu to this one, i have a working Ubuntu 20.04 based on standard 20.04 template, but the upgrade did not work. any idea what would be the best practice ?
  14. I

    Does proxmox have integration with ups?

    just bought some UPS to protect against power failure and electrical spikes. the UPS support powerwalker, any tips or best practice how to integrate it ? the ups should have enough capacity to maintain the servers under full load for around 10 minuts and low load for at least double i am...
  15. I

    Unexplained cluster crash after upgrade from 7. -> 7.1-8

    sure, ill post it, to be clear you need journalctl -u corosync -u pve-cluster --since "XXXXX" --until "YYYYYYY" > log_$(hostname) for all servers ? do i need to add anything else?
  16. I

    Unexplained cluster crash after upgrade from 7. -> 7.1-8

    I have tried to change the switch and cable and i could not find any improvement, around once an hour usually at any round hour and 50 minuts (01:50 02:50 .. etch) Dec 26 04:25:57 pve-blade-102 corosync[2238]: [KNET ] link: host: 1 link: 0 is down Dec 26 04:50:56 pve-blade-102...
  17. I

    Unexplained cluster crash after upgrade from 7. -> 7.1-8

    Another event occurred after debug enabled : Dec 23 15:50:00 pve-blade-102 corosync[2238]: [KNET ] link: host: 1 link: 0 is down
  18. I

    Unexplained cluster crash after upgrade from 7. -> 7.1-8

    done , ill post again once we have another error on host 1
  19. I

    Unexplained cluster crash after upgrade from 7. -> 7.1-8

    How i can enable debugging for logs? I think the problematic host is pve-srv-102, ill try to inspect the network cable and card on Sunday, and replace them,
  20. I

    Unexplained cluster crash after upgrade from 7. -> 7.1-8

    looks like it, host1 all the time. the logs you asked for yesterday : journalctl -u corosync -u pve-cluster --since yesterday >/mnt/pve/nfs_home/pve_logs/log_$(hostname) i see that host 1 have errors and sometimes it recovers and when it not the cluster crash initiated host1 had issues in...

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!