Search results

  1. E

    Anyone have KernelCare experience with Proxmox?

    I realize the following is fairly new but I am curious if anyone is using this and can share their experiences with the product? https://www.proxmox.com/en/partners/technology-partner/item/kernel-care After reading the another post on the forum, I guess I am not 100% sure how KernelCare works...
  2. E

    Determine system name from backup lzo

    Is it possible to determine what the name of the KVM virtual machine is from a backup lzo file? Best regards, Eric
  3. E

    Backup data from server without quorum

    I have a server that we are decommissioning and I want to get some data off of it. This server is the only remaining server in a cluster and doesn't have quorum so when I try to do a backup the lock fails because pve is read-only. Is there a way to tell it to ignore the quorum and do the...
  4. E

    Urgent: Proxmox/Ceph Support Needed

    We got the ceph cluster down to 2 replicas. It finished rebuilding but during that process 1 of the 2 remaining ceph nodes marked all of it's OSD's as down. Not sure why it did that but the cluster is still running ok and we are moving stuff off temporarily to local storage instead. The node 3...
  5. E

    Limiting impact of qmrestore

    I've done some large restores and while that process is going on VM's IO slows down dramatically causing issues. What is the best way to limit the impact of a qmrestore on the filesystem it is restoring to? Just use ionice or is there a way to limit the speed of the restore? Best regards, Eric
  6. E

    Urgent: Proxmox/Ceph Support Needed

    Wasim, Can you provide paid ceph support? We still need assistance badly or do you have any suggestions? Best regards, Eric
  7. E

    Urgent: Proxmox/Ceph Support Needed

    Right now we have 3 pools data size/min 3/1 metadata size/min 3/1 rbd size/min 3/1 rbd is the main pool so would I just run the following command? ceph osd pool set rbd size 2 Do I need to do this for data and metadata too? Those have 0% usage. Once this is done, would it be safe to take...
  8. E

    Urgent: Proxmox/Ceph Support Needed

    The rebuilding did not complete but because i marked the OSD's on 3 down. The system seems to have stabilized with the NOOUT option on and shows 33% degraded. I am going to try and figure out what caused this high latency on node 3's OSD's in the first place. If I can't I am just going to...
  9. E

    Urgent: Proxmox/Ceph Support Needed

    Right now I have a paid of Intel 200GB SSD's mirrored RAID 1 as the journal. The other nodes are the same but working...could there be something wrong with SSD? Best regards, Eric
  10. E

    Urgent: Proxmox/Ceph Support Needed

    I am just trying to get the cluster up and running. As you can see the third hypervisor has very high latency. I am thinking once they rebuild, if I take those OFFLINE/OUT and the cluster does not rebuild then I will have 2 nodes up and running normally. Right now HIGH IO is killing all VM's...
  11. E

    Urgent: Proxmox/Ceph Support Needed

    I have put all of the OSD's on the problem HV03 back to UP/IN and it is rebuilding. Should I wait till rebuild is done and then set the NOOUT and take them back out? Here are the commands you requested. This setup has been working great for months. Even during rebuilds. ceph osd tree # id...
  12. E

    Urgent: Proxmox/Ceph Support Needed

    We have 3 nodes with 6 OSDs (4TB/OSD). 1 of the 3 nodes started having high latency (Apply/Commit). We could not find any networking issues or errors. This node was adversely affecting the other 2 nodes because of the high latency even though the health of the ceph cluster was OK. We tried...
  13. E

    Urgent: Proxmox/Ceph Support Needed

    I did this but the rebuild continues probably cause I marked the OUT previously. Should I put them back IN and then OUT to stop the rebuild? Is there a way to change the replicas from 3/1 to 2/1 on the fly? Best regards, Eric
  14. E

    Urgent: Proxmox/Ceph Support Needed

    Hello, Are there any professional paid Proxmox/Ceph support people on the forum who could assist us? Would prefer US based but really need help quickly. Please email me at eric.merkel at sozotechnologies.com of via phone at 317-203-9222 if you can help. Our Ceph cluster has lost 33% of its...
  15. E

    Urgent: Ceph Help Needed

    Thank you that seemed to help. I have a question in regards to "--osd-client-op-priority 63" isn't that already the default? I suppose you were just wanting to make sure they were set correctly? Eric
  16. E

    Urgent: Ceph Help Needed

    Hello, We have a 3 node proxmox/ceph cluster. This morning on one node all of the OSD were marked down/out. Right now all the VM's are up but the IO is getting killed by the recovery of the OSD's and they are basically not responding. I have added the following in /etc/pve/ceph.conf osd max...
  17. E

    glibc GHOST update

    I have a cluster running which has pending updates including the kernel that would require a reboot. Unfortunately I cannot reboot at this time so would it be safe to just run the following command? apt-get install --only-upgrade libc6 Do I need to restart any services to make sure proxmox is...
  18. E

    OVSBond + OVSBridge setup problem

    I think I may have found the solution. http://forum.proxmox.com/threads/20254-Bridge-does-not-exist This seems to have been the trick. insserv -d /etc/init.d/openvswitch-switch The bond1 is now up and I am seeing an LACP neighbor. Now for some testing. Best regards, Eric
  19. E

    OVSBond + OVSBridge setup problem

    Just to followup as I continue to investigate this, it is almost as if the GUI config is not getting propagated to the OVS config. /etc/openvswitch/conf.db does not exist ... (warning). Creating empty database /etc/openvswitch/conf.db. I am not sure but perhaps this is related to...
  20. E

    Openvswitch init problems at boot

    I think I may be having the same issue as this thread. http://forum.proxmox.com/threads/20754-OVSBond-OVSBridge-setup-problem?p=105856#post105856 Was a solution ever found for this issue? Eric