Search results

  1. P

    Calling on the hive mind: networking issue from hell

    Hi, so I have a small home lab cluster of 3 PVE nodes and one additional PBS. And I have the following networks (each on separate NICs/cables/switches): - Management - Corosync - Ceph - Backup The PVE nodes are connected to all networks. The PBS in only connected to Management and Backup...
  2. P

    Search backup not for date but for file

    Hi, first of all: Thank you very much for PBS (and PVE). It is such a great piece of software. Currently, I can search the backups on PBS by selecting the VM and then an individual backup by date. There, I can dive into the file system and retrieve a file. Sometimes, I would like to first...
  3. P

    Off network PBS best practice?

    Seeing that there doesn't seem to be a solution where there is only one PBS that doesn't sit on the same network as the PVE (safe piercing a hole in the firewall that ought to separate the networks), I would like to understand whether the following would be possible: There is one small PBS on...
  4. P

    Off network PBS best practice?

    Yeah, that's what I have currently, but I was wondering whether there might be another setup that doesn't require two separate PBSs.
  5. P

    NIC keeps changing interfaces on reboots

    For me, it has been working since I adopted names which the systems wouldn't use (like lan0 and lan1).
  6. P

    Off network PBS best practice?

    So I have been running a PBS to complement my home lab PVE cluster for a while now and everything works beautifully. And yet I am not fully happy because if my (management) network ever were breached and my VMs compromised, the next steps for an attacker would be to go after my backups. And if...
  7. P

    Help configuring vGPU?

    I must have, because I got vGPU working after a while. But I don't remember the exact steps unfortunately. But I gave up on using vGPU because - it was flaky, hit and miss - the concept of dividing my card into virtual GPUs isn't actually right for my use case because I keep experimenting and...
  8. P

    Selectively delete file across all backups

    So I have PBS running to backup my home lab cluster. And it is working fine and I backup everything there and from there on tape as well. As data protection is becoming more and more important, I am wondering whether there is a way to selectively remove individual files or folders from not just...
  9. P

    [SOLVED] Ceph rebalancing just stopped

    This may have been the decisive advice. I wanted to keep my to-be-replaced node untouched until the replacement node was completely up to speed. And I had thought that monitors, OSDs, etc. on an offline node would have no negative impact. But with your advice, I booted up the to-be-replaced...
  10. P

    Looking for a "cheap" way to upgrade to 40gibt

    Oh FFS! How can there even be something like an IB card that can't do IB? Anways, so does the "EN" signify cards that can't do IB? In other words, when I buy new cards, will I be safe, if I avoid "EN" cards? Is there anything else that I would want to avoid? Is there any value in buying "VPI"...
  11. P

    Looking for a "cheap" way to upgrade to 40gibt

    Man, this is proving so much more complicated than I anticipated (and I did anticipate issues...)! So mstconfig tells me Device #1: ---------- Device type: ConnectX3Pro Device: 65:00.0 Configurations: Next Boot SRIOV_EN...
  12. P

    [SOLVED] Ceph rebalancing just stopped

    There is one monitor on each of the three (online) nodes (and one on the offline node as well). Understood. But I would need to so many OSDs per node that my energy bill would become fully unsustainable :( However, this should not affect rebalancing, right?
  13. P

    Turn off SMART error notifications till next error?

    Hi, I am getting SMART error notifications from PVE every day about the same one error (there is 1 unreadable sector on one of the SSDs). While I appreciate the notification in principle, I don't need to be reminded of the error every day. And if nothing changes, I think I can live with that...
  14. P

    [SOLVED] Ceph rebalancing just stopped

    No, I just shut the node down. Ceph complains about the missing node and the missing OSDs. But it did copy over a good part to the new node and new OSDs before it stopped. The same goes for the monitor service. But there are enough monitors in the cluster. Between 44% and 55% I don't have...
  15. P

    [SOLVED] Ceph rebalancing just stopped

    Hi, I have a small three node cluster. There are two pools with three OSDs each. Each node hosts one OSD from each pool (one HDD, one SSD). Replication rule is 3/2. When one of the OSDs in one of the nodes started acting up, I decided to not just replace the OSD but to also replace the entire...
  16. P

    Looking for a "cheap" way to upgrade to 40gibt

    So, I have made the step and connected everything but hit a wall: While I can see the cards in the PVE GUI and have assigned them IP addresses and they start up, they can't connect to one another via the switch. At first I thought that I needed to change some settings on the switch (and that...
  17. P

    Looking for a "cheap" way to upgrade to 40gibt

    Ah, thanks. I think I knew that a long time ago (because I had actually switched this to one of the 10gbe networks) but I had since forgotten about it...
  18. P

    ConnectX-3 QSFP Ethernet Interface Shows No-Carrier

    Same issue here with CX3-Pro card and PVE8.2 Has anyone got those to work?
  19. P

    Looking for a "cheap" way to upgrade to 40gibt

    I have installed the new IB cards in all hosts but have not yet made the switch because of one (hopefully final) question that came to my mind: When I migrate VMs from one host to another, which of the networks is used for that? If it is the corosync network, then switching it back to 1gbe will...
  20. P

    GUI shell connection failed with code 1005

    Not sure whether this is a config or a networking issue... I tried to connect to the shell via the GUI. It said connecting and then "Connection failed (Code: 1005)" Background: This works perfectly fine when I access the GUI directly locally. But when I access the GUI remotely via...