Recent content by radar

  1. R

    pve node stops responding and requires reboot

    Sorry for being long to respond but the guilty node keeps stop working after a day or 2. And up to now, could not figure out what's the issue.
  2. R

    Advices for configuration of high availability in a cluster

    I have been able to solve all my issues following my ability to create a zfs storage on my cluster. To be honest, I don't know what I have done compared to the last time but this time, I was able to create this zfs storage which has allowed to change the disk of all my CTs and VMs, add...
  3. R

    Advices for configuration of high availability in a cluster

    Hi, Thanks @fba for your help. I think my only option with this conf is to make a fresh install on pve2 and pve3 and use zfs. Thus, I'll have a zfs pool with the same name on the 3 nodes and then use it. I have then moved all my VMs and CTs to pve2 then reinstalled pve1 and pve3 with zfs...
  4. R

    Advices for configuration of high availability in a cluster

    Thanks @fba for your response. I'm a little bit lost. On pve1 where I have only one disk, the only ZFS disk I have has been created during the installation of proxmox on this node and is automatically named rpool. On pve2, I used the second empty disk to create a ZFS disk (I can't say if it's a...
  5. R

    Advices for configuration of high availability in a cluster

    Indeed, comments state that something is going wrong. However, I'm still not sure how to achieve HA with my services.
  6. R

    PVE UI and ssh stop responding

    That's a very good question. Actually, I don't connect often directly to the server but when I did, it was working perfectly. The only thing that makes me think that it's not a network issue is that telnet always works on this specific port while the UI does not.
  7. R

    Advices for configuration of high availability in a cluster

    Thank you @fba but I created a storage named test in node1, then I created a zfs disk on the other nodes named also test but did not add it to storage (according to this video). I run a container on node 1 and I configure replication on nodes 2 and 3. When the replication starts, I have the...
  8. R

    Advices for configuration of high availability in a cluster

    Hi, I'm new to this virtualisation things so I'm looking for advices on how to set-up my cluster. I have 3 nodes, node1 has only one disk while nodes 2 and 3 have 2 disks each.I'd like to implement HA on this cluster but I don't want any external storage. I installed pve with zfs option on node1...
  9. R

    [SOLVED] Broke my cluster

    Solved the issue by editing the corosync.conf file on all nodes. I was able to do that thanks to: systemctl stop pve-cluster systemctl stop corosync pmxcfs -l This allowed me to edit the corosync.conf file. Then, I did killall pmxcfs systemctl start pve-cluster systemctl start corosync One of...
  10. R

    [SOLVED] Broke my cluster

    Hi, I have a 3-nodes cluster that was configured as follows: node1 with ip 192.168.1.120 node2 with ip 192.168.1.121 node3 with ip 192.168.1.122 So I wanted to change and make it coherent (node1 with ip 192.168.1.121, node2 with ip 192.168.1.122, and node3 with ip 192.168.1.123). I started...
  11. R

    PVE UI and ssh stop responding

    Hi, Sorry for the long delay. 1- Yes, got the journal. It's attached here. 2- Yes, new server (but old computer) with new Proxmox. Never used proxmox on it. 3- Pretty old. Intel i5 with 8Gb of ram. 4- Attached too. Thank you very much.
  12. R

    pve node stops responding and requires reboot

    That's a tough question actually :rolleyes: I did not pay attention at all to corosync and I have only one nic per node. So control and data go through the same interface.
  13. R

    pve node stops responding and requires reboot

    Hi, Thank you very much for analyzing this. We are on the same line regarding 2/9, I should have pressed the shutdown button. Regarding HA, I'm thinking about enabling it but did not yet. I saw several issues due to quorum. Can this be the reason? On Feb 9th, I had 2 nodes only and I know 3 is...
  14. R

    pve node stops responding and requires reboot

    Too many reboots on this node and I can't figure out why. I see few warnings and errors but can't figure out if that leads to these reboots. I attach the log since feb 1st with 13 reboots. If you prefer I paste the logs in pastebin, I can (but cannot attach as several days). Thanks a lot for...
  15. R

    PVE UI and ssh stop responding

    Thank you for your response. I managed to get the journal of a longer period but the host stopped responding before I can scp it. So I'll try again later today. In the meanwhile, here is the output of smartctl where I don't see anything to worry about. root@pve:~# smartctl -A /dev/sda smartctl...