Recent content by XN-Matt

  1. X

    Watchdog Reboots

    Thanks. Not seen any slowness on our Ceph pools when it happens. Very little/low usage on the last one. Sub 1/2MB/s.
  2. X

    Watchdog Reboots

    Do we have anything further on this. Another watchdog reboot yesterday.
  3. X

    Watchdog Reboots

    I assume you mean @Michiel_1afa? The host that this happened on for us had 1 VM running.
  4. X

    Watchdog Reboots

    Logs don’t show any. Just to note, this issue occurs any time of the day. Our backups only run at very specific times. There is no correlation
  5. X

    Watchdog Reboots

    20 mins before the last reboot and one prior to that.
  6. X

    Watchdog Reboots

    Full logs as in the whole of the journal, or just specific to a process? The usually happens just out of the blue but has during an upgrade too. pveversion -v
  7. X

    Watchdog Reboots

    Interesting. Wonder when those issues were introduced. The number of VMs and HA resources we have has not changed since these issues occurred. Our issues started mid-Nov.
  8. X

    Watchdog Reboots

    Would be nice to get some Proxmox involvement as this is clearly an issue with this and other threads of the same problem.
  9. X

    Watchdog Reboots

    We are solely Intel, so I don't think chipset is relevant.
  10. X

    Watchdog Reboots

    Now you mention it, we saw that on our last upgrade. Nodes drying during apt-get dist-upgrade. Never seen that before. We just thought it was because they were busy.
  11. X

    Watchdog Reboots

    No entries around the time of the reboot. Jan 20 04:04:06 hv-5-i corosync[1826]: [QUORUM] Sync members[7]: 1 2 3 4 5 6 7 Jan 20 04:04:06 hv-5-i corosync[1826]: [QUORUM] Sync joined[6]: 1 2 3 4 6 7 Jan 20 04:04:06 hv-5-i corosync[1826]: [TOTEM ] A new membership (1.759) was formed. Members...
  12. X

    Watchdog Reboots

    Something we've seen in version 9 is an increase in watchdog reboots - in fact from none to many. Last few entries of the journal show: ``` Jan 22 04:39:09 hv-5-i watchdog-mux[1504]: client watchdog is about to expire Jan 22 04:39:09 hv-5-i systemd-journald[841]: Received client request to...
  13. X

    Proxmox Datacenter Manager - First Alpha Release

    We installed this but the web server appears unresponsive. Processes are up, server rebooted but nothing. Port 8443 does show in use. ``` proxmox-d 540 www-data 14u IPv6 2946 0t0 TCP *:8443 (LISTEN) proxmox-d 540 www-data 15u...
  14. X

    Ceph / Cluster Networking Question

    I will check when we try again but the main question remains. Do the nodes that are exclusively running virtual machines need to see/access the internal ceph network. Should they only require, at a minimum, interface for internet/VM traffic to the world and cluster traffic (which is shared with...
  15. X

    Ceph / Cluster Networking Question

    Ceph installed OK but it was during the configuration. Near immediately after nodes starting rebooting. Yes we use HA. Public interface is 95.x.x.x Ceph Public/Cluster is 10.0.0.0/24 (we also set a backup ring on the public) Ceph Internal is 192.168.0.0/24 There was no saturation of any link...