XN-Matt's latest activity

  • X
    XN-Matt replied to the thread Watchdog Reboots.
    Thanks. Not seen any slowness on our Ceph pools when it happens. Very little/low usage on the last one. Sub 1/2MB/s.
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    Do we have anything further on this. Another watchdog reboot yesterday.
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    I assume you mean @Michiel_1afa? The host that this happened on for us had 1 VM running.
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    Logs don’t show any. Just to note, this issue occurs any time of the day. Our backups only run at very specific times. There is no correlation
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    20 mins before the last reboot and one prior to that.
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    Full logs as in the whole of the journal, or just specific to a process? The usually happens just out of the blue but has during an upgrade too. pveversion -v
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    Interesting. Wonder when those issues were introduced. The number of VMs and HA resources we have has not changed since these issues occurred. Our issues started mid-Nov.
  • X
    XN-Matt reacted to Michiel_1afa's post in the thread Watchdog Reboots with Like Like.
    Good to know this affects everyone equally :-) We have had discussion on this topic in the past on this forum, It would be nice to get a way to see the softdog status and get logging of when the watchers decide to NOT ping the watchdog for...
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    Would be nice to get some Proxmox involvement as this is clearly an issue with this and other threads of the same problem.
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    We are solely Intel, so I don't think chipset is relevant.
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    Now you mention it, we saw that on our last upgrade. Nodes drying during apt-get dist-upgrade. Never seen that before. We just thought it was because they were busy.
  • X
    XN-Matt replied to the thread Watchdog Reboots.
    No entries around the time of the reboot. Jan 20 04:04:06 hv-5-i corosync[1826]: [QUORUM] Sync members[7]: 1 2 3 4 5 6 7 Jan 20 04:04:06 hv-5-i corosync[1826]: [QUORUM] Sync joined[6]: 1 2 3 4 6 7 Jan 20 04:04:06 hv-5-i corosync[1826]...
  • X
    Something we've seen in version 9 is an increase in watchdog reboots - in fact from none to many. Last few entries of the journal show: ``` Jan 22 04:39:09 hv-5-i watchdog-mux[1504]: client watchdog is about to expire Jan 22 04:39:09 hv-5-i...