could you share parts of the logs?
Definitely ... here are some logs from AIT3, the last node to reboot and the one that I have syslog entries for. The reboot happened just after Apr 5 05:19:30. The nodes address is 172.20.64.14. 172.20.64.253 is one of our monitoring servers.
Code:
Apr 5 05:00:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:00:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:00:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:00:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:01:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:01:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:01:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:02:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:02:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:02:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:03:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:03:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:03:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:04:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:04:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:04:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:05:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:05:18 ait3 snmpd[4329]: Connection from UDP: [172.20.64.253]:64390->[172.20.64.14]:161
Apr 5 05:05:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:05:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:06:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:06:18 ait3 pmxcfs[4785]: [dcdb] notice: data verification successful
Apr 5 05:06:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:06:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:07:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:07:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:07:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:08:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:08:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:08:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:09:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:09:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:09:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:10:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:10:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:10:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:11:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:11:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:11:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:12:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:12:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:12:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:13:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:13:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:13:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:14:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:14:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:14:35 ait3 pmxcfs[4785]: [status] notice: received log
Apr 5 05:15:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:15:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:15:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:15:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:16:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:16:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:16:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:17:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:17:01 ait3 CRON[2898136]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Apr 5 05:17:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:17:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:18:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:18:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:18:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:00 ait3 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:19:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:00 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:01 ait3 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:19:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:30 ait3 snmpd[4329]: error on subcontainer 'ia_addr' insert (-1)
'zgrep corosync * | grep " 5 " | grep " 05:"' didn't return anything. (Sticking with the command that I'm familiar with for now.)
'zgrep corosync * | grep " 5 "' doesn't show any entries untill 09:15 when I reboot a cluster node to troubleshoot.
'zgrep pmxcfs * | grep " 5 " | grep " 05:"' returns nothing as well.
Here's 4 lines before and after the reboot time using 'zgrep pmxcfs * | grep " 5 "'.
Code:
daemon.log.1:Apr 5 04:06:18 ait3 pmxcfs[4785]: [dcdb] notice: data verification successful
daemon.log.1:Apr 5 04:14:33 ait3 pmxcfs[4785]: [status] notice: received log
daemon.log.1:Apr 5 04:29:33 ait3 pmxcfs[4785]: [status] notice: received log
daemon.log.1:Apr 5 04:44:34 ait3 pmxcfs[4785]: [status] notice: received log
syslog.7.gz:Apr 5 06:29:39 ait3 pmxcfs[4793]: [status] notice: received log
syslog.7.gz:Apr 5 06:44:39 ait3 pmxcfs[4793]: [status] notice: received log
syslog.7.gz:Apr 5 06:59:40 ait3 pmxcfs[4793]: [status] notice: received log
syslog.7.gz:Apr 5 07:06:18 ait3 pmxcfs[4793]: [dcdb] notice: data verification successful
please check the logs for the same timeframe on the other nodes
On AIT2, using 'zgrep fence *' provides a page of entries with pve-ha-crm.
'zgrep pve-ha-crm * | grep " 5 05:"' shows:
Note that there aren't any entries before 5AM on the 5th ... checked by removing the '05:'.
Code:
daemon.log.1:Apr 5 05:22:27 ait2 pve-ha-crm[5642]: successfully acquired lock 'ha_manager_lock'
daemon.log.1:Apr 5 05:22:27 ait2 pve-ha-crm[5642]: watchdog active
daemon.log.1:Apr 5 05:22:27 ait2 pve-ha-crm[5642]: status change slave => master
daemon.log.1:Apr 5 05:22:27 ait2 pve-ha-crm[5642]: node 'ait3': state changed from 'online' => 'unknown'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: service 'vm:100': state changed from 'started' to 'fence'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: service 'vm:106': state changed from 'started' to 'fence'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: service 'vm:109': state changed from 'started' to 'fence'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: service 'vm:110': state changed from 'started' to 'fence'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: service 'vm:111': state changed from 'started' to 'fence'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: service 'vm:112': state changed from 'started' to 'fence'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: service 'vm:113': state changed from 'started' to 'fence'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: service 'vm:115': state changed from 'started' to 'fence'
daemon.log.1:Apr 5 05:23:27 ait2 pve-ha-crm[5642]: node 'ait3': state changed from 'unknown' => 'fence'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: successfully acquired lock 'ha_agent_ait3_lock'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: fencing: acknowledged - got agent lock for node 'ait3'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: node 'ait3': state changed from 'fence' => 'unknown'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: recover service 'vm:100' from fenced node 'ait3' to node 'ait2'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: service 'vm:100': state changed from 'fence' to 'started' (node = ait2)
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: recover service 'vm:106' from fenced node 'ait3' to node 'ait2'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: service 'vm:106': state changed from 'fence' to 'started' (node = ait2)
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: recover service 'vm:109' from fenced node 'ait3' to node 'ait1'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: service 'vm:109': state changed from 'fence' to 'started' (node = ait1)
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: recover service 'vm:110' from fenced node 'ait3' to node 'ait2'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: service 'vm:110': state changed from 'fence' to 'started' (node = ait2)
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: recover service 'vm:111' from fenced node 'ait3' to node 'ait1'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: service 'vm:111': state changed from 'fence' to 'started' (node = ait1)
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: recover service 'vm:112' from fenced node 'ait3' to node 'ait2'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: service 'vm:112': state changed from 'fence' to 'started' (node = ait2)
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: recover service 'vm:113' from fenced node 'ait3' to node 'ait1'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: service 'vm:113': state changed from 'fence' to 'started' (node = ait1)
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: recover service 'vm:115' from fenced node 'ait3' to node 'ait2'
daemon.log.1:Apr 5 05:23:37 ait2 pve-ha-crm[5642]: service 'vm:115': state changed from 'fence' to 'started' (node = ait2)
daemon.log.1:Apr 5 05:24:57 ait2 pve-ha-crm[5642]: node 'ait3': state changed from 'unknown' => 'online'
Looking in more detail, here's a bit from 'cat daemon.log.1 | grep "Apr 5 05:"'
Code:
Apr 5 05:19:00 ait2 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:19:01 ait2 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:19:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:51 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:51 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:51 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:51 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:51 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:00 ait2 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:20:01 ait2 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:20:18 ait2 snmpd[4426]: Connection from UDP: [172.20.64.253]:58590->[172.20.64.13]:161
Apr 5 05:20:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:21 ait2 snmpd[4426]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:31 ait2 corosync[5025]: notice [TOTEM ] A new membership (172.20.128.12:5452) was formed. Members left: 3
Apr 5 05:20:31 ait2 corosync[5025]: notice [TOTEM ] Failed to receive the leave message. failed: 3
Apr 5 05:20:31 ait2 corosync[5025]: [TOTEM ] A new membership (172.20.128.12:5452) was formed. Members left: 3
Apr 5 05:20:31 ait2 corosync[5025]: [TOTEM ] Failed to receive the leave message. failed: 3
Apr 5 05:20:31 ait2 corosync[5025]: warning [CPG ] downlist left_list: 1 received
Apr 5 05:20:31 ait2 corosync[5025]: [CPG ] downlist left_list: 1 received
Apr 5 05:20:31 ait2 corosync[5025]: [CPG ] downlist left_list: 1 received
Apr 5 05:20:31 ait2 corosync[5025]: warning [CPG ] downlist left_list: 1 received
Apr 5 05:20:31 ait2 pmxcfs[4870]: [dcdb] notice: members: 1/4736, 2/4870
Apr 5 05:20:31 ait2 corosync[5025]: notice [QUORUM] Members[2]: 1 2
Apr 5 05:20:31 ait2 corosync[5025]: notice [MAIN ] Completed service synchronization, ready to provide service.
Apr 5 05:20:31 ait2 pmxcfs[4870]: [dcdb] notice: starting data syncronisation
Apr 5 05:20:31 ait2 pmxcfs[4870]: [status] notice: members: 1/4736, 2/4870
Apr 5 05:20:31 ait2 pmxcfs[4870]: [status] notice: starting data syncronisation
Apr 5 05:20:31 ait2 corosync[5025]: [QUORUM] Members[2]: 1 2
Apr 5 05:20:31 ait2 corosync[5025]: [MAIN ] Completed service synchronization, ready to provide service.
On AIT1, using 'zgrep fence *' doesn't show anything.
There's something in daemon.log.1 that's no text as I had to use 'grep "Apr 5 05:" daemon.log.1 -a' to pull this:
Code:
Apr 5 05:19:01 ait1 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:19:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:19:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:00 ait1 systemd[1]: Starting Proxmox VE replication runner...
Apr 5 05:20:01 ait1 systemd[1]: Started Proxmox VE replication runner.
Apr 5 05:20:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:02 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:15 ait1 pveproxy[2190176]: Clearing outdated entries from certificate cache
Apr 5 05:20:18 ait1 snmpd[4323]: Connection from UDP: [172.20.64.253]:58593->[172.20.64.12]:161
Apr 5 05:20:29 ait1 corosync[4983]: notice [TOTEM ] A processor failed, forming new configuration.
Apr 5 05:20:29 ait1 corosync[4983]: [TOTEM ] A processor failed, forming new configuration.
Apr 5 05:20:31 ait1 corosync[4983]: notice [TOTEM ] A new membership (172.20.128.12:5452) was formed. Members left: 3
Apr 5 05:20:31 ait1 corosync[4983]: notice [TOTEM ] Failed to receive the leave message. failed: 3
Apr 5 05:20:31 ait1 corosync[4983]: [TOTEM ] A new membership (172.20.128.12:5452) was formed. Members left: 3
Apr 5 05:20:31 ait1 corosync[4983]: [TOTEM ] Failed to receive the leave message. failed: 3
Apr 5 05:20:31 ait1 corosync[4983]: warning [CPG ] downlist left_list: 1 received
Apr 5 05:20:31 ait1 corosync[4983]: [CPG ] downlist left_list: 1 received
Apr 5 05:20:31 ait1 corosync[4983]: warning [CPG ] downlist left_list: 1 received
Apr 5 05:20:31 ait1 corosync[4983]: [CPG ] downlist left_list: 1 received
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: members: 1/4736, 2/4870
Apr 5 05:20:31 ait1 corosync[4983]: notice [QUORUM] Members[2]: 1 2
Apr 5 05:20:31 ait1 corosync[4983]: notice [MAIN ] Completed service synchronization, ready to provide service.
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: starting data syncronisation
Apr 5 05:20:31 ait1 corosync[4983]: [QUORUM] Members[2]: 1 2
Apr 5 05:20:31 ait1 corosync[4983]: [MAIN ] Completed service synchronization, ready to provide service.
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: cpg_send_message retried 1 times
Apr 5 05:20:31 ait1 pmxcfs[4736]: [status] notice: members: 1/4736, 2/4870
Apr 5 05:20:31 ait1 pmxcfs[4736]: [status] notice: starting data syncronisation
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: received sync request (epoch 1/4736/0000000A)
Apr 5 05:20:31 ait1 pmxcfs[4736]: [status] notice: received sync request (epoch 1/4736/0000000A)
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: received all states
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: leader is 1/4736
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: synced members: 1/4736, 2/4870
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: start sending inode updates
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: sent all (0) updates
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: all data is up to date
Apr 5 05:20:31 ait1 pmxcfs[4736]: [dcdb] notice: dfsm_deliver_queue: queue length 5
Apr 5 05:20:31 ait1 pmxcfs[4736]: [status] notice: received all states
Apr 5 05:20:31 ait1 pmxcfs[4736]: [status] notice: all data is up to date
Apr 5 05:20:31 ait1 pmxcfs[4736]: [status] notice: dfsm_deliver_queue: queue length 7
Apr 5 05:20:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
Apr 5 05:20:32 ait1 snmpd[4323]: error on subcontainer 'ia_addr' insert (-1)
check the fenced nodes log ... for any messages from the kernel
OK ... using 'zgrep ' 5 05:.*kernel' *' on AIT3 shows that I should look in kern.log.1 and messages.1.
'zgrep ' 5 05:' kern.log.1' This only shows the system booting ... there's nothing before. The only entries are 2 hours earlier from pveupdate.
Code:
Apr 5 03:38:35 ait3 pveupdate[2862945]: <root@pam> starting task UPID:ait3:002BAF91:02EB5BA9:5CA7302B:aptupdate::root@pam:
Apr 5 03:38:40 ait3 pveupdate[2862945]: <root@pam> end task UPID:ait3:002BAF91:02EB5BA9:5CA7302B:aptupdate::root@pam: OK
messages.1 look to be the same information.
it could also be a bug in the watchdog module you're using - which one do you have configured
We're using the IPMI watchdog.
Perhaps it's worth a few questions here from you given that any node which reboots (in an unplanned way) sometimes losses its watchdog config. Another manual reboot fixes this.
any other particularities of your setup
Hmm ... I don't think so but I'm filtering based on what I know. The setup has worked well up to this point ... no other strange issues.