I know the solution for the case that a node can become master through pveca -m.
In my cluster there is every now and then the situation that the servers on the cluster are not accessible i think it depends on the following described error.
But in my situation the master seems not to be "really" down. i get several errors in /var/log/kern.log saying that a problem with a sata drive exists (I think) i already posted the errorlog:
I googled this error and it seems that a new installation may cure the pain of the server. But now the question:
What is the procedure if i want to reinstall a cluster master??
Say, i want to shut the master down, reinstall it and then bring it back as master again.
Is that possible?? And what is the right action to do??
In my cluster there is every now and then the situation that the servers on the cluster are not accessible i think it depends on the following described error.
But in my situation the master seems not to be "really" down. i get several errors in /var/log/kern.log saying that a problem with a sata drive exists (I think) i already posted the errorlog:
Code:
Jan 12 08:04:05 Donald kernel: sd 7:0:0:0: [sdb] 2147518464 512-byte hardware sectors (1099529 MB)
Jan 12 08:04:05 Donald kernel: sd 7:0:0:0: [sdb] Write Protect is off
Jan 12 08:04:05 Donald kernel: sd 7:0:0:0: [sdb] Mode Sense: bd 00 00 08
Jan 12 08:04:05 Donald kernel: sd 7:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
Jan 12 08:05:07 Donald kernel: ata2.00: qc timeout (cmd 0xa0)
Jan 12 08:05:07 Donald kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
Jan 12 08:05:07 Donald kernel: ata2.00: cmd a0/00:00:00:00:00/00:00:00:00:00/a0 tag 0
Jan 12 08:05:07 Donald kernel: cdb 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jan 12 08:05:07 Donald kernel: res 51/20:03:00:00:00/00:00:00:00:00/a0 Emask 0x5 (timeout)
Jan 12 08:05:07 Donald kernel: ata2.00: status: { DRDY ERR }
Jan 12 08:05:12 Donald kernel: ata2: port is slow to respond, please be patient (Status 0xd0)
Jan 12 08:05:17 Donald kernel: ata2: device not ready (errno=-16), forcing hardreset
Jan 12 08:05:17 Donald kernel: ata2: soft resetting link
Jan 12 08:05:18 Donald kernel: ata2.01: NODEV after polling detection
Jan 12 08:05:18 Donald kernel: ata2.00: configured for UDMA/25
Jan 12 08:05:18 Donald kernel: ata2: EH complete
Jan 12 08:07:00 Donald kernel: ata2.00: qc timeout (cmd 0xa0)
Jan 12 08:07:00 Donald kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
Jan 12 08:07:00 Donald kernel: ata2.00: cmd a0/00:00:00:00:00/00:00:00:00:00/a0 tag 0
Jan 12 08:07:00 Donald kernel: cdb 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jan 12 08:07:00 Donald kernel: res 51/20:03:00:00:00/00:00:00:00:00/a0 Emask 0x5 (timeout)
Jan 12 08:07:00 Donald kernel: ata2.00: status: { DRDY ERR }
Jan 12 08:07:05 Donald kernel: ata2: port is slow to respond, please be patient (Status 0xd0)
Jan 12 08:07:10 Donald kernel: ata2: device not ready (errno=-16), forcing hardreset
Jan 12 08:07:10 Donald kernel: ata2: soft resetting link
Jan 12 08:07:10 Donald kernel: ata2.01: NODEV after polling detection
Jan 12 08:07:11 Donald kernel: ata2.00: configured for UDMA/25
Jan 12 08:07:11 Donald kernel: ata2: EH complete
What is the procedure if i want to reinstall a cluster master??
Say, i want to shut the master down, reinstall it and then bring it back as master again.
Is that possible?? And what is the right action to do??