Just setup a new cluster. Having issues with fencing on one node after a reboot. After I reboot node #2 it will come up and try to fence node #1
1358439064 delay post_join_delay 6 quorate_from_last_update 1
1358439070 delay of 6s leaves 1 victims
1358439070 medprox1 not a cluster member after 6 sec post_join_delay
1358439070 fencing node medprox1
1358439090 fence medprox1 dev 0.0 agent fence_ipmilan result: error from agent
1358439090 fence medprox1 failed
1358439093 fencing node medprox1
1358439114 fence medprox1 dev 0.0 agent fence_ipmilan result: error from agent
1358439114 fence medprox1 failed
1358439117 fencing node medprox1
After a few minutes I then get this on node #2.
root@medprox2:~# fence_tool dump
fence_tool: dump: can't communicate with fenced
root@medprox2:~# fence_tool join
fence_tool: can't connect to cman
It seems the only way around this issue is to reboot node #1, then once that is up, reboot node #2 and all is well. If I reboot node #2 this process happens again. I do have FENCE_JOIN="yes" uncommented in /etc/default/redhat-cluster-pve on each node. I have two other nodes in the identical setup without this issue.
1358439064 delay post_join_delay 6 quorate_from_last_update 1
1358439070 delay of 6s leaves 1 victims
1358439070 medprox1 not a cluster member after 6 sec post_join_delay
1358439070 fencing node medprox1
1358439090 fence medprox1 dev 0.0 agent fence_ipmilan result: error from agent
1358439090 fence medprox1 failed
1358439093 fencing node medprox1
1358439114 fence medprox1 dev 0.0 agent fence_ipmilan result: error from agent
1358439114 fence medprox1 failed
1358439117 fencing node medprox1
After a few minutes I then get this on node #2.
root@medprox2:~# fence_tool dump
fence_tool: dump: can't communicate with fenced
root@medprox2:~# fence_tool join
fence_tool: can't connect to cman
It seems the only way around this issue is to reboot node #1, then once that is up, reboot node #2 and all is well. If I reboot node #2 this process happens again. I do have FENCE_JOIN="yes" uncommented in /etc/default/redhat-cluster-pve on each node. I have two other nodes in the identical setup without this issue.