Quantcast
Channel: Proxmox Support Forum
Viewing all articles
Browse latest Browse all 170558

Node can't join Quorum but multicast ping works fine

$
0
0
Hello,

I have a 4 node proxmox cluster. Currently I've tried to add a fourth node to no avail. I thought perhaps the issue was related to multicast, but I can do a multicast ping between all nodes in the cluster. I can mount the /etc/pve fileystem just fine. The fourth node also shows up in the UI but with a red light. I'm at a loss of where else to check or how to continue to troubleshoot this.

Below are some commands that I've run to try to troubleshoot the issue:

Code:

root@virt4-atl:/# asmping 239.192.37.213 virt2-atl
asmping joined (S,G) = (*,239.192.37.234)
pinging 10.10.155.11 from 10.10.155.13
  unicast from 10.10.155.11, seq=1 dist=0 time=1.518 ms
multicast from 10.10.155.11, seq=1 dist=0 time=1.543 ms
  unicast from 10.10.155.11, seq=2 dist=0 time=0.257 ms
multicast from 10.10.155.11, seq=2 dist=0 time=0.266 ms
  unicast from 10.10.155.11, seq=3 dist=0 time=0.237 ms
multicast from 10.10.155.11, seq=3 dist=0 time=0.250 ms


--- 10.10.155.11 statistics ---
3 packets transmitted, time 2828 ms
unicast:
  3 packets received, 0% packet loss
  rtt min/avg/max/std-dev = 0.237/0.670/1.518/0.599 ms
multicast:
  3 packets received, 0% packet loss since first mc packet (seq 1) recvd
  rtt min/avg/max/std-dev = 0.250/0.686/1.543/0.606 ms


When I restart 'cman' on the fourth node to join the cluster I get:

Code:

starting cluster:
  Checking if cluster has been disabled at boot... [  OK  ]
  Checking Network Manager... [  OK  ]
  Global setup... [  OK  ]
  Loading kernel modules... [  OK  ]
  Mounting configfs... [  OK  ]
  Starting cman... [  OK  ]
  Waiting for quorum... Timed-out waiting for cluster


The /etc/pve filesystem mounts fine as well:

Code:

/dev/fuse on /etc/pve type fuse (rw,nosuid,nodev,default_permissions,allow_other)

When I do 'pvecm nodes' on one of the working nodes I get:

Code:

root@virt2-atl:~# pvecm nodes
Node  Sts  Inc  Joined              Name
  1  M  1148  2013-02-27 16:29:38  virt3-atl
  2  M  1148  2013-02-27 16:29:38  virt2-atl
  3  M  1148  2013-02-27 16:29:38  virt1-atl
  4  X      0                        virt4-atl


When I issue the same command on the fourth (not working) node I get:

Code:

root@virt4-atl:/#  pvecm nodes
Node  Sts  Inc  Joined              Name
  1  X      0                        virt3-atl
  2  X      0                        virt2-atl
  3  X      0                        virt1-atl
  4  M    96  2013-02-27 16:37:00  virt4-atl

I'm at a loss on how to continue to troubleshoot this. I thought if all the nodes could communicate with multicast they should be able to form the quorum. If anyone can point me in the right direction, it would be greatly appreciated!

Viewing all articles
Browse latest Browse all 170558

Trending Articles