Quantcast
Channel: Proxmox Support Forum
Viewing all articles
Browse latest Browse all 170703

Proxmox Cluster broken

$
0
0
I just did a mistake on one of my proxmox 2.3 servers.
I accidentally delete the known .ssh/known_host file and run this command "pvecm add xxx.xxx.xxx.xxx -f "
on one of my existing cluster nodes (i was in the wrong terminal...)
This cluster is a 2 nodes cluster.

and now i can't login anymore to the webinterface...
ssh is working well and the CT's are still running.
is there a possibility to rejoin the old cluster?
And one very important question:
what happens with all the CT's if i leave the cluster?
and try to join a new cluster?

Thanks for your help
I found this information in an other thread could this maybe fix my problem?but what will happen with all my CT's?

You don't need to re-install the node. Save the current VM configs in /etc/pve/local/qemu-server before removing everything.
Remove the node from the cluster with pvecm delnode <node to be removed> on one of the other nodes.

Perform the following commands on the node to be removed:

service cman stop
killall -9 corosync cman dlm_controld fenced
service pve-cluster stop
rm /etc/cluster/cluster.conf
rm -rf /var/lib/pve-cluster/* /var/lib/pve-cluster/.*
rm /var/lib/cluster/*
reboot (sometimes cluster kernel modules remain hang on the connections so it is better to reboot, to be sure).


After rebooting, you can add the node as usual:
pvecm add <IP address of one of the nodes already in the cluster>


I've broken and re-did a cluster this way for about 7-8 times in the past two weeks (it is important to me to make sure that I can restore a broken cluster before we go live).

Viewing all articles
Browse latest Browse all 170703

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>