[rabbitmq-discuss] the cluster broke and can not auto recover

Kevin Liao liaoxu at gmail.com
Mon Feb 18 06:18:29 GMT 2013


 

I was confused by the problem when the rabbitmq cluster(3 nodes 
167/188/218, mirrored quque) was running, suddenly maybe an network io 
error happened, then the cluster broker into two(167-188 and 218), each of 
them was confirmed running and able to serve but also consider the reason 
of cluster breaking was not their own fault and always waited for the other 
node to join back again.


Any clue?

<https://lh4.googleusercontent.com/-d-tKvmnFwDI/USHHKY29EmI/AAAAAAAAAF4/yWudeT64Ats/s1600/%E5%B1%8F%E5%B9%95%E5%BF%AB%E7%85%A7+2013-02-17+%E4%B8%8B%E5%8D%886.39.44.png>



218 consider 167and188 down, and it's master now.


<https://lh5.googleusercontent.com/-53xWkDetMEg/USHHZpnFYoI/AAAAAAAAAGA/iysXsLTEzlU/s1600/%E5%B1%8F%E5%B9%95%E5%BF%AB%E7%85%A7+2013-02-17+%E4%B8%8B%E5%8D%886.39.23.png>




167 and 188 both consider 218 down


And the cluster_status command returns

<https://lh4.googleusercontent.com/-lmk5YAZrNJM/USHHjmq8voI/AAAAAAAAAGI/HtJf3HWTAJU/s1600/%E5%B1%8F%E5%B9%95%E5%BF%AB%E7%85%A7+2013-02-17+%E4%B8%8B%E5%8D%8810.42.33.png>


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20130217/d4480562/attachment.htm>


More information about the rabbitmq-discuss mailing list