[rabbitmq-discuss] the cluster broke and can not auto recover
Kevin Liao
liaoxu at gmail.com
Mon Feb 18 06:18:29 GMT 2013
I am confused by a problem with our RabbitMQ cluster (3 nodes: 167, 188
and 218, with mirrored queues). While it was running, something, possibly
a network I/O error, caused the cluster to break into two parts (167-188
and 218). Each part is confirmed to be running and able to serve, but each
considers the break to be the other side's fault and keeps waiting for the
other node(s) to join back.
Any clue?
<https://lh4.googleusercontent.com/-d-tKvmnFwDI/USHHKY29EmI/AAAAAAAAAF4/yWudeT64Ats/s1600/%E5%B1%8F%E5%B9%95%E5%BF%AB%E7%85%A7+2013-02-17+%E4%B8%8B%E5%8D%886.39.44.png>
218 considers 167 and 188 down, and it is now the master.
<https://lh5.googleusercontent.com/-53xWkDetMEg/USHHZpnFYoI/AAAAAAAAAGA/iysXsLTEzlU/s1600/%E5%B1%8F%E5%B9%95%E5%BF%AB%E7%85%A7+2013-02-17+%E4%B8%8B%E5%8D%886.39.23.png>
167 and 188 both consider 218 down.
And the cluster_status command returns:
<https://lh4.googleusercontent.com/-lmk5YAZrNJM/USHHjmq8voI/AAAAAAAAAGI/HtJf3HWTAJU/s1600/%E5%B1%8F%E5%B9%95%E5%BF%AB%E7%85%A7+2013-02-17+%E4%B8%8B%E5%8D%8810.42.33.png>
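To make the situation concrete, here is a minimal sketch of what I am
seeing, written as plain Python. It is not RabbitMQ output and does not call
any RabbitMQ API; the `views` dictionary and the `partitions` helper are
made up for illustration, filled in from the description above.

```python
# Hypothetical model of the split described above; not RabbitMQ output.
# Each node maps to the set of nodes it currently believes are running.
views = {
    "167": {"167", "188"},
    "188": {"167", "188"},
    "218": {"218"},
}


def partitions(views):
    """Group nodes that agree on exactly the same set of running nodes."""
    groups = {}
    for node, seen in views.items():
        groups.setdefault(frozenset(seen), []).append(node)
    return sorted(sorted(members) for members in groups.values())


print(partitions(views))  # → [['167', '188'], ['218']]
```

Two groups that each see only themselves is the state I would like the
cluster to recover from on its own.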