[rabbitmq-discuss] HA active/active cluster in a bad state

Bryan Murphy bmurphy1976 at gmail.com
Tue Oct 4 21:57:09 BST 2011


I have a 3-node HA active/active cluster in our integration environment that
is currently in a bad state.

I logged into one node and ran /etc/init.d/rabbitmq-server restart, which
never returned.  From another terminal, I ran kill -9 on ALL rabbitmq
processes and then ran /etc/init.d/rabbitmq-server start.

This brought the server back up, but it is not functioning correctly.
Running sudo rabbitmqctl cluster_status works fine:

Cluster status of node 'rabbit@domU-12-31-38-07-18-A6' ...
[{nodes,[{disc,['rabbit@domU-12-31-38-07-18-A6','rabbit@ip-10-202-209-83',
                'rabbit@domU-12-31-39-06-72-50']}]},
 {running_nodes,['rabbit@domU-12-31-39-06-72-50','rabbit@ip-10-202-209-83',
                 'rabbit@domU-12-31-38-07-18-A6']}]
...done.

However, sudo rabbitmqctl list_queues blocks and never returns.
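
One way to record which commands hang without tying up a terminal is to run
each one under a time limit.  The following is only a minimal sketch: it
assumes GNU coreutils timeout is installed, and run_with_limit is a
hypothetical helper name, not part of RabbitMQ.

```shell
#!/bin/sh
# run_with_limit: hypothetical helper (not part of RabbitMQ).
# Runs a command under a time limit in seconds and prints whether it
# finished or had to be stopped. Assumes GNU coreutils `timeout`.
run_with_limit() {
    limit="$1"
    shift
    timeout "$limit" "$@"
    status=$?
    # GNU timeout exits with 124 when the command exceeded the limit.
    if [ "$status" -eq 124 ]; then
        echo "TIMED OUT after ${limit}s: $*"
    else
        echo "exit $status: $*"
    fi
    return "$status"
}
```

For instance, run_with_limit 30 sudo rabbitmqctl list_queues would stop
after 30 seconds instead of blocking indefinitely, and the same wrapper
could be used around cluster_status for comparison.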

I'm not touching anything else while the cluster is in this state.  What
diagnostics can I provide to help track down this problem?

Thanks,
Bryan