[rabbitmq-discuss] Clusters stop working/passing messages

schaibaa schaible_adam at yahoo.com
Sun Aug 19 18:18:33 BST 2012


Hi Guys,

I was hoping I might be able to get some help -- I'm running a 3 node 
rabbitmq cluster.  All 3 are currently in Disc mode.

The cluster isn't that busy ... I'm not sure how to obtain throughput 
stats, but my estimate is a few hundred messages per hour.

Anyway, every week or so, we notice an issue in our application, and I'll 
check the dashboard.  One of the nodes will be red and offline.  The 
RabbitMQ service on that machine is still running, but it appears offline 
to the other two nodes.

Restarting the nodes seems to immediately fix the problem... I'm just 
hoping to eliminate this issue.

OS:  Windows server 2008 R2
RabbitMQ Version: 2.8.0

The only thing I see in the logs is below:

=ERROR REPORT==== 19-Aug-2012::12:37:55 ===
Mnesia('rabbit at NODE3'): ** ERROR ** mnesia_event got 
{inconsistent_database, running_partitioned_network, 'rabbit at NODE1'}

=ERROR REPORT==== 19-Aug-2012::12:37:55 ===
Mnesia('rabbit at NODE3'): ** ERROR ** mnesia_event got 
{inconsistent_database, starting_partitioned_network, 'rabbit at NODE2'}

This when I restarted the service on NODE1 ... and a similar log message 
appears on the other nodes.

Any suggestions?  Relatively new to RabbitMQ ...

Thanks :)


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20120819/8130c984/attachment.htm>


More information about the rabbitmq-discuss mailing list