[rabbitmq-discuss] RabbitMQ Node recovery - How to set a node to auto restart on failure

Michael Klishin mklishin at gopivotal.com
Wed Feb 5 09:53:30 GMT 2014


On 5 Feb 2014, at 13:48, rails <stammailbox at gmail.com> wrote:

> 1. Lets say a process was shut down 
> (disk space full, internal error, someone killed it)
> 
> 2. Lets say a machine was disconnected from network. after 10 minutes it is connected again.

Like I said earlier in this thread, such failures cannot be recovered by other RabbitMQ nodes (it is easy
to see with #2). It is not even clear how to recover (e.g. how do you free up disk space).

What you need is a tool such as Monit which monitors RabbitMQ nodes, available disk space, etc
and sends out notifications and (if possible) tries to recover the system.

MK

Software Engineer, Pivotal/RabbitMQ




More information about the rabbitmq-discuss mailing list