[rabbitmq-discuss] Restarting/re-establishing failed federation connections

Steve Powell spowell at rbcon.com
Thu Jul 12 14:50:56 BST 2012


Hi David,

Federated exchanges establish AMQP connections upstream, so it is the
downstream Rabbit server which is connecting to the upstream one, not
the other way around.

What is probably happening when you restart the upstream cluster is that
the downstream federation recovery process is then able to restart. Why
it is blocking before that, I cannot tell.

Please can you tell me what versions of RabbitMQ you are running on the
upstream and downstream clusters? Although these are allowed to be
different, some changes were made in 2.8.2 which try to take advantage
of upstream exchange modifications if they are detected. If the upstream
and downstream servers run versions which straddle 2.8.2, I might
expect some odd behaviour in recovery scenarios (though we should be
fine :-).
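
To make "straddle 2.8.2" concrete, here is a minimal sketch of the check I
have in mind. It is illustration only and not part of RabbitMQ: the function
names are hypothetical, and it assumes plain dotted numeric version strings
such as those reported by "rabbitmqctl status".

```python
def parse_version(text):
    """Turn a dotted version string such as '2.8.4' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def straddles(upstream, downstream, boundary="2.8.2"):
    """Return True when exactly one of the two versions predates boundary.

    Hypothetical helper: 'boundary' defaults to 2.8.2, the release in which
    the federation changes mentioned above were made.
    """
    cut = parse_version(boundary)
    return (parse_version(upstream) < cut) != (parse_version(downstream) < cut)


if __name__ == "__main__":
    # Example version pair chosen for illustration only.
    print(straddles("2.8.1", "2.8.4"))
```

If this returns True for your two clusters, that is the mixed-version case
I would want to look at first.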

Steve Powell
steve at rabbitmq.com
[wrk: +44-2380-111-528] [mob: +44-7815-838-558]

On 11 Jul 2012, at 13:35, David Gillies wrote:

> Hi, 
> 
> So I've got a simple federated setup where we have a bunch of rabbits in various data centres all publishing to a central upstream queue. This works well for most of them, except for one data centre which has particularly bad connectivity. I've found the upstream rabbitmq server gets into a situation whereby it loses connectivity to the downstream rabbit and is never able to connect again, which results in me going to the lengths of restarting the upstream rabbitmq cluster (just two servers in a cluster). Is there a less heavy-handed way of re-establishing the connection to the downstream cluster for situations where the federation plugin can't seem to reconnect anymore?
> 
> Thanks,
> David.
> _______________________________________________
> rabbitmq-discuss mailing list
> rabbitmq-discuss at lists.rabbitmq.com
> https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss


