[rabbitmq-discuss] Active-active crash report

Vadim Chekan kot.begemot at gmail.com
Mon Apr 30 17:43:54 BST 2012

Thanks for looking into it Matthias,

"rabbitmq-dev": yes, it is load balancer.
Disconnects: I was too lazy to write code for it and just restarted client
I did make a short try to reproduce failure without any load but it worked
just fine. I'll try more today.


On Sat, Apr 28, 2012 at 12:13 AM, Matthias Radestock
<matthias at rabbitmq.com>wrote:

> Vadim,
> (putting the list back on cc)
> On 27/04/12 23:28, Vadim Chekan wrote:
>> I've spent some time today playing with different client settings. Seems
>> ttl does not affect failures at all.
> That's good to know. The more factors we can eliminate as possible causes
> the better.
>  I managed to reproduce crash many times today. Basic idea is: an
>> application, 40 threads create a pub/sub exchange and publish a message
>> every second. Under this load I bring the master node (all queues are
>> usually created on the same node) and often it causes another node to
>> fail. Here is my load simulator (in c#):
>> http://www.heypasteit.com/**clip/0B5W<http://www.heypasteit.com/clip/0B5W>
> That code connects to "rabbitmq-dev". Is that a load balancer sitting in
> front of the three nodes?
> How do you deal with the disconnects resulting from the shutting down of
> nodes? There doesn't seem to be any code to handle that.
> Mind you, I suspect that the failure should still be reproducible without
> any subscriptions and publishes. Would be good to try that and just watch
> the broker logs for errors.
>  As long as maillist does not allow zip attaches, I'm mailing to you guys
>> directly
> Thanks for posting these. There is an error in the logs that we haven't
> seen before:
>           {{badmatch,[]},
>            [{rabbit_mirror_queue_misc,'-**remove_from_queue/2-fun-0-',2}**
> ,
>             {mnesia_tm,apply_fun,3},
>             {mnesia_tm,execute_**transaction,5},
>             {rabbit_misc,'-execute_mnesia_**transaction/1-fun-0-',1},
>             {worker_pool_worker,handle_**call,3},
>             {gen_server2,handle_msg,2},
>             {proc_lib,wake_up,3}]}
> Looking at the code, this appears to indicate that there are no
> master/mirror processes left for the queue. Which is...unexpected. That
> should give us something to go on.
> Regards,
> Matthias.


More information about the rabbitmq-discuss mailing list