[rabbitmq-discuss] Outage, brokers refuse to start back up
James Carr
james.r.carr at gmail.com
Fri Dec 9 15:54:55 GMT 2011
Yes. I shut both down, and when trying to restart either the master
node or the other node I got the same error.
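
For anyone reading the archive later, the startup rule Simon describes
in the quoted reply below can be sketched as a toy model. This is only
an illustration in Python, not RabbitMQ code: the names Node,
startup_outcome and restart_order are made up for this sketch, and the
30 second figure is taken from his message. It covers an orderly
shutdown only, not the power failure described in this thread.

```python
from dataclasses import dataclass
from typing import List, Optional

# Wait time quoted in the thread; treated here as a fixed constant (assumption).
TABLE_WAIT_SECONDS = 30


@dataclass
class Node:
    name: str
    shutdown_order: int  # higher value = shut down later (hypothetical field)


def last_node_down(cluster: List[Node]) -> Node:
    """The node that was shut down last is treated as authoritative."""
    return max(cluster, key=lambda n: n.shutdown_order)


def startup_outcome(node: Node, cluster: List[Node],
                    seconds_until_last_node_back: Optional[float]) -> str:
    """Model what happens when `node` is started after a full cluster stop.

    seconds_until_last_node_back is None if the last node never returns.
    """
    if node.name == last_node_down(cluster).name:
        return "starts"  # the authoritative node does not wait for anyone
    if (seconds_until_last_node_back is not None
            and seconds_until_last_node_back <= TABLE_WAIT_SECONDS):
        return "starts"  # the last node reappeared within the wait window
    return "timeout_waiting_for_tables"


def restart_order(cluster: List[Node]) -> List[str]:
    """Suggested order: bring the last node to go down back up first."""
    ordered = sorted(cluster, key=lambda n: n.shutdown_order, reverse=True)
    return [n.name for n in ordered]
```

Under this model, starting the nodes in the order returned by
restart_order avoids the wait, since the authoritative node is already
up when the others boot.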
On Fri, Dec 9, 2011 at 9:49 AM, Simon MacMullen <simon at rabbitmq.com> wrote:
> When you start a cluster up again after it's entirely shut down, any node
> other than the last one to be shut down will wait at startup for the last
> one to reappear (since the last node to shut down is authoritative). If the
> last node does not reappear in 30s then the error message you quote is
> shown. Did you get the same error on both nodes?
>
> Cheers, Simon
>
>
> On 09/12/11 15:03, James Carr wrote:
>>
>> Okay, so I backed up our mnesia dir and wiped it clean on both boxes.
>> Brokers started fine. I then copied the DCD and DCL files back over
>> and my users, exchanges, and queues were back.
>>
>> Is there a recommended way to protect against such an outage?
>>
>> Thanks,
>> James
>>
>>
>> On Fri, Dec 9, 2011 at 8:41 AM, James Carr <james.r.carr at gmail.com> wrote:
>>>
>>> So our datacenter had a power failure yesterday and the brokers now
>>> REFUSE to start back up. I can send the erl_crash.dump as needed (it
>>> is quite large) but here are the other logs.
>>>
>>> Any ideas?
>>>
>>> Thanks,
>>> James
>>>
>>>
>>> Rabbitmq log:
>>>
>>> =INFO REPORT==== 9-Dec-2011::08:35:32 ===
>>> Limiting to approx 924 file handles (829 sockets)
>>>
>>> =ERROR REPORT==== 9-Dec-2011::08:36:02 ===
>>> FAILED
>>> Reason: {error,
>>> {timeout_waiting_for_tables,
>>> [rabbit_user,rabbit_user_permission,rabbit_vhost,
>>> rabbit_durable_route,rabbit_durable_exchange,
>>> rabbit_durable_queue]}}
>>> Stacktrace: [{rabbit_mnesia,wait_for_tables,1},
>>> {rabbit_mnesia,check_schema_integrity,0},
>>> {rabbit_mnesia,ensure_schema_integrity,0},
>>> {rabbit_mnesia,init,0},
>>> {rabbit,'-run_boot_step/1-lc$^1/1-1-',1},
>>> {rabbit,run_boot_step,1},
>>> {rabbit,'-start/2-lc$^0/1-0-',1},
>>> {rabbit,start,2}]
>>>
>>> =INFO REPORT==== 9-Dec-2011::08:36:03 ===
>>> application: rabbit
>>> exited: {bad_return,{{rabbit,start,[normal,[]]},
>>> {'EXIT',{rabbit,failure_during_boot}}}}
>>> type: permanent
>>>
>>>
>>> startup_log:
>>>
>>> Activating RabbitMQ plugins ...
>>> 11 plugins activated:
>>> * amqp_client-2.6.1
>>> * erlando-2.6.1
>>> * mochiweb-1.3-rmq2.6.1-git9a53dbd
>>> * rabbitmq_federation-2.6.1
>>> * rabbitmq_management-2.6.1
>>> * rabbitmq_management_agent-2.6.1
>>> * rabbitmq_management_visualiser-2.6.1
>>> * rabbitmq_mochiweb-2.6.1
>>> * rabbitmq_shovel-2.6.1
>>> * rabbitmq_shovel_management-2.6.1
>>> * webmachine-1.7.0-rmq2.6.1-hg0c4b60a
>>>
>>>
>>> +---+   +---+
>>> |   |   |   |
>>> |   |   |   |
>>> |   |   |   |
>>> |   +---+   +-------+
>>> |                   |
>>> | RabbitMQ  +---+   |
>>> |           |   |   |
>>> |   v2.6.1  +---+   |
>>> |                   |
>>> +-------------------+
>>> AMQP 0-9-1 / 0-9 / 0-8
>>> Copyright (C) 2007-2011 VMware, Inc.
>>> Licensed under the MPL. See http://www.rabbitmq.com/
>>>
>>> node : rabbit at brokerm02p
>>> app descriptor : /usr/lib/rabbitmq/lib/rabbitmq_server-2.6.1/sbin/../ebin/rabbit.app
>>> home dir : /var/lib/rabbitmq
>>> config file(s) : /etc/rabbitmq/rabbitmq.config
>>> cookie hash : Mg6GXWPn9Lrj9HC/D14CWA==
>>> log : /var/log/rabbitmq/rabbit at brokerm02p.log
>>> sasl log : /var/log/rabbitmq/rabbit at brokerm02p-sasl.log
>>> database dir : /var/lib/rabbitmq/mnesia/rabbit at brokerm02p
>>> erlang version : 5.8.4
>>>
>>> -- rabbit boot start
>>> starting file handle cache server
>>> ...done
>>> starting worker pool
>>> ...done
>>> starting database
>>> ...BOOT ERROR: FAILED
>>> Reason: {error,
>>> {timeout_waiting_for_tables,
>>> [rabbit_user,rabbit_user_permission,rabbit_vhost,
>>> rabbit_durable_route,rabbit_durable_exchange,
>>> rabbit_durable_queue]}}
>>> Stacktrace: [{rabbit_mnesia,wait_for_tables,1},
>>> {rabbit_mnesia,check_schema_integrity,0},
>>> {rabbit_mnesia,ensure_schema_integrity,0},
>>> {rabbit_mnesia,init,0},
>>> {rabbit,'-run_boot_step/1-lc$^1/1-1-',1},
>>> {rabbit,run_boot_step,1},
>>> {rabbit,'-start/2-lc$^0/1-0-',1},
>>> {rabbit,start,2}]
>>> {"Kernel pid terminated",application_controller,"{application_start_failure,rabbit,{bad_return,{{rabbit,start,[normal,[]]},{'EXIT',{rabbit,failure_during_boot}}}}}"}
>>>
>>> startup_err:
>>>
>>> Erlang has closed
>>>
>>> Crash dump was written to: erl_crash.dump
>>> Kernel pid terminated (application_controller)
>>>
>>> ({application_start_failure,rabbit,{bad_return,{{rabbit,start,[normal,[]]},{'EXIT',{rabbit,failure_during_boot}}}}})
>>
>> _______________________________________________
>> rabbitmq-discuss mailing list
>> rabbitmq-discuss at lists.rabbitmq.com
>> https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss
>
>
>
> --
> Simon MacMullen
> RabbitMQ, VMware