[rabbitmq-discuss] RabbitMQ 3.2.0 keeps crashing

Lauri Fjällström lauri.fjallstrom at gmail.com
Tue Nov 19 05:31:01 GMT 2013


Hi,

I'm running two RabbitMQ nodes in a cluster on Ubuntu 12.04. I have a
virtual host there and an HA policy for a queue. Both nodes are disc nodes.
During the last 24 hours RabbitMQ crashed twice. Producers are able to
continue publishing messages, but the messages don't seem to get routed
anywhere. rabbitmqctl list_queues does not list the queue. In the HTTP
management console the queue is listed, but the page only shows:

TypeError: Cannot read property 'ram_msg_count' of undefined
TypeError: Cannot read property 'ram_msg_count' of undefined

There should not be anything special going on on the servers. Also, they are
not under heavy load.

After the last crash the log files on both nodes were empty. Unfortunately I
didn't back up the logs after the first crashes. I was able to find the
following information in xxx-sasl.log:

=CRASH REPORT==== 19-Nov-2013::04:17:22 ===
  crasher:
    initial call: gen:init_it/6
    pid: <0.259.0>
    registered_name: []
    exception exit: {{badmatch,{error,not_found}},
                     [{rabbit_mirror_queue_master,stop_all_slaves,2},
                      {rabbit_mirror_queue_master,delete_and_terminate,2},
                      {rabbit_amqqueue_process,'-terminate_delete/3-fun-1-',6},
                      {rabbit_amqqueue_process,terminate_shutdown,2},
                      {gen_server2,terminate,3},
                      {proc_lib,wake_up,3}]}
      in function  gen_server2:terminate/3
    ancestors: [rabbit_amqqueue_sup,rabbit_sup,<0.192.0>]
    messages: []
    links: [<0.257.0>,<0.452.0>,#Port<0.13309>]
    dictionary: [{delegate,delegate_15},
                  {{credit_to,<0.788.0>},20},
                  {{#Ref<0.0.0.1271>,fhc_handle},
                   {handle,{file_descriptor,prim_file,{#Port<0.13309>,20}},
                           774680,false,0,infinity,[],true,
                           "/var/lib/rabbitmq/mnesia/rabbit@mq1-api-test/queues/B1433I2VYA7YIS4WJY5I6NO4R/journal.jif",
                           [write,binary,raw,read],
                           [{write_buffer,infinity}],
                           true,true,
                           {1384,827437,174496}}},
                  {{credit_to,<0.563.0>},1},
                  {fhc_age_tree,{1,
                                 {{1384,827437,174496},
                                  #Ref<0.0.0.1271>,nil,nil}}},
                  {{"/var/lib/rabbitmq/mnesia/rabbit@mq1-api-test/queues/B1433I2VYA7YIS4WJY5I6NO4R/journal.jif",
                    fhc_file},
                   {file,1,true}},
                  {credit_blocked,[]},
                  {{ch,<0.8244.0>},
                   {cr,<0.8244.0>,#Ref<0.0.10.78984>,
                       {[],[]},
                       1,
                       {queue,[],[],0},
                       {qstate,<0.8243.0>,active,{0,nil}},
                       25}},
                  {{ch,<0.8274.0>},
                   {cr,<0.8274.0>,#Ref<0.0.10.91831>,
                       {[],[]},
                       1,
                       {queue,[],[],0},
                       {qstate,<0.8273.0>,active,{0,nil}},
                       49}},
                  {{credit_to,<0.13668.0>},41},
                  {guid,{{1330803274,3383937130,2004047782,2434038047},1}},
                  {{credit_from,<0.251.0>},1826}]
    trap_exit: true
    status: running
    heap_size: 10946
    stack_size: 24
    reductions: 1514324983
  neighbours:
    neighbour: [{pid,<0.453.0>},
                  {registered_name,[]},
                  {initial_call,{gen,init_it,
                                     ['Argument__1','Argument__2',
                                      'Argument__3','Argument__4',
                                      'Argument__5','Argument__6']}},
                  {current_function,{gen_server2,process_next_msg,1}},
                  {ancestors,[<0.452.0>,<0.259.0>,rabbit_amqqueue_sup,
                              rabbit_sup,<0.192.0>]},
                  {messages,[]},
                  {links,[<0.452.0>]},
                  {dictionary,[{random_seed,{24781,9822,29460}}]},
                  {trap_exit,false},
                  {status,waiting},
                  {heap_size,610},
                  {stack_size,7},
                  {reductions,660407699}]
    neighbour: [{pid,<0.452.0>},
                  {registered_name,[]},
                  {initial_call,{gen,init_it,
                                     ['Argument__1','Argument__2',
                                      'Argument__3','Argument__4',
                                      'Argument__5','Argument__6']}},
                  {current_function,{gen_server2,process_next_msg,1}},
                  {ancestors,[<0.259.0>,rabbit_amqqueue_sup,rabbit_sup,
                              <0.192.0>]},
                  {messages,[]},
                  {links,[<0.259.0>,<0.453.0>]},
                  {dictionary,[]},
                  {trap_exit,false},
                  {status,waiting},
                  {heap_size,2584},
                  {stack_size,7},
                  {reductions,59821}]

=SUPERVISOR REPORT==== 19-Nov-2013::04:17:22 ===
     Supervisor: {local,rabbit_amqqueue_sup}
     Context:    child_terminated
     Reason:     {{badmatch,{error,not_found}},
                  [{rabbit_mirror_queue_master,stop_all_slaves,2},
                   {rabbit_mirror_queue_master,delete_and_terminate,2},
                   {rabbit_amqqueue_process,'-terminate_delete/3-fun-1-',6},
                   {rabbit_amqqueue_process,terminate_shutdown,2},
                   {gen_server2,terminate,3},
                   {proc_lib,wake_up,3}]}
     Offender:   [{pid,<0.259.0>},
                  {name,rabbit_amqqueue},
                  {mfargs,{rabbit_amqqueue_process,start_link,undefined}},
                  {restart_type,temporary},
                  {shutdown,4294967295},
                  {child_type,worker}]


=CRASH REPORT==== 19-Nov-2013::04:30:20 ===
  crasher:
    initial call: gen:init_it/6
    pid: <0.13842.0>
    registered_name: []
    exception exit: {{badmatch,true},
                     [{rabbit_queue_index,init,2},
                      {rabbit_variable_queue,init,5},
                      {rabbit_mirror_queue_master,init,3},
                      {rabbit_amqqueue_process,declare,3},
                      {gen_server2,handle_msg,2},
                      {proc_lib,init_p_do_apply,3}]}
      in function  gen_server2:terminate/3
    ancestors: [rabbit_amqqueue_sup,rabbit_sup,<0.192.0>]
    messages: []
    links: [<0.257.0>]
    dictionary: [{{xtype_to_module,direct},rabbit_exchange_type_direct}]
    trap_exit: true
    status: running
    heap_size: 1597
    stack_size: 24
    reductions: 1224
  neighbours:

=SUPERVISOR REPORT==== 19-Nov-2013::04:30:20 ===
     Supervisor: {local,rabbit_amqqueue_sup}
     Context:    child_terminated
     Reason:     {{badmatch,true},
                  [{rabbit_queue_index,init,2},
                   {rabbit_variable_queue,init,5},
                   {rabbit_mirror_queue_master,init,3},
                   {rabbit_amqqueue_process,declare,3},
                   {gen_server2,handle_msg,2},
                   {proc_lib,init_p_do_apply,3}]}
     Offender:   [{pid,<0.13842.0>},
                  {name,rabbit_amqqueue},
                  {mfargs,{rabbit_amqqueue_process,start_link,undefined}},
                  {restart_type,temporary},
                  {shutdown,4294967295},
                  {child_type,worker}]
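
For a quick overview of a SASL log like the one above, a small script can
list each crash report's timestamp and exit reason. This is only a rough
sketch: the function name summarize_crashes and both regular expressions are
assumptions based on the report layout shown in this message, not anything
that ships with RabbitMQ or Erlang.

```python
import re

# Report headers look like: =CRASH REPORT==== 19-Nov-2013::04:17:22 ===
# (pattern assumed from the excerpts above)
HEADER = re.compile(r"^=(?P<kind>[A-Z ]+) REPORT==== (?P<when>\S+) ===\s*$")

# The exit reason line looks like:
#     exception exit: {{badmatch,{error,not_found}},
# Only the first element of the outer tuple (the reason) is captured.
REASON = re.compile(r"exception exit:\s*\{(?P<reason>\{.*\}),\s*$")


def summarize_crashes(text):
    """Return (timestamp, reason) pairs, one per CRASH REPORT in the text."""
    results = []
    when = None  # timestamp of the crash report currently being read
    for line in text.splitlines():
        header = HEADER.match(line)
        if header:
            # Supervisor and other reports are skipped; they repeat the reason.
            when = header.group("when") if header.group("kind") == "CRASH" else None
            continue
        reason = REASON.search(line)
        if when and reason:
            results.append((when, reason.group("reason")))
            when = None
    return results
```

Calling summarize_crashes(open(path).read()), where path is wherever the
SASL log lives on the node, would give one pair per crash. It only handles
reasons that fit on the "exception exit:" line, as both do here.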



I have no clue what's going on here. Can anyone give me a hint?

Best Regards,
Lauri