[rabbitmq-discuss] RabbitMQ 3.2.0 keeps crashing
Lauri Fjällström
lauri.fjallstrom at gmail.com
Tue Nov 19 05:31:01 GMT 2013
Hi,
I'm running two RabbitMQ nodes in a cluster on Ubuntu 12.04. I have a
virtual host there and a ha policy for a queue. Both nodes are disc nodes.
During the last 24 hours RabbitMQ crashed twice. Producers are able to
continue publishing messages but they don't end up routed or something.
rabbitmqctl list_queues does not list the queue. In the HTTP management
console the queue is listed but the page only shows:
TypeError: Cannot read property 'ram_msg_count' of undefined
TypeError: Cannot read property 'ram_msg_count' of undefined
There should not anything special going on on the servers. Also, they are
not under heavy load or something.
After the last crash the log file on both nodes were empty. Unfortunately I
didn't backup the logs after the first crashes. I was able to find the
following information in xxx-sasl.log:
=CRASH REPORT==== 19-Nov-2013::04:17:22 ===
crasher:
initial call: gen:init_it/6
pid: <0.259.0>
registered_name: []
exception exit: {{badmatch,{error,not_found}},
[{rabbit_mirror_queue_master,stop_all_slaves,2},
{rabbit_mirror_queue_master,delete_and_terminate,2},
{rabbit_amqqueue_process,'-terminate_delete/3-fun-1-',6},
{rabbit_amqqueue_process,terminate_shutdown,2},
{gen_server2,terminate,3},
{proc_lib,wake_up,3}]}
in function gen_server2:terminate/3
ancestors: [rabbit_amqqueue_sup,rabbit_sup,<0.192.0>]
messages: []
links: [<0.257.0>,<0.452.0>,#Port<0.13309>]
dictionary: [{delegate,delegate_15},
{{credit_to,<0.788.0>},20},
{{#Ref<0.0.0.1271>,fhc_handle},
{handle,{file_descriptor,prim_file,{#Port<0.13309>,20}},
774680,false,0,infinity,[],true,
"/var/lib/rabbitmq/mnesia/rabbit at mq1-api-test
/queues/B1433I2VYA7YIS4WJY5I6NO4R/journal.jif",
[write,binary,raw,read],
[{write_buffer,infinity}],
true,true,
{1384,827437,174496}}},
{{credit_to,<0.563.0>},1},
{fhc_age_tree,{1,
{{1384,827437,174496},
#Ref<0.0.0.1271>,nil,nil}}},
{{"/var/lib/rabbitmq/mnesia/rabbit at mq1-api-test
/queues/B1433I2VYA7YIS4WJY5I6NO4R/journal.jif",
fhc_file},
{file,1,true}},
{credit_blocked,[]},
{{ch,<0.8244.0>},
{cr,<0.8244.0>,#Ref<0.0.10.78984>,
{[],[]},
1,
{queue,[],[],0},
{qstate,<0.8243.0>,active,{0,nil}},
25}},
{{ch,<0.8274.0>},
{cr,<0.8274.0>,#Ref<0.0.10.91831>,
{[],[]},
1,
{queue,[],[],0},
{qstate,<0.8273.0>,active,{0,nil}},
49}},
{{credit_to,<0.13668.0>},41},
{guid,{{1330803274,3383937130,2004047782,2434038047},1}},
{{credit_from,<0.251.0>},1826}]
trap_exit: true
status: running
heap_size: 10946
stack_size: 24
reductions: 1514324983
neighbours:
neighbour: [{pid,<0.453.0>},
{registered_name,[]},
{initial_call,{gen,init_it,
['Argument__1','Argument__2',
'Argument__3','Argument__4',
'Argument__5','Argument__6']}},
{current_function,{gen_server2,process_next_msg,1}},
{ancestors,[<0.452.0>,<0.259.0>,rabbit_amqqueue_sup,
rabbit_sup,<0.192.0>]},
{messages,[]},
{links,[<0.452.0>]},
{dictionary,[{random_seed,{24781,9822,29460}}]},
{trap_exit,false},
{status,waiting},
{heap_size,610},
{stack_size,7},
{reductions,660407699}]
neighbour: [{pid,<0.452.0>},
{registered_name,[]},
{initial_call,{gen,init_it,
['Argument__1','Argument__2',
'Argument__3','Argument__4',
'Argument__5','Argument__6']}},
{current_function,{gen_server2,process_next_msg,1}},
{ancestors,[<0.259.0>,rabbit_amqqueue_sup,rabbit_sup,
<0.192.0>]},
{messages,[]},
{links,[<0.259.0>,<0.453.0>]},
{dictionary,[]},
{trap_exit,false},
{status,waiting},
{heap_size,2584},
{stack_size,7},
{reductions,59821}]
=SUPERVISOR REPORT==== 19-Nov-2013::04:17:22 ===
Supervisor: {local,rabbit_amqqueue_sup}
Context: child_terminated
Reason: {{badmatch,{error,not_found}},
[{rabbit_mirror_queue_master,stop_all_slaves,2},
{rabbit_mirror_queue_master,delete_and_terminate,2},
{rabbit_amqqueue_process,'-terminate_delete/3-fun-1-',6},
{rabbit_amqqueue_process,terminate_shutdown,2},
{gen_server2,terminate,3},
{proc_lib,wake_up,3}]}
Offender: [{pid,<0.259.0>},
{name,rabbit_amqqueue},
{mfargs,{rabbit_amqqueue_process,start_link,undefined}},
{restart_type,temporary},
{shutdown,4294967295},
{child_type,worker}]
=CRASH REPORT==== 19-Nov-2013::04:30:20 ===
crasher:
initial call: gen:init_it/6
pid: <0.13842.0>
registered_name: []
exception exit: {{badmatch,true},
[{rabbit_queue_index,init,2},
{rabbit_variable_queue,init,5},
{rabbit_mirror_queue_master,init,3},
{rabbit_amqqueue_process,declare,3},
{gen_server2,handle_msg,2},
{proc_lib,init_p_do_apply,3}]}
in function gen_server2:terminate/3
ancestors: [rabbit_amqqueue_sup,rabbit_sup,<0.192.0>]
messages: []
links: [<0.257.0>]
dictionary: [{{xtype_to_module,direct},rabbit_exchange_type_direct}]
trap_exit: true
status: running
heap_size: 1597
stack_size: 24
reductions: 1224
neighbours:
=SUPERVISOR REPORT==== 19-Nov-2013::04:30:20 ===
Supervisor: {local,rabbit_amqqueue_sup}
Context: child_terminated
Reason: {{badmatch,true},
[{rabbit_queue_index,init,2},
{rabbit_variable_queue,init,5},
{rabbit_mirror_queue_master,init,3},
{rabbit_amqqueue_process,declare,3},
{gen_server2,handle_msg,2},
{proc_lib,init_p_do_apply,3}]}
Offender: [{pid,<0.13842.0>},
{name,rabbit_amqqueue},
{mfargs,{rabbit_amqqueue_process,start_link,undefined}},
{restart_type,temporary},
{shutdown,4294967295},
{child_type,worker}]
I have no clue what's going on here. Can anyone give me a hint?
Best Regards,
Lauri
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20131119/f581acb0/attachment.htm>
More information about the rabbitmq-discuss
mailing list