[rabbitmq-discuss] Frequent crashes on one cluster node but not the other

Matthew Ward mward at globalpersonals.co.uk
Fri Jul 20 10:06:08 BST 2012


I've just brought a two-node RabbitMQ cluster online after updating it to RabbitMQ 2.8.2 and we're seeing an issue with the first node that was brought up where it seems to be constantly crashing (but not actually bringing RabbitMQ down, as it continues to function throughout) with the following repeating set of errors - just wondering if anyone can give a heads-up as to what the error message for the original crash indicates, and whether we can do anything to mitigate the child termination/shutdown?

=CRASH REPORT==== 20-Jul-2012::09:58:15 ===
  crasher:
    initial call: gen:init_it/6
    pid: <0.29185.2>
    registered_name: []
    exception exit: {function_clause,
                        [{rabbit_binary_parser,parse_field_value,
                             [<<"\tlongitude">>],
                             []},
                         {rabbit_binary_parser,parse_table,1,[]},
                         {rabbit_binary_parser,parse_table,1,[]},
                         {rabbit_framing_amqp_0_9_1,decode_properties,2,[]},
                         {rabbit_binary_parser,ensure_content_decoded,1,[]},
                         {rabbit_channel,handle_method,3,[]},
                         {rabbit_channel,handle_cast,2,[]},
                         {gen_server2,handle_msg,2,[]}]}
      in function  gen_server2:terminate/3 
    ancestors: [<0.29182.2>,<0.29180.2>,<0.29178.2>,rabbit_tcp_client_sup,
                  rabbit_sup,<0.201.0>]
    messages: []
    links: [<0.29182.2>]
    dictionary: [{{credit_to,<0.29181.2>},49},
                  {{xtype_to_module,topic},rabbit_exchange_type_topic},
                  {permission_cache,[{{resource,<<"/">>,exchange,<<"event">>},
                                      write},
                                     {{resource,<<"/">>,exchange,<<"event">>},
                                      configure}]}]
    trap_exit: true
    status: running
    heap_size: 1597
    stack_size: 24
    reductions: 1189
  neighbours:

=SUPERVISOR REPORT==== 20-Jul-2012::09:58:15 ===
     Supervisor: {<0.29182.2>,rabbit_channel_sup}
     Context:    child_terminated
     Reason:     {function_clause,
                     [{rabbit_binary_parser,parse_field_value,
                          [<<"\tlongitude">>],
                          []},
                      {rabbit_binary_parser,parse_table,1,[]},
                      {rabbit_binary_parser,parse_table,1,[]},
                      {rabbit_framing_amqp_0_9_1,decode_properties,2,[]},
                      {rabbit_binary_parser,ensure_content_decoded,1,[]},
                      {rabbit_channel,handle_method,3,[]},
                      {rabbit_channel,handle_cast,2,[]},
                      {gen_server2,handle_msg,2,[]}]}
     Offender:   [{pid,<0.29185.2>},
                  {name,channel},
                  {mfa,
                      {rabbit_channel,start_link,
                          [1,<0.29181.2>,<0.29183.2>,<0.29181.2>,
                           "10.177.131.144:34764 -> 192.168.1.56:5672",
                           rabbit_framing_amqp_0_9_1,
                           {user,<<"wld">>,[],rabbit_auth_backend_internal,
                               {internal_user,<<"wld">>,
                                   <<202,1,90,179,53,221,214,208,253,59,77,117,
                                     206,159,235,145,183,209,199,210>>,
                                   []}},
                           <<"/">>,[],<0.29179.2>,
                           {token,<0.29184.2>,false}]}},
                  {restart_type,intrinsic},
                  {shutdown,4294967295},
                  {child_type,worker}]


=SUPERVISOR REPORT==== 20-Jul-2012::09:58:15 ===
     Supervisor: {<0.29182.2>,rabbit_channel_sup}
     Context:    shutdown
     Reason:     reached_max_restart_intensity
     Offender:   [{pid,<0.29185.2>},
                  {name,channel},
                  {mfa,
                      {rabbit_channel,start_link,
                          [1,<0.29181.2>,<0.29183.2>,<0.29181.2>,
                           "10.177.131.144:34764 -> 192.168.1.56:5672",
                           rabbit_framing_amqp_0_9_1,
                           {user,<<"wld">>,[],rabbit_auth_backend_internal,
                               {internal_user,<<"wld">>,
                                   <<202,1,90,179,53,221,214,208,253,59,77,117,
                                     206,159,235,145,183,209,199,210>>,
                                   []}},
                           <<"/">>,[],<0.29179.2>,
                           {token,<0.29184.2>,false}]}},
                  {restart_type,intrinsic},
                  {shutdown,4294967295},
                  {child_type,worker}]



---
Matthew Ward
Operations Engineer
Global Personals Limited

Email: mward at globalpersonals.co.uk
Phone: +44 1753 757552 (x406)


-- 


Global Personals is a limited company registered in England and Wales.

Registered number: 04880697

Registered office: Minton Place, Victoria Street, Windsor, Berkshire, SL4 
1EG, United Kingdom
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20120720/e0f1a68c/attachment.htm>


More information about the rabbitmq-discuss mailing list