[rabbitmq-discuss] rabbitmq connections blocking but memory is blow watermark

xzhang84 at gmail.com xzhang84 at gmail.com
Fri Mar 1 14:39:55 GMT 2013


we are use rabbitmq in our application,  two hours ago, one of our app server is blocked when try to connect to rabbitmq, after check rabbitmq server ,  we found one node's memory is over watermark, a few minutes later, this node is down. after restart this node, the whole cluster sames work fine, but i notice there's a lot of connection in blocking and blocked state from web management,but use  `rabbitmqctl list_connections pid name peer_address state` in all nodes shows there is no connection in blocking/blocked…so this really make me confuse:  
1.  after  one node of whole cluster over watermark, but other node is work fine, my application can't connect to rabbitmq cluster?  ps:  we use spring.amqp & spring-rabbit with version 1.1.0.RELEASE
2.  node will down for what reason when over watermark?
3.  why after restart node, there is still blocking connection, but with rabbitmqctl they all in running state?

here is some logs from my rabbitmq server:

=INFO REPORT==== 1-Mar-2013::19:36:21 === vm_memory_high_watermark clear. Memory used:1656590680 allowed:1658778419 =INFO REPORT==== 1-Mar-2013::19:36:21 === alarm_handler: {clear,{resource_limit,memory,rabbit at cos22}}

when i try to close blocked connection from web management, it goes error:

=INFO REPORT==== 1-Mar-2013::20:55:24 === Closing connection <0.17197.115> because "Closed via management plugin" =ERROR REPORT==== 1-Mar-2013::20:55:24 === webmachine error: path="/api/connections/10.64.13.200%3A45891%20-%3E%2010.64.12.226%3A5672" {throw, {error,{not_a_connection_pid,<0.17197.115>}}, [{rabbit_networking,close_connection,2, [{file,"src/rabbit_networking.erl"},{line,317}]}, {rabbit_mgmt_wm_connection,delete_resource,2, [{file,"rabbitmq-management/src/rabbit_mgmt_wm_connection.erl"}, {line,52}]}, {webmachine_resource,resource_call,3, [{file, "webmachine-wrapper/webmachine-git/src/webmachine_resource.erl"}, {line,169}]}, {webmachine_resource,do,3, [{file, "webmachine-wrapper/webmachine-git/src/webmachine_resource.erl"}, {line,128}]}, {webmachine_decision_core,resource_call,1, [{file, "webmachine-wrapper/webmachine-git/src/webmachine_decision_core.erl"}, {line,48}]}, {webmachine_decision_core,decision,1, [{file, "webmachine-wrapper/webmachine-git/src/webmachine_decision_core.erl"}, {line,416}]}, {webmachine_decision_core,handle_request,2, [{file, "webmachine-wrapper/webmachine-git/src/webmachine_decision_core.erl"}, {line,33}]}, {rabbit_webmachine,'-makeloop/1-fun-0-',3, [{file,"rabbitmq-mochiweb/src/rabbit_webmachine.erl"},{line,75}]}]}


use rabbitmqctl shows all in running state:

rabbitmqctl list_connections pid name peer_address state
Listing connections ...
<rabbit at cos23.1.1271.51>        10.64.13.197:57321 -> 10.64.12.225:5672 10.64.13.197    running
<rabbit at cos23.1.1100.51>        10.64.13.196:57240 -> 10.64.12.225:5672 10.64.13.196    running
<rabbit at cos23.1.1056.51>        10.64.12.196:58608 -> 10.64.12.225:5672 10.64.12.196    running
<rabbit at cos23.1.1079.51>        10.64.11.235:48962 -> 10.64.12.225:5672 10.64.11.235    running
<rabbit at cos23.1.1419.51>        10.64.13.228:49857 -> 10.64.12.225:5672 10.64.13.228    running
<rabbit at cos23.1.1049.51>        10.64.11.193:36387 -> 10.64.12.225:5672 10.64.11.193    running
<rabbit at cos23.1.1159.51>        10.64.10.123:52017 -> 10.64.12.225:5672 10.64.10.123    running
<rabbit at cos23.1.26289.45>       10.64.12.247:38504 -> 10.64.12.225:5672 10.64.12.247    running
<rabbit at cos23.1.1121.51>        10.64.10.29:51483 -> 10.64.12.225:5672  10.64.10.29     running
<rabbit at cos23.1.1067.51>        10.64.11.234:50244 -> 10.64.12.225:5672 10.64.11.234    running
<rabbit at cos23.1.1149.51>        10.64.11.178:33795 -> 10.64.12.225:5672 10.64.11.178    running
<rabbit at cos23.1.1136.51>        10.64.10.28:39557 -> 10.64.12.225:5672  10.64.10.28     running
<rabbit at cos23.1.1370.51>        10.64.13.233:38766 -> 10.64.12.225:5672 10.64.13.233    running
<rabbit at cos23.1.1388.51>        10.64.13.229:50932 -> 10.64.12.225:5672 10.64.13.229    running
<rabbit at cos23.1.1254.51>        10.64.13.241:49311 -> 10.64.12.225:5672 10.64.13.241    running
<rabbit at cos23.1.1031.51>        10.64.11.195:39455 -> 10.64.12.225:5672 10.64.11.195    running
<rabbit at cos23.1.1038.51>        10.64.10.27:58938 -> 10.64.12.225:5672  10.64.10.27     running
<rabbit at cos23.1.1167.51>        10.64.13.240:37777 -> 10.64.12.225:5672 10.64.13.240    running
<rabbit at cos23.1.1442.51>        10.64.10.130:37251 -> 10.64.12.225:5672 10.64.10.130    running
<rabbit at cos22.3.2659.0> 10.64.13.200:54840 -> 10.64.12.226:5672 10.64.13.200    running
...done.



and there is a connection with a lot of channel is in blocked state, but i can't find this connection by use rabbitctl list_connections:

AMQP 0-9-1
10.64.13.200:45891 -> 10.64.12.226:5672 (http://10.64.12.226:55672/#/connections/10.64.13.200%3A45891%20-%3E%2010.64.12.226%3A5672)
rabbit at cos22
0B/s(49.2MB total)
0B/s(2.4MB total)
0s
60920





thanks a lot for any help and suggestion.

--  
xzhang84 at gmail.com
已使用 Sparrow (http://www.sparrowmailapp.com/?sig)

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20130301/da22c7ab/attachment.htm>


More information about the rabbitmq-discuss mailing list