<html><head></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">There ought to be further information in the log files from the brokers in question during the stop operation. Can you post that, or put it somewhere accessible please? Why do you have both `rabbitmq-server stop' and `rabbitmqctl stop' running at the same time? Are those pointing to different rabbits? <div><div><br><div><div>On 10 Dec 2013, at 01:32, Tyrrill, Ed wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"><div style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; color: rgb(0, 0, 0); font-size: 14px; "><div style="font-family: Calibri, sans-serif; ">Hi All,</div><div style="font-family: Calibri, sans-serif; "><br></div><div style="font-family: Calibri, sans-serif; ">We are using rabbitmq-server rpms on linux. Recently we upgraded from 3.1.1-1 to 3.2.0-1, and we are seeing intermittent hangs when stopping rabbitmq. Here is the ps output:</div><div style="font-family: Calibri, sans-serif; "><br></div><div><div><div><font class="Apple-style-span" face="Courier">root 31052 31051 0 Dec06 ? 00:00:00 /bin/sh /sbin/service rabbitmq-server stop</font></div><div><font class="Apple-style-span" face="Courier">root 31055 31052 0 Dec06 ? 00:00:00 /bin/sh /etc/init.d/rabbitmq-server stop</font></div><div><font class="Apple-style-span" face="Courier">root 31100 31055 0 Dec06 ? 00:00:00 /bin/sh /usr/sbin/rabbitmqctl stop /var/run/rabbitmq/pid</font></div><div><font class="Apple-style-span" face="Courier">root 31111 31100 0 Dec06 ? 00:00:00 su rabbitmq -s /bin/sh -c /usr/lib/rabbitmq/bin/rabbitmqctl "stop" "/var/run/rabbitmq/pid"</font></div><div><font class="Apple-style-span" face="Courier">rabbitmq 31112 31111 0 Dec06 ? 00:24:10 /usr/lib64/erlang/erts-5.10.3/bin/beam.smp -- -root /usr/lib64/erlang -progname erl -- -home /var/lib/rabbitmq -- -pa /usr</font><span class="Apple-style-span" style="font-family: Courier; ">/lib/rabbitmq/lib/rabbitmq_server-3.2.0/sbin/../ebin -noshell -noinput -hidden -sname rabbitmqctl31112 -boot start_clean -s rabbit_control_main -nodename rabbit@vm-ave29 </span><span class="Apple-style-span" style="font-family: Courier; ">-extra stop /var/run/rabbitmq/pid</span></div></div></div><div style="font-family: Calibri, sans-serif; "><br></div><div style="font-family: Calibri, sans-serif; ">The CPU time column on the erlang process does slowly go up. I don't know if it plays a factor, but this broker has shovels defined to a remote broker, and the remote broker was down at the time of this stop.</div><div style="font-family: Calibri, sans-serif; "><br></div></div></blockquote><div><br></div><div>How long do these hangs take? The shovel workers will wait 10 seconds for both their inbound and outbound connections to close cleanly. If you examine the log files for both the source and destination (i.e., remote) brokers during the shutdown, there may be some useful indication of whether this is the cause of the problem or not.</div><br><blockquote type="cite"><div style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; color: rgb(0, 0, 0); font-size: 14px; "><div style="font-family: Calibri, sans-serif; ">Is this a known issue? We've been seeing this a couple times a week (over > 100 brokers), and I need to get a fix for this.</div><div style="font-family: Calibri, sans-serif; "><br></div></div></blockquote><div><br></div><div>We have fixed bugs with shutdown delays and deadlocks in the past, but they're mostly dusted and released now. We do have an open issue that can cause long delays during broker shutdown, which is mediated by having a lot of durable queues (regardless of whether they contain messages or not). Could that be what you're seeing? How many durable queues do these brokers have running on them?</div><div><br></div><div>Tim</div></div></div></div></body></html>