[rabbitmq-discuss] Many RabbitMQ Status Processes

Tim Watson tim at rabbitmq.com
Thu Mar 21 12:00:08 GMT 2013


Hi Eric,

On 19 Mar 2013, at 16:18, Eric Berg wrote:
> Hi! I have a 2-node cluster running for a prototype we are developing; it handles over 1000 queues and over 1000 msg/s when running at full speed. We recently had an issue with some runaway processes that published messages from production app servers via the PHP library:
[snip]
> The cause of that is still under investigation, but it left many thousands of 'stale' idle connections that rabbit was maintaining.

How did you identify that - using the management UI? What version of rabbit are you running, and on which OS/platform?

> Attempting to stop/start each node hangs indefinitely; the only way to get a full restart of the server is to actually restart the hardware. I see 58 'status' processes attributed to rabbit:
> 
> rabbitmq 16066 16033  0 Mar18 ?        00:00:00 inet_gethost 4
> rabbitmq 16067 16066  0 Mar18 ?        00:00:00 inet_gethost 4
> root     17713 20340  0 05:27 ?        00:00:00 /bin/sh /etc/init.d/rabbitmq-server status
> root     17716 17713  0 05:27 ?        00:00:00 /bin/sh /usr/sbin/rabbitmqctl status
> root     17725 17716  0 05:27 ?        00:00:00 su rabbitmq -s /bin/sh -c /usr/lib/rabbitmq/bin/rabbitmqctl  "status"
> rabbitmq 17726 17725  0 05:27 ?        00:00:00 /usr/lib64/erlang/erts-5.8.5/bin/beam.smp -- -root /usr/lib64/erlang -progname erl -- -home /var/lib/rabbitmq -- -pa /usr/lib/rabbitmq/lib/rabbitmq_server-3.0.1/sbin/../ebin -noshell -noinput -hidden -sname rabbitmqctl17726 -s rabbit_control_main -nodename rabbit@rabbit -extra status
> 

Where's the 58 coming from?
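
Each status invocation shows up as several lines in a listing like the one above (the init script, the `rabbitmqctl` wrapper, `su`, and the Erlang VM), so the number depends on what you count. One way to get a reproducible figure is to count only the Erlang VM processes started for `rabbitmqctl status`. A minimal sketch in Python, assuming `ps -ef` style output; the function name is purely illustrative and not part of any RabbitMQ tooling:

```python
def count_status_invocations(ps_output):
    """Count Erlang VM processes that were started for `rabbitmqctl status`.

    Assumption: each invocation runs one VM whose command line contains
    `-s rabbit_control_main` and ends with `-extra status`, as in the
    listing quoted above.
    """
    count = 0
    for line in ps_output.splitlines():
        fields = line.split()
        # Match on the control module name and the trailing command.
        if "rabbit_control_main" in fields and fields[-2:] == ["-extra", "status"]:
            count += 1
    return count
```

Feeding it the saved output of `ps -ef` gives one count per hung status call, rather than one per line that happens to mention "status".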

> 
> When stopping the node I see this in the log file:
> =INFO REPORT==== 19-Mar-2013::11:18:32 ===
> Stopping RabbitMQ
> 
> =INFO REPORT==== 19-Mar-2013::11:18:32 ===
>     application: rabbitmq_management_visualiser
>     exited: stopped
>     type: temporary
> 
> =INFO REPORT==== 19-Mar-2013::11:18:32 ===
>     application: rabbitmq_federation_management
>     exited: stopped
>     type: temporary
> 

Those look perfectly normal. I thought you said you couldn't stop the node(s) at all?

> 
> Has anyone seen issues with many thousands of stale connections, in combination with many status processes and hanging stop/start? Currently there are no producers or consumers connected to this box.

The 'status' processes are only ever launched by the init daemon or by a user issuing `/etc/init.d/rabbitmq-server status` (which in turn runs `rabbitmqctl status`, as your process listing shows) - are you sure you don't have some monitoring infrastructure that is triggering these commands?
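
To see what is launching them, you can walk up the parent PIDs from one of the hung processes. A rough sketch in Python, again assuming `ps -ef` style output with the usual UID, PID, PPID, C, STIME, TTY, TIME, CMD columns; both helper names are made up for illustration:

```python
def parse_ps(ps_output):
    """Map pid -> (ppid, command) from `ps -ef` style lines."""
    table = {}
    for line in ps_output.splitlines():
        # Columns: UID PID PPID C STIME TTY TIME CMD (CMD may contain spaces).
        fields = line.split(None, 7)
        if len(fields) < 8 or not fields[1].isdigit() or not fields[2].isdigit():
            continue  # skip the header row and anything malformed
        table[int(fields[1])] = (int(fields[2]), fields[7])
    return table


def parent_chain(table, pid):
    """Return [pid, parent, grandparent, ...] as far as the listing allows.

    The last entry may be a PID that is not in the table; that is the
    process to look up next (for example with `ps -fp <pid>`).
    """
    chain = [pid]
    seen = {pid}
    while chain[-1] in table:
        ppid = table[chain[-1]][0]
        if ppid == 0 or ppid in seen:
            break  # reached the top of the tree, or a loop in bad input
        chain.append(ppid)
        seen.add(ppid)
    return chain
```

In your listing, the chain from 17726 leads up through 17725, 17716 and 17713 to PID 20340, which is not shown; whatever that process is, it is what ran the init script's status command.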

Cheers,
Tim

> 
> Thanks!
> _______________________________________________
> rabbitmq-discuss mailing list
> rabbitmq-discuss at lists.rabbitmq.com
> https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss


