[rabbitmq-discuss] Connection blocked by "flow" for more than 600 seconds
Simon MacMullen
simon at rabbitmq.com
Fri Oct 11 12:29:57 BST 2013
OK, so your screenshot shows 750 queues and 753 connections. Was this
from the same time as you had ~10k file descriptors in use? That sounds
wrong.
I think your publishing connections are going into flow control because
there's a squeeze on file descriptors, which is causing the queues to
have to share a small number of file descriptors between them - thus
slowing them down.
If you do have far more file descriptors in use than queues +
connections, do you have any exotic plugins in use? What does "lsof -lnp
<pid of server process>" say?
Cheers, Simon
On 11/10/2013 3:22AM, Choo wrote:
> Hi Simon,
>
> As memory is plenty, I found that file descriptors hit the default limit,
> so, I bumped the limit up to 5,120, and finally to 10,240 on each nodes. It
> turned out that the file descriptors also touched the limit (around 10,086),
> and things started to go downhill.
>
> <http://rabbitmq.1065348.n5.nabble.com/file/n30402/ScreenShot.jpg>
>
> I started processes in reverse order, by starting subscriber-side first
> (1:42), then the bigger publishers later (1:45). The number of published
> messages bounced up&down, then just after 1:48, the most of the publishers
> were blocked.
>
> There are more than 350 of blocked connections like below now (and file
> descriptors are running at 7,558 + 4,647 on 2 nodes):
> 10.95.212.11:33751 -> 10.95.212.13:5672 blocked 1261.558817 flow
> 10.95.212.11:33752 -> 10.95.212.13:5672 blocked 1326.324919 flow
> 10.95.212.11:33753 -> 10.95.212.13:5672 blocked 1326.45322 flow
> 10.95.212.11:33754 -> 10.95.212.13:5672 blocked 1278.581221 flow
> 10.95.212.11:33755 -> 10.95.212.13:5672 blocked 1312.584426 flow
> 10.95.212.11:33756 -> 10.95.212.13:5672 blocked 1279.623625 flow
> 10.95.212.11:33757 -> 10.95.212.13:5672 blocked 1294.492535 flow
> 10.95.212.11:33758 -> 10.95.212.13:5672 blocked 1276.134377 flow
> 10.95.212.11:33759 -> 10.95.212.13:5672 blocked 1292.862081 flow
> 10.95.212.11:33760 -> 10.95.212.13:5672 blocked 1290.695249 flow
> 10.95.212.11:33761 -> 10.95.212.13:5672 blocked 1255.599642 flow
> 10.95.212.11:33762 -> 10.95.212.13:5672 blocked 1284.984752 flow
>
> Please kindly suggest.
>
> Thank you and Best Regards,
> Choo
>
>
>
> --
> View this message in context: http://rabbitmq.1065348.n5.nabble.com/Connection-blocked-by-flow-for-more-than-600-seconds-tp30349p30402.html
> Sent from the RabbitMQ mailing list archive at Nabble.com.
> _______________________________________________
> rabbitmq-discuss mailing list
> rabbitmq-discuss at lists.rabbitmq.com
> https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss
>
--
Simon MacMullen
RabbitMQ, Pivotal
More information about the rabbitmq-discuss
mailing list