[rabbitmq-discuss] Strange start error issue
Samuel Chen
samuel.net at gmail.com
Fri Sep 21 03:46:58 BST 2012
Hi Simon,
liubida is my colleague. So we are facing the same problem. We know these
each other's mail at the next morning :)
Thank you very much for you kindly answer.
Samuel
On Fri, Sep 21, 2012 at 1:05 AM, Simon MacMullen <simon at rabbitmq.com> wrote:
> Hi.
>
> I think you must be from the same organisation as "liubida", the log file
> is identical, so see my answer there:
>
> http://lists.rabbitmq.com/**pipermail/rabbitmq-discuss/**
> 2012-September/022560.html<http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/2012-September/022560.html>
>
> Cheers, Simon
>
>
> On 19/09/12 17:10, Samuel Chen wrote:
>
>> Hi all
>>
>> I'm new to RabbitMQ. We are facing a strange issue against
>> rabbit_mgmt_db. I could not find any similar issues by searching on
>> google or stackoverflow. So I wonder if anyone could help to diagnose
>> this problem. Thanks in advance.
>>
>> We have a 2-node clustered RabbitMQ integrated with Celery. It worked
>> well for several months.
>> The issue occurred the first time on Jul 4th. After restarted, it worked
>> for about 2 months. Yesterday the issue occurred twice (one is after
>> restarted).
>> The stat was that a child (rabbit_mgmt_db??) was killed automatically.
>> By some failures of restarting automatically, it reached the max
>> restart intensity. Eventually it was shutdown.
>> (Anther situation is that we deployed to 2-node cluster from 1 node
>> server at the end of June. Note sure if it caused this issue.)
>>
>> The hosts are virtual servers with 8/12G ram and 30G disk.
>> One node is disc node and the other is ram.
>> The load balance was very low (around 100M ram, few tasks) . Disk has
>> 2.5G free space.
>> Log as below.
>>
>> Thanks for any help.
>>
>> SUPERVISOR REPORT==== 18-Sep-2012::14:53:13 ===
>>
>> Supervisor: {<0.28802.1777>,mirrored_**supervisor}
>>
>> Context: child_terminated
>>
>> Reason: killed
>>
>> Offender: [{pid,<0.28804.1777>},
>>
>> {name,rabbit_mgmt_db},
>>
>> {mfa,{rabbit_mgmt_db,start_**link,[]}},
>>
>> {restart_type,permanent},
>>
>> {shutdown,4294967295},
>>
>> {child_type,worker}]
>>
>>
>>
>> =SUPERVISOR REPORT==== 18-Sep-2012::14:53:14 ===
>>
>> Supervisor: {<0.28802.1777>,mirrored_**supervisor}
>>
>> Context: start_error
>>
>> Reason: {already_started,<19704.13906.**2335>}
>>
>> Offender: [{pid,<0.28804.1777>},
>>
>> {name,rabbit_mgmt_db},
>>
>> {mfa,{rabbit_mgmt_db,start_**link,[]}},
>>
>> {restart_type,permanent},
>>
>> {shutdown,4294967295},
>>
>> {child_type,worker}]
>>
>>
>>
>> =SUPERVISOR REPORT==== 18-Sep-2012::14:53:14 ===
>>
>> Supervisor: {<0.28802.1777>,mirrored_**supervisor}
>>
>> Context: start_error
>>
>> Reason: {already_started,<19704.13906.**2335>}
>>
>> Offender: [{pid,<0.28804.1777>},
>>
>> {name,rabbit_mgmt_db},
>>
>> {mfa,{rabbit_mgmt_db,start_**link,[]}},
>>
>> {restart_type,permanent},
>>
>> {shutdown,4294967295},
>>
>> {child_type,worker}]
>>
>>
>>
>> =SUPERVISOR REPORT==== 18-Sep-2012::14:53:14 ===
>>
>> Supervisor: {<0.28802.1777>,mirrored_**supervisor}
>>
>> Context: start_error
>>
>> Reason: {already_started,<19704.13906.**2335>}
>>
>> Offender: [{pid,<0.28804.1777>},
>>
>> {name,rabbit_mgmt_db},
>>
>> {mfa,{rabbit_mgmt_db,start_**link,[]}},
>>
>> {restart_type,permanent},
>>
>> {shutdown,4294967295},
>>
>> {child_type,worker}]
>>
>>
>>
>> =SUPERVISOR REPORT==== 18-Sep-2012::14:53:14 ===
>>
>> Supervisor: {<0.28802.1777>,mirrored_**supervisor}
>>
>> Context: start_error
>>
>> Reason: {already_started,<19704.13906.**2335>}
>>
>> Offender: [{pid,<0.28804.1777>},
>>
>> {name,rabbit_mgmt_db},
>>
>> {mfa,{rabbit_mgmt_db,start_**link,[]}},
>>
>> {restart_type,permanent},
>>
>> {shutdown,4294967295},
>>
>> {child_type,worker}]
>>
>>
>>
>> =SUPERVISOR REPORT==== 18-Sep-2012::14:53:14 ===
>>
>> Supervisor: {<0.28802.1777>,mirrored_**supervisor}
>>
>> Context: start_error
>>
>> Reason: {already_started,<19704.13906.**2335>}
>>
>> Offender: [{pid,<0.28804.1777>},
>>
>> {name,rabbit_mgmt_db},
>>
>> {mfa,{rabbit_mgmt_db,start_**link,[]}},
>>
>> {restart_type,permanent},
>>
>> {shutdown,4294967295},
>>
>> {child_type,worker}]
>>
>>
>>
>> =SUPERVISOR REPORT==== 18-Sep-2012::14:53:14 ===
>>
>> Supervisor: {local,rabbit_mgmt_sup}
>>
>> Context: shutdown
>>
>> Reason: reached_max_restart_intensity
>>
>> Offender: [{pid,<0.28803.1777>},
>>
>> {name,mirroring},
>>
>> {mfa,
>>
>> {mirrored_supervisor,start_**internal,
>>
>> [rabbit_mgmt_sup,
>>
>> [{rabbit_mgmt_db,
>>
>> {rabbit_mgmt_db,start_link,[]}**
>> ,
>>
>> permanent,4294967295,worker,
>>
>> [rabbit_mgmt_db]}]]}},
>>
>> {restart_type,permanent},
>>
>> {shutdown,4294967295},
>>
>> {child_type,worker}]
>>
>>
>>
>> - Sam
>>
>>
>>
>> ______________________________**_________________
>> rabbitmq-discuss mailing list
>> rabbitmq-discuss at lists.**rabbitmq.com<rabbitmq-discuss at lists.rabbitmq.com>
>> https://lists.rabbitmq.com/**cgi-bin/mailman/listinfo/**rabbitmq-discuss<https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss>
>>
>
>
> --
> Simon MacMullen
> RabbitMQ, VMware
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20120921/42d90a50/attachment.htm>
More information about the rabbitmq-discuss
mailing list