[rabbitmq-discuss] rabbitmq-discuss Digest, Vol 18, Issue 34
Ilya Grigorik
ilya at aiderss.com
Mon Nov 17 04:25:07 GMT 2008
Man, I feel silly.. I don't know how I haven't caught this before, but the
box I'm running the tests on has no swap partition - completely disabled, I
must have booted the wrong image!
Long story short:
/usr/sbin/rabbitmq-server: line 76: 28477 Killed erl -pa
"`dirname $0`/../ebin" ${START_RABBIT} -sname ${NODENAME} -boot start_sasl
+W w ${ERL_ARGS} -rabbit tcp_listeners '[{"'${NODE_IP_ADDRESS}'",
'${NODE_PORT}'}]' -sasl errlog_type error -kernel error_logger
'{file,"'${LOGS}'"}' -sasl sasl_error_logger '{file,"'${SASL_LOGS}'"}'
-os_mon start_cpu_sup true -os_mon start_disksup false -os_mon start_memsup
true -os_mon start_os_sup false -os_mon memsup_system_only true -os_mon
system_memory_high_watermark 0.90 -mnesia dir "\"${MNESIA_DIR}\""
${CLUSTER_CONFIG} ${RABBIT_ARGS} "$@"
---
Nov 16 04:11:45 ip-10-251-102-223 kernel: Out of Memory: Kill process 28470
(rabbitmq-server) score 1084241 and children.
Nov 16 04:11:45 ip-10-251-102-223 kernel: Out of memory: Killed process
28477 (beam.smp).
Nov 16 04:11:45 ip-10-251-102-223 kernel: oom-killer: gfp_mask=0x201d2,
order=0
---
-rw-r--r-- 1 root root 8 Nov 16 04:11 rabbit_persister.LOG
-rw-r--r-- 1 root root 368M Nov 16 04:11 rabbit_persister.LOG.previous
---
Needless to say, nothing to recover in the first log file (the process must
have been killed while dumping from memory). Renaming the .previous file
brought rabbit back online in ~30 seconds. What's interesting is that the
persister size is really small, not sure how the process could have run out
of memory? All messages sent to it very marked as persistent. (it was
running overnight)
I'll do some testing with SWAP tomorrow.
ig
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20081116/0d19718a/attachment.htm
More information about the rabbitmq-discuss
mailing list