[rabbitmq-discuss] Failing on restart and recovering queues

Pierre Payet amnema at gmail.com
Mon Nov 28 19:20:29 GMT 2011


Hello,

I am using a persistent queue to store different URL. After a crash of the
Rabbitmq server, I tried to restart it but it get stuck on the step of
recovering the different exchanges, queues and bindings. I have checked in
/msg_store_persistent directory of the node and it seems the data are still
here.

So here little information about my configuration:
I am using Ubuntu 10.04 64-bit and Rabbitmq 2.7.0

Here is a copy of the startup_log :
-- rabbit boot start
starting file handle cache server
...done
starting worker pool
...done
starting database
...done
starting codec correctness check
...done
-- external infrastructure ready
starting plugin registry
...done
starting auth mechanism cr-demo
...done
starting auth mechanism amqplain
...done
starting auth mechanism plain
...done
starting statistics event manager
...done
starting logging server
...done
starting exchange type direct
...done
starting exchange type fanout
...done
starting exchange type headers
...done
starting exchange type topic
...done
-- kernel ready
starting alarm handler
...done
starting node monitor
...done
starting cluster delegate
...done
starting guid generator
...done
starting memory monitor
...done
-- core initialized
starting empty DB check
...done
starting management agent
...done
starting exchange, queue and binding recovery                         ...


And here is a copy of the crash report on start:
=CRASH REPORT==== 28-Nov-2011::09:09:36 ===
  crasher:
    initial call: gen:init_it/6
    pid: <0.155.0>
    registered_name: []
    exception exit: {{badmatch,eof},
                     [{rabbit_queue_index,load_segment,2},
                      {rabbit_queue_index,segment_entries_foldr,3},
                      {rabbit_queue_index,
                          '-queue_index_walker_reader/2-lc$^0/1-0-',4},
                      {rabbit_queue_index,queue_index_walker_reader,2},
                      {worker_pool_worker,handle_cast,2},
                      {gen_server2,handle_msg,2},
                      {proc_lib,init_p_do_apply,3}]}
      in function  gen_server2:terminate/3
    ancestors: [worker_pool_sup,rabbit_sup,<0.136.0>]
    messages: []
    links: [#Port<0.4788>,<0.151.0>,#Port<0.4724>]
    dictionary: [{{"/var/lib/rabbitmq/mnesia/rabbit at curtiz
/queues/67BZ7QWHYIY1S405V2E0881G9/journal.jif",
                    fhc_file},
                   {file,1,true}},
                  {{xtype_to_module,topic},rabbit_exchange_type_topic},
                  {{#Ref<0.0.0.556>,fhc_handle},
                   {handle,{file_descriptor,prim_file,{#Port<0.4724>,15}},
                           0,false,0,infinity,[],true,
                           "/var/lib/rabbitmq/mnesia/rabbit at curtiz
/queues/67BZ7QWHYIY1S405V2E0881G9/journal.jif",
                           [write,binary,raw,read],
                           [{write_buffer,infinity}],
                           true,true,
                           {1322,500175,181273}}},
                  {worker_pool_worker,true},
                  {{#Ref<0.0.0.79730>,fhc_handle},
                   {handle,{file_descriptor,prim_file,{#Port<0.4788>,17}},
                           0,false,0,0,[],true,
                           "/var/lib/rabbitmq/mnesia/rabbit at curtiz
/queues/67BZ7QWHYIY1S405V2E0881G9/37.idx",
                           [{read_ahead,491520},binary,raw,read],
                           [],false,true,
                           {1322,500176,916570}}},
                  {fhc_age_tree,{2,
                                 {{1322,500175,181273},
                                  #Ref<0.0.0.556>,nil,
                                  {{1322,500176,916570},
                                   #Ref<0.0.0.79730>,nil,nil}}}},
                  {{"/var/lib/rabbitmq/mnesia/rabbit at curtiz
/queues/67BZ7QWHYIY1S405V2E0881G9/37.idx",
                    fhc_file},
                   {file,1,false}}]
    trap_exit: false
    status: running
    heap_size: 514229


And finally the crash report itself:
=SUPERVISOR REPORT==== 23-Nov-2011::17:36:29 ===
     Supervisor: {<0.1220.149>,
                                           rabbit_channel_sup_sup}
     Context:    shutdown_error
     Reason:     shutdown
     Offender:   [{pid,<0.1222.149>},
                  {name,channel_sup},
                  {mfa,{rabbit_channel_sup,start_link,[]}},
                  {restart_type,temporary},
                  {shutdown,infinity},
                  {child_type,supervisor}]


Can anyone help me to recover the messages in the queue ?

Payet P
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20111128/b72efd90/attachment.htm>


More information about the rabbitmq-discuss mailing list