[rabbitmq-discuss] debugging lost queue on rabbit server switchover
Chris Friesen
chris.friesen at windriver.com
Mon Mar 24 01:39:41 GMT 2014
Hi,
I'm looking for some help with debugging what seems to be a rabbit
server issue.
I've got a rabbit server being used for OpenStack, and it's set up in an
active-standby config using pacemaker.
Yesterday we had some guys doing testing by triggering a controlled
switchover from active to standby and back, multiple times.
One of our other servers started showing some odd behaviour, and when we
dug into it the problem seemed to be that the queue that it thought it
was using didn't seem to exist on the rabbit server anymore.
The message trace of the RPC request shows the following required reply
queue:
reply_5e65eb22a7bd4bd98a5bc6b7c354fa5b
21:16:02.279850 00:1e:67:51:57:09 > 00:1e:67:54:aa:39, ethertype IPv4
(0x0800), length 678: (tos 0x0, ttl 64, id 21886, offset 0, flags [DF],
proto TCP (6), length 664)
192.168.204.194.47946 > 192.168.204.2.5672: Flags [P.], cksum
0x1ca1 (incorrect -> 0xcadb), seq 33263:33875, ack 2765, win 123,
options [nop,nop,TS val 72448895 ecr 74959841], length 612
0x0000: 4500 0298 557e 4000 4006 c8cb c0a8 ccc2 E...U~@.@.......
0x0010: c0a8 cc02 bb4a 1628 dd1a 91e2 01f0 e9ef .....J.(........
0x0020: 8018 007b 1ca1 0000 0101 080a 0451 7b7f ...{.........Q{.
0x0030: 0477 cbe1 0300 0100 0002 5c7b 226f 736c .w........\{"osl
0x0040: 6f2e 6d65 7373 6167 6522 3a20 227b 5c22 o.message":."{\"
0x0050: 5f63 6f6e 7465 7874 5f72 6f6c 6573 5c22 _context_roles\"
0x0060: 3a20 5b5c 2261 646d 696e 5c22 5d2c 205c :.[\"admin\"],.\
0x0070: 225f 6d73 675f 6964 5c22 3a20 5c22 6532 "_msg_id\":.\"e2
0x0080: 6135 3465 6630 6466 3334 3436 3132 6133 a54ef0df344612a3
0x0090: 3363 3464 6431 6231 6334 3238 3164 5c22 3c4dd1b1c4281d\"
0x00a0: 2c20 5c22 5f63 6f6e 7465 7874 5f72 6561 ,.\"_context_rea
0x00b0: 645f 6465 6c65 7465 645c 223a 205c 226e d_deleted\":.\"n
0x00c0: 6f5c 222c 205c 225f 7265 706c 795f 715c o\",.\"_reply_q\
0x00d0: 223a 205c 2272 6570 6c79 5f35 6536 3565 ":.\"reply_5e65e
0x00e0: 6232 3261 3762 6434 6264 3938 6135 6263 b22a7bd4bd98a5bc
0x00f0: 3662 3763 3335 3466 6135 625c 222c 205c 6b7c354fa5b\",.\
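For anyone who wants to pull the reply queue name out of a capture like this without reading the ASCII column by eye, here is a minimal Python sketch. It assumes the usual `tcpdump -X` layout (an offset, up to eight hex groups, then the ASCII column); the helper names `payload_bytes` and `reply_queue` are made up for illustration and are not part of any rabbit or OpenStack tooling. The sample data is the hex from the last four lines of the trace above, with the ASCII column left off.

```python
import re

# Hex columns copied from offsets 0x00c0-0x00f0 of the trace above
# (ASCII column omitted here; the parser tolerates it when present).
DUMP = """\
0x00c0: 6f5c 222c 205c 225f 7265 706c 795f 715c
0x00d0: 223a 205c 2272 6570 6c79 5f35 6536 3565
0x00e0: 6232 3261 3762 6434 6264 3938 6135 6263
0x00f0: 3662 3763 3335 3466 6135 625c 222c 205c
"""


def payload_bytes(dump: str) -> bytes:
    """Collect the hex columns of tcpdump -X style lines into raw bytes."""
    out = bytearray()
    for line in dump.splitlines():
        m = re.match(r"\s*0x[0-9a-f]+:\s+(.*)", line)
        if not m:
            continue  # not a hex-dump line (e.g. the packet header lines)
        groups = []
        for tok in m.group(1).split():
            # Stop after 8 groups, or at the first token that is not a
            # 2- or 4-digit hex group: that is where the ASCII column starts.
            # Assumption: a short final line whose ASCII text happens to look
            # like a hex group would be misread; good enough for a sketch.
            if len(groups) == 8 or not re.fullmatch(r"[0-9a-f]{2}|[0-9a-f]{4}", tok):
                break
            groups.append(tok)
        out += bytes.fromhex("".join(groups))
    return bytes(out)


def reply_queue(dump: str):
    """Return the first reply_<hex> name following _reply_q, or None."""
    text = payload_bytes(dump).decode("latin-1")
    m = re.search(r"_reply_q\W+(reply_[0-9a-f]+)", text)
    return m.group(1) if m else None


if __name__ == "__main__":
    print(reply_queue(DUMP))
```

The `\W+` in the pattern skips over the escaped quotes, colon, and space that sit between the key and the value in the JSON-in-JSON payload.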
However, rabbitmq does not have any information for this queue:
rabbitmqadmin -H 192.168.204.2 list queues name
| reply_09e9cf08bfd84f8f903b527080ac28fc |
| reply_1d7e7b12580e46109c5a01e54b5cfbbf |
| reply_215c6a00191f480daafa9daa18949700 |
| reply_22f18026daad41219463ce64fae3d33f |
| reply_2d2486e8b9264f8a98086ec59ddb24a8 |
| reply_2fc4fac0fa3d4345a2bd9d6313694a97 |
| reply_3022cb2f5cae45de897fac4a6bf6ac45 |
| reply_33636e6f24bd429a8f91776b493c8c8f |
| reply_42fd8c79cb3440c4b35134b973dba709 |
| reply_484bf7df103a44a6ba0860198eb642e8 |
| reply_515d88f6ebdc4c3cb1e46064b900cd56 |
| reply_63f15fb7c91d4bd7b6803e85fd61788a |
| reply_6716118331874dc4a5aa65b001546647 |
| reply_67382c459e4849c28d80a02a4c9e3086 |
| reply_690f93f4379d49a9bedef3a872b02f5b |
| reply_6b665fbbd5c3487f9d829e4f461c1ab3 |
| reply_6c9aa49582f347d28fa3c928a71083e9 |
| reply_6f4dcbf71d864038872177514633387f |
| reply_81f3d1b37c7e4cdfb064c8b28321e4c1 |
| reply_84391d9200b34b64bede108723decb13 |
| reply_9536e8be4ca4440ea894486814b60f6d |
| reply_96ab63f2ba0a4fe69adabbd70e974dfb |
| reply_9ebe0711815a4808951f78b2a1b2cd10 |
| reply_b18ed1503d7943ed90390ddd2392244a |
| reply_ba6a767f92554a6ab9cc532d77d50d33 |
| reply_be244933bfba4ec58b922c549f2ebdc8 |
| reply_be79b68595394fb886bfc2109a925c5c |
| reply_e6cc599265e240f0a95164c6ecf48989 |
| reply_ef1dbac27378427584820be544080fd1 |
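The comparison above can also be done mechanically. This is only a rough Python sketch: it assumes the one-column `| name |` table layout shown in the listing, and `queue_names` is a hypothetical helper, not something shipped with rabbitmqadmin. Only the first three rows of the listing are embedded as sample input.

```python
import re

# First three rows of the rabbitmqadmin listing above, used as sample input.
LISTING = """\
| reply_09e9cf08bfd84f8f903b527080ac28fc |
| reply_1d7e7b12580e46109c5a01e54b5cfbbf |
| reply_215c6a00191f480daafa9daa18949700 |
"""

# The reply queue named in the RPC request from the packet trace.
WANTED = "reply_5e65eb22a7bd4bd98a5bc6b7c354fa5b"


def queue_names(table: str) -> set:
    """Extract the single cell from each '| value |' row of the table."""
    # Border rows such as '+-----+' do not match; a '| name |' header row
    # would be returned as the literal string 'name'.
    return set(re.findall(r"^\|\s*(\S+)\s*\|$", table, flags=re.MULTILINE))


if __name__ == "__main__":
    names = queue_names(LISTING)
    print(WANTED, "present" if WANTED in names else "missing")
```

In practice the full output of the list command would be passed in instead of the embedded sample.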
What's the best way for me to figure out how it got into this state?
Thanks,
Chris