<div dir="ltr">Ah, I do have more information though:<div><br></div><div><div><font face="courier new, monospace">DIAGNOSTICS</font></div><div><font face="courier new, monospace">===========</font></div><div><font face="courier new, monospace"><br>
</font></div><div><font face="courier new, monospace">nodes in question: ['RMQ1']</font></div><div><font face="courier new, monospace"><br></font></div><div><font face="courier new, monospace">hosts, their running nodes and ports:</font></div>
<div><font face="courier new, monospace">- unable to connect to epmd on RMQ1: nxdomain (non-existing domain)</font></div><div><font face="courier new, monospace"><br></font></div><div><font face="courier new, monospace">current node details:</font></div>
<div><font face="courier new, monospace">- node name: 'rabbitmqctl1577@RMQ2'</font></div><div><font face="courier new, monospace">- home dir: /var/lib/rabbitmq</font></div><div><font face="courier new, monospace">- cookie hash: ohQKEF09peb6bAgNqawvKA==</font></div>
</div><div><br></div><div>And just to be clear, the cookie is the same:</div><div><br></div><div><div><font face="courier new, monospace"><b>01</b>:~$ sudo md5sum /var/lib/rabbitmq/.erlang.cookie </font></div><div><font face="courier new, monospace">a2140a105d3da5e6fa6c080da9ac2f28 /var/lib/rabbitmq/.erlang.cookie</font></div>
</div><div><div><font face="courier new, monospace"><b>02</b>:~$ sudo md5sum /var/lib/rabbitmq/.erlang.cookie</font></div><div><font face="courier new, monospace">a2140a105d3da5e6fa6c080da9ac2f28 /var/lib/rabbitmq/.erlang.cookie</font></div>
</div><div><br></div><div>Somehow, telnet to epmd works just fine, but something that RMQ is doing fails to make that happen. Is there some sort of DNS work that it's doing, instead of using the hosts files?</div><div>
<br></div><div>i.e. one thing I found is that nslookup fails:</div><div><br></div><div><font face="courier new, monospace">02:~$ nslookup RMQ1</font></div><div><div><font face="courier new, monospace">;; Got SERVFAIL reply from <ipaddress>, trying next server</font></div>
<div><font face="courier new, monospace">Server:<span class="" style="white-space:pre">                </span><ipaddress></font></div><div><font face="courier new, monospace">Address:<span class="" style="white-space:pre">        </span><ipaddress></font></div>
<div><font face="courier new, monospace"><br></font></div><div><font face="courier new, monospace">** server can't find RMQ1: SERVFAIL</font></div></div><div><br></div><div>But if I ping RMQ1 it works fine. <font face="courier new, monospace">/etc/nsswitch.conf</font> specifies that files should be tried first, before DNS w.r.t. hosts.</div>
<div><br></div><div>So, it looks like RMQ is doing something more rigorous to resolve the host, and I don't know how to change that. I also don't have access to the DNS server configuration in order to modify it in any way.</div>
<div><br></div></div><div class="gmail_extra"><br><br><div class="gmail_quote">On 25 September 2013 09:57, Jason McIntosh <span dir="ltr"><<a href="mailto:mcintoshj@gmail.com" target="_blank">mcintoshj@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Check your erlang cookie on both servers to make sure it matches I think it's in - /var/lib/rabbitmq/ - then you can use rabbitmqctl from one machine and see if you can connect to another to list queues. I THINK that's rabbitmqctl -n <servernode> list_queues for example. If both servers can talk to each other then it should be rabbitmqctl stop_app, join_cluster, start_app.<br>
<br>Jason</div><div class="gmail_extra"><br><br><div class="gmail_quote">On Wed, Sep 25, 2013 at 8:50 AM, Derek Wyatt <span dir="ltr"><<a href="mailto:derek@derekwyatt.org" target="_blank">derek@derekwyatt.org</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Hi,<div><br></div><div>I've seen a number of people failing to get clustering running and, unfortunately, I can't get it going either. Here's the summary of what I've got:</div>
<div><div><ul>
<li>Two nodes - RMQ1 and RMQ2</li><li>I can ping RMQ1 from RMQ2, and vice versa</li><li>I can telnet from RMQ1 to RMQ2:epmd, and vice versa</li><li>I can telnet from RMQ1 to RMQ2:amqp, and vice versa</li><li>The cookie file is identical, as is clear from the startup INFO</li>
</ul><div>My goal is to have RMQ2 join RMQ1 in a cluster.</div></div><div><br></div><div>The servers are started using the init script in Ubuntu (i.e. service rabbitmq-server start). This is different than the script at <a href="http://www.rabbitmq.com/clustering.html" target="_blank">http://www.rabbitmq.com/clustering.html</a>, which says to start with "rabbitmq-server -detached". I've tried that and it doesn't seem to make any difference so I always use the init script instead.</div>
<div><br></div><div>So, the script says to stop the RMQ2 server and then join the cluster. The following transcript shows how well all this goes:</div><div><br></div><div><div><font face="courier new, monospace">02:~$ sudo rabbitmqctl stop_app</font></div>
<div><font face="courier new, monospace">Stopping node 'rabbit@RMQ2' ...</font></div><div><font face="courier new, monospace">...done.</font></div><div><font face="courier new, monospace"><br></font></div><div><font face="courier new, monospace">02:~$ sudo rabbitmqctl join_cluster --ram rabbit@RMQ1</font></div>
<div><font face="courier new, monospace">Clustering node 'rabbit@RMQ2' with 'rabbit@RMQ1' ...</font></div><div><font face="courier new, monospace">Error: {cannot_discover_cluster,"The nodes provided are either offline or not running"}</font></div>
<div><font face="arial, helvetica, sans-serif"><br></font></div><div><font face="arial, helvetica, sans-serif">However, as I said above, telnetting to the ports works just fine:</font></div><div><font face="arial, helvetica, sans-serif"><br>
</font></div><div><font face="courier new, monospace">02:~$ telnet RMQ1 epmd</font></div><div><font face="courier new, monospace">Trying <ip address>...</font></div><div><font face="courier new, monospace">Connected to RMQ1</font></div>
<div><font face="courier new, monospace">Escape character is '^]'.</font></div><div><font face="courier new, monospace">booger!</font></div><div><font face="courier new, monospace">Connection closed by foreign host.</font></div>
<div><font face="courier new, monospace"><br></font></div><div><font face="courier new, monospace">02:~$ telnet RMQ1 amqp</font></div><div><font face="courier new, monospace">Trying <ip address>...</font></div><div>
<font face="courier new, monospace">Connected to RMQ1</font></div><div><font face="courier new, monospace">Escape character is '^]'.</font></div><div><font face="courier new, monospace">booger!</font></div><div><font face="courier new, monospace">AMQP<span style="white-space:pre-wrap">        </span>Connection closed by foreign host.</font></div>
</div><br></div><div>I'm stuck for what else to test. Does anyone know how to troubleshoot this thing further?</div><div><br></div><div>Thanks,</div><div>Derek</div><div><br></div></div>
<br>_______________________________________________<br>
rabbitmq-discuss mailing list<br>
<a href="mailto:rabbitmq-discuss@lists.rabbitmq.com" target="_blank">rabbitmq-discuss@lists.rabbitmq.com</a><br>
<a href="https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss" target="_blank">https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss</a><br>
<br></blockquote></div><span class="HOEnZb"><font color="#888888"><br><br clear="all"><div><br></div>-- <br>Jason McIntosh<br><a href="http://mcintosh.poetshome.com/blog/" target="_blank">http://mcintosh.poetshome.com/blog/</a><br>
<a href="tel:573-424-7612" value="+15734247612" target="_blank">573-424-7612</a>
</font></span></div>
<br>_______________________________________________<br>
rabbitmq-discuss mailing list<br>
<a href="mailto:rabbitmq-discuss@lists.rabbitmq.com">rabbitmq-discuss@lists.rabbitmq.com</a><br>
<a href="https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss" target="_blank">https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss</a><br>
<br></blockquote></div><br></div>