<div dir="ltr">Hi,<div><br></div><div>I've seen a number of people failing to get clustering running and, unfortunately, I can't get it going either. Here's the summary of what I've got:</div><div><div><ul>
<li>Two nodes - RMQ1 and RMQ2</li><li>I can ping RMQ1 from RMQ2, and vice versa</li><li>I can telnet from RMQ1 to RMQ2:epmd, and vice versa</li><li>I can telnet from RMQ1 to RMQ2:amqp, and vice versa</li><li>The cookie file is identical, as is clear from the startup INFO</li>
</ul><div>My goal is to have RMQ2 join RMQ1 in a cluster.</div></div><div><br></div><div>The servers are started with the Ubuntu init script (i.e. service rabbitmq-server start). This differs from the guide at <a href="http://www.rabbitmq.com/clustering.html">http://www.rabbitmq.com/clustering.html</a>, which says to start with "rabbitmq-server -detached". I've tried that as well and it makes no difference, so I always use the init script.</div>
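<div><br></div><div>For anyone who wants to repeat the reachability checks from the list above without telnet, here is a rough sketch. It assumes the default ports (4369 for epmd, 5672 for AMQP), and the names can_connect and check_node are just illustrative helpers. It only shows that a TCP connection opens, which is the same thing the telnet test shows.</div>
<pre>
```python
import socket

# Assumed default ports: 4369 for epmd, 5672 for AMQP.
PORTS = {"epmd": 4369, "amqp": 5672}


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and failed name lookups.
        return False


def check_node(host, ports=PORTS):
    """Map each service name to whether its port accepted a connection."""
    return {name: can_connect(host, port) for name, port in ports.items()}
```
</pre>
<div>Run from RMQ2, check_node("RMQ1") returns a dict with one True/False entry per service name.</div>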
<div><br></div><div>The guide says to stop the RabbitMQ application on RMQ2 and then join the cluster. The following transcript shows what happens:</div><div><br></div><div><div><font face="courier new, monospace">02:~$ sudo rabbitmqctl stop_app</font></div>
<div><font face="courier new, monospace">Stopping node 'rabbit@RMQ2' ...</font></div><div><font face="courier new, monospace">...done.</font></div><div><font face="courier new, monospace"><br></font></div><div><font face="courier new, monospace">02:~$ sudo rabbitmqctl join_cluster --ram rabbit@RMQ1</font></div>
<div><font face="courier new, monospace">Clustering node 'rabbit@RMQ2' with 'rabbit@RMQ1' ...</font></div><div><font face="courier new, monospace">Error: {cannot_discover_cluster,"The nodes provided are either offline or not running"}</font></div>
<div><font face="arial, helvetica, sans-serif"><br></font></div><div><font face="arial, helvetica, sans-serif">However, as I said above, telnetting to the ports works just fine:</font></div><div><font face="arial, helvetica, sans-serif"><br>
</font></div><div><font face="courier new, monospace">02:~$ telnet RMQ1 epmd</font></div><div><font face="courier new, monospace">Trying <ip address>...</font></div><div><font face="courier new, monospace">Connected to RMQ1</font></div>
<div><font face="courier new, monospace">Escape character is '^]'.</font></div><div><font face="courier new, monospace">booger!</font></div><div><font face="courier new, monospace">Connection closed by foreign host.</font></div>
<div><font face="courier new, monospace"><br></font></div><div><font face="courier new, monospace">02:~$ telnet RMQ1 amqp</font></div><div><font face="courier new, monospace">Trying <ip address>...</font></div><div>
<font face="courier new, monospace">Connected to RMQ1</font></div><div><font face="courier new, monospace">Escape character is '^]'.</font></div><div><font face="courier new, monospace">booger!</font></div><div><font face="courier new, monospace">AMQP<span class="" style="white-space:pre">        </span>Connection closed by foreign host.</font></div>
</div><br></div><div>I'm not sure what else to test. Does anyone know how to troubleshoot this further?</div><div><br></div><div>Thanks,</div><div>Derek</div><div><br></div></div>