[rabbitmq-discuss] Node scheduling with RabbitMQ
Pete Hunt
floydophone at gmail.com
Mon Aug 9 03:55:22 BST 2010
Hi Ask -
I am a Pythonista and I have been looking into Celery, so it is very
cool that you have responded!
I was considering writing an x-script exchange for RabbitMQ and
creating one queue per worker to dump messages into; that is a feasible
way to go, right? The exchange could probably handle data locality,
but what happens when a worker dies with messages still sitting in its
queue?
I am also relatively new to message queuing outside of Amazon SQS, so
I guess this becomes less a question of how to do proper data locality
and more a question of how to do reliability with RabbitMQ. On SQS,
the visibility timeout semantics assure me that if I design idempotent
tasks they will eventually be executed even if a node fails.
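
As far as I can tell, the closest thing on the RabbitMQ side is manual
acknowledgements: the broker requeues any unacknowledged messages if the
consumer's connection dies. A minimal sketch of a worker using that, with
the pika client (queue name and task function are just placeholders):

    import pika

    # A worker that only acks after the task completes; if this process dies
    # mid-task, RabbitMQ requeues the unacked message and redelivers it to
    # another consumer.
    connection = pika.BlockingConnection(pika.ConnectionParameters('localhost'))
    channel = connection.channel()
    channel.queue_declare(queue='tasks', durable=True)
    channel.basic_qos(prefetch_count=1)   # at most one unacked task per worker

    def run_idempotent_task(body):
        pass   # placeholder for the actual (idempotent) work

    def handle(ch, method, properties, body):
        run_idempotent_task(body)
        ch.basic_ack(delivery_tag=method.delivery_tag)   # ack only on success

    channel.basic_consume(queue='tasks', on_message_callback=handle, auto_ack=False)
    channel.start_consuming()

There is no timeout involved: redelivery happens when the dead worker's
connection drops, but combined with idempotent tasks it gives a similar
at-least-once guarantee.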
Right now, the solution that makes the most sense to me is to have a
single consumer that farms the tasks out to the worker nodes. However,
this means I would need to build reliability features like visibility
timeouts into that consumer rather than relying on a battle-tested
piece of software like RabbitMQ. Is there some sort of functionality
in RabbitMQ that accounts for this? Am I missing an important piece of
documentation somewhere?
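
For concreteness, the single-consumer dispatcher I have in mind would look
roughly like this (a sketch with pika; pick_node_for() and the queue and
exchange names are made up for illustration):

    import pika

    # A single dispatcher that pulls from a shared "tasks" queue and republishes
    # each task to the queue of the node holding the data. The original message
    # is acked only after the republish, so a dispatcher crash just means the
    # broker redelivers the message.
    connection = pika.BlockingConnection(pika.ConnectionParameters('localhost'))
    channel = connection.channel()

    def pick_node_for(body):
        # Hypothetical lookup: return the host name of the node closest to
        # the data this task needs.
        return 'node1.example.com'

    def dispatch(ch, method, properties, body):
        target = pick_node_for(body)
        ch.basic_publish(exchange='directly_to', routing_key=target, body=body)
        ch.basic_ack(delivery_tag=method.delivery_tag)

    channel.basic_consume(queue='tasks', on_message_callback=dispatch, auto_ack=False)
    channel.start_consuming()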
Thanks,
Pete
On Sat, Aug 7, 2010 at 5:04 AM, Ask Solem <askh at opera.com> wrote:
>
> On Aug 7, 2010, at 10:43 AM, Ask Solem wrote:
>>>
>>
>> cache-aware? You mean data locality?
>>
>> I've been contemplating this as well.
>> I think this could be solved by having a proxy in front of RabbitMQ that routes
>> the messages to the correct hosts (using routing keys).
>>
>> E.g. if each worker is also consuming from a private queue, this proxy could
>> just reroute the message so the task is received by a single worker.
>
>
> On second thought, it doesn't need to be a proxy at all,
> just some service that knows which worker(s) are near the data.
>
> node1.example.com consumes from
>   queue: tasks              exchange: tasks (direct)        routing_key: tasks
>   queue: node1.example.com  exchange: directly_to (direct)  routing_key: node1.example.com
>
> node2.example.com consumes from
>   queue: tasks              exchange: tasks (direct)        routing_key: tasks
>   queue: node2.example.com  exchange: directly_to (direct)  routing_key: node2.example.com
>
> Then these workers will receive tasks from the "tasks" queue in round-robin fashion,
> and it's also possible to send tasks directly to one or more specific workers.
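
Declared with the pika client, that layout for one of the workers
(node1.example.com) might look roughly like this sketch:

    import pika

    # Bindings for one worker: a shared round-robin "tasks" queue plus a
    # private queue addressed by the worker's host name.
    connection = pika.BlockingConnection(pika.ConnectionParameters('localhost'))
    channel = connection.channel()

    channel.exchange_declare(exchange='tasks', exchange_type='direct')
    channel.exchange_declare(exchange='directly_to', exchange_type='direct')

    # Shared queue: every worker binds and consumes, so deliveries round-robin.
    channel.queue_declare(queue='tasks')
    channel.queue_bind(queue='tasks', exchange='tasks', routing_key='tasks')

    # Private queue: only this host consumes, addressed by its own name.
    channel.queue_declare(queue='node1.example.com')
    channel.queue_bind(queue='node1.example.com', exchange='directly_to',
                       routing_key='node1.example.com')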
>
> When you publish a task that could benefit from data locality, you consult
> the central authority to find out which node is closest to the file(s):
>
> routing_key = node_close_to(file)
> basic_publish(message, exchange="directly_to", routing_key=routing_key)
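
Filled in with pika, and assuming the channel and "directly_to" exchange
from the sketch above (node_close_to() stands in for whatever locality
lookup service you run, and task_payload for the serialized task):

    # Publish the task straight to the worker closest to its data; the
    # routing key is that worker's host name.
    routing_key = node_close_to(file)    # e.g. returns "node2.example.com"
    channel.basic_publish(exchange='directly_to',
                          routing_key=routing_key,
                          body=task_payload)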
>
> This doesn't take into account failing nodes, but maybe you could use
> the immediate flag for this.
>
> As the maintainer of Celery (http://celeryproject.org), I'm very interested
> in what you end up with here.
>
>
>
> --
> {Ask Solem,
> +47 98435213 | twitter.com/asksol }.
>
> _______________________________________________
> rabbitmq-discuss mailing list
> rabbitmq-discuss at lists.rabbitmq.com
> https://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss
>