[rabbitmq-discuss] Deduplication filters, and distributed key-value stores

Eishay Smith eishay at gmail.com
Tue Nov 10 17:58:21 GMT 2009


Voldemort has key expiration as well.

On Tue, Nov 10, 2009 at 8:29 AM, Kevin A. Smith
<kevin at hypotheticalabs.com>wrote:

> Redis already supports explicit timeouts for entries via the expire and ttl
> commands so it can take care of that as well.
>
> --Kevin
> On Nov 10, 2009, at 10:19 AM, Tony Garnock-Jones wrote:
>
> > We talk every now and then about deduplication filters, for avoiding
> > repeated work. One problem is that it can be hard to know what to do if
> > you have a worker pool, because you have to share the state about what
> > messages the whole pool has seen, across the whole pool.
> >
> > Sounds like a job for a distributed key-value store. What if redis,
> > riak, even memcached, were used as the dedup filter? Each worker pool
> > (a.k.a "queue", heh) would have a key-value store of their own. The
> > store would hold, for each labelled request, either
> >
> > - nothing: the request is fresh to the pool
> >
> > - an indication of partial completeness and/or receipt
> >
> > - the response that was sent back to the requestor, that can be
> >   sent back again if a replay is detected, without doing any more
> >   work
> >
> > The nice thing is that not only are these distributed key-value stores
> > pretty quick these days, they also scale up tremendously well and
> > furthermore you can "shard" them because you can scope them to the
> > collection of workers that are sharing a queue.
> >
> > The only missing piece is clearing out old, definitely stale records
> > from the filter after a while. For that you can use a separate
> > garbage-collection/expiry process, I guess; I haven't run any
> > experiments here, though. Maybe some of these new stores even include
> > time-to-live for their records, which solves the problem for us!
> >
> > So: rabbitmq for the routing, buffering, relaying and delivery of
> > requests and responses; and a distributed key-value store for
> deduplication.
> >
> > Tony
> >
> > _______________________________________________
> > rabbitmq-discuss mailing list
> > rabbitmq-discuss at lists.rabbitmq.com
> > http://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss
>
>
> _______________________________________________
> rabbitmq-discuss mailing list
> rabbitmq-discuss at lists.rabbitmq.com
> http://lists.rabbitmq.com/cgi-bin/mailman/listinfo/rabbitmq-discuss
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.rabbitmq.com/pipermail/rabbitmq-discuss/attachments/20091110/45993906/attachment.htm 


More information about the rabbitmq-discuss mailing list