Consistent Hash Exchange (2024)

Messages sent to a consistent hash exchange are distributed among bound queues, depending on the hash of the routing key or a header value. The consistent hash exchange also ensures that messages with the same computed hash end up in the same queue. This allows for all messages of the same character, e.g. the same booking or client id, to end up in the same queue. This can ensure a causal message order of events that relies on consistency for that specific ID assuming no bindings have changed.

What is Consistent Hash Exchange?

The consistent hash exchange is an exchange type available in LavinMQ. It distributes messages among queues bound to it, while still ensuring that messages of a certain character are sent to the same queue, assuming no bindings have changed.

Message distribution depends on the computed hash of the routing key or header value. Queues are bound with an integer-based weight rather than a routing key or header values. This weight is used as part of the algorithm that determines the delivery of the message.

With consistent hash exchange, the broker itself is used as the unit of parallelism instead of the consumers.

When should I use Consistent Hash Exchange?

The consistent hash exchange can be used to load-balance messages between queues when causal message ordering is still needed. Manually distributing messages can be hard, since publishers aren’t connected with the information about the number of queues and bindings.

Description

LavinMQ distributes messages in sequential order, but to avoid long queues, data is often spread among multiple queues or handled by several consumers. Although, data that are evenly distributed over multiple queues might lose the order.

With Consistent Hash Exchange, the hash of the routing key or the header values on the messages determines the destination queue. All queues bound to the exchange are potential destinations.

Consistent hashing

The process where data is distributed to a certain location using a hashing algorithm is known as consistent hashing. Only the hash can be used to determine exactly where that data should be routed. This location hashing is usually termed a “ring”, or “hash ring”.

Distribution of messages

The hash can be distributed over a specified span among an interval, into buckets, which makes it easy to add or remove more bounded queues into the exchange. The figure below shows how messages with given ClientIDs as routing keys are distributed among buckets, depending on the number of queues and their values.

The image illustrates messages routed by the Consistent Hash Exchange by their hashed routing key.

Setting up a consistent hash exchange is as easy as setting up an exchange:

Declare a number of queues

channel.queue_declare(queue=q1, durable=True)channel.queue_declare(queue=q2, durable=True)

Define the exchange

channel.exchange_declare(exchange="ce", exchange_type="x-consistent-hash", durable=True)

Bind the queues to the exchange with a given routing key. The routing key on the binding must be set as a numeric value.
```
ch.queue_bind(exchange="ce", queue=q1, routing_key="1")ch.queue_bind(exchange="ce", queue=q2, routing_key="2")
```

Dead messages and resends

Messages that are unable to be consumed for some reason will only be resent to their given queues and be requeued according to configurations like TTL and x-delivery-count. If the system suffers from consumer failures, the messages will stay in the queue until the given consumer/consumers are ready to handle messages.

Removing or adding queues with consistent hash exchange for LavinMQ

It is possible to remove or add queues connected to the consistent hash exchange in runtime. Consistent hash exchange is built for distributing messages as evenly as possible over the queues while keeping the messages with the same routing key/header value in the same queue.

Example: Imagine 100 entries distributed over 4 queues. In this example, all calculated hash values between 0-24 end up in the first queue. All calculated hash values between 25 and 49 end up in the second queue and so on.

The example below has a binding key of 1, meaning that one queue has one bucket:

One Queue per customer Use case

When you add more consumers for a queue to go faster you never know which consumer gets what messages, it might be that messages related to one customer end up in different consumers every time. Instead, by routing the message for a specific customer to one queue, you can be sure that the consumer on that queue will always get ALL the messages for that customer.

Important takeaways

The consistent hash exchange can be used when you want to load-balance messages between queues, where you still need causal message ordering.
Note that the consistent hash exchange does not necessarily distribute messages evenly among the given queues.
Messages routed to a queue via the consistent hash exchange are not sent to other queues or consumers in case of a consumer failure but stay in the given queue until the consumer is ready to handle them, or until dropped or dead-lettered.
The consistent hash exchange is not used for scaling, scaling is done on the queue by adding more consumers, adding more queues to an exchange doesn’t make it faster.

Ready to take the next steps? Here are some things you should keep in mind:

Managed LavinMQ instance on CloudAMQP

LavinMQ has been built with performance and ease of use in mind - we've benchmarked a throughput of about 1,000,000 messages/sec. You can try LavinMQ without any installation hassle by creating a free instance on CloudAMQP. Signing up is a breeze.

Help and feedback

We welcome your feedback and are eager to address any questions you may have about this piece or using LavinMQ. Join our Slack channel to connect with us directly. You can also find LavinMQ on GitHub.

FAQs

What are the problems with consistent hashing? ›

Key challenges due to the allocation scheme: Node Addition/Removal: Changing node numbers necessitates token recomputation, creating significant overhead. Hotspots: A single large range per node can lead to uneven data distribution, causing hotspots.

Read On ›

What does consistent hashing solve? ›

Consistent hashing is used in distributed systems to keep the hash table independent of the number of servers available to minimize key relocation when changes of scale occur.

Discover More Details ›

What is the difference between consistent hashing and normal hashing? ›

Data Distribution: Consistent hashing provides a more stable data distribution in a dynamic environment, whereas traditional hashing can lead to load imbalance.

Is consistent hashing used in practice? ›

Consistent hashing has also been used to reduce the impact of partial system failures in large web applications to provide robust caching without incurring the system-wide fallout of a failure.

See Details ›

Why is hashing not enough? ›

Password hashing is a means of protecting users' passwords from getting into the hands of hackers. However, password hashing isn't risk-free. In this method, passwords are transformed into a predictable and consistent pattern which can be attacked using dictionary, brute-force, or rainbow table attacks.

Find Out More ›

What are pros and cons of consistent hashing? ›

Consistent hashing offers good load balancing but can suffer from hotspot issues. On the other hand, rendezvous hashing generally provides better load balancing and reduces hotspot problems.

Tell Me More ›

What is an alternative to consistent hashing? ›

An alternative to consistent hashing is rendezvous hashing. Consistent hashing has been a popular distributed hashing technique used in computer science and distributed systems to achieve load balancing and minimize the need for rehashing when the number of nodes in a system changes.

Show Me More ›

Does Kafka use consistent hashing? ›

Kafka uses a consistent hashing algorithm to map each message key to a specific partition.

Explore More ›

What is the most efficient hashing method? ›

To protect passwords, experts suggest using a strong and slow hashing algorithm like Argon2 or Bcrypt, combined with salt (or even better, with salt and pepper). (Basically, avoid faster algorithms for this usage.) To verify file signatures and certificates, SHA-256 is among your best hashing algorithm choices.

Does Kubernetes use consistent hashing? ›

Kubernetes uses consistent hashing to map each request to a specific pod, ensuring that requests for the same resource are consistently routed to the same pod. This helps in maintaining session affinity and ensures that requests related to the same session are processed by the same pod.

Show Me More ›

Is consistent hashing a load balancing algorithm? ›

Consistent Hash

This algorithm is best for load balancing large numbers of cache servers with dynamic content. It is 'consistent' because adding or removing a server does not cause a complete recalculation of the hash table.

Read The Full Story ›

Is consistent hashing sharding? ›

Out of many different ways/algorithms of sharding our dataset, one of the most efficient algorithms is consistent hashing. So, it is pretty simple but a subtle difference. Sharding is a general term whereas consistent hashing is a specific type of algorithm to achieve data sharding.

See Details ›

What is a real life example of consistent hashing? ›

Many real-world applications use consistent hashing to distribute data across a cluster of servers. Some major use cases include: Distributed caching Consistent hashing is a popular technique for distributed caching systems like Memcached and Dynamo. In these systems, the caches are distributed across many servers.

Get More Info Here ›

What is the time complexity of consistent hashing? ›

The time complexity of consistent hashing is O(logn), where n is the number of cache shards. Consistent hashing uses a binary search algorithm to locate the correct cache shard for a given key. Binary search has a time complexity of O(logn).

What is the principle of consistent hashing? ›

In consistent hashing, both data and servers undergo hashing, mapping them to a shared range of values [0, n]. To simplify and visualize this concept, imagine these hash values positioned on a circular structure, like a ring or a clock. In this setup, each server is allocated its unique range within the hash values.

What are common problems of hashing? ›

1 Collision handling. One of the main challenges of using a hash table is how to deal with collisions, which occur when two or more keys map to the same index in the table. ...
2 Dynamic resizing. ...
3 Hash function design. ...
4 Key ordering. ...
5 Security risks. ...
6 Here's what else to consider.

Oct 18, 2023

View Details ›

What is the weakness of hashing? ›

Collisions play a central role in a hashing algorithm's usefulness; the easier it is to orchestrate a collision, the less useful the hash. If an attacker is able to manufacture two distinct inputs that will result in an identical hash value, they are exploiting collision resistance weakness.

What are the problems with static hashing? ›

Static Hashing has the following Properties

It is inefficient and inaccurate when the data size dynamically varies because we have limited space and the hash function always generates the same value for every specific input.

Learn More ›