Day 4 of 5
⏱ ~60 minutes
Distributed Systems in 5 Days — Day 4

Sharding & Partitioning

Hash vs range partitioning, consistent hashing, hot spots, rebalancing

What You'll Cover Today

Day 4 of Distributed Systems in 5 Days pushes into advanced territory. You have enough foundation now to tackle real-world complexity. Today's exercise is more open-ended than earlier days — that's intentional.

ℹ️
Topics today: consistent hashing, hot spots, rebalancing. Each section has code you can copy and run immediately.

consistent hashing

Understanding consistent hashing is the core goal of Day 4. The concept is straightforward once you see it in practice — most confusion comes from skipping the mental model and jumping straight to implementation. Start with the model, then write the code.

consistent hashing
# consistent hashing — Working Example
# Study this pattern carefully before writing your own version

class consistenthashingExample:
    """
    Demonstrates core consistent hashing concepts.
    Replace placeholder values with your real implementation.
    """
    
    def __init__(self, config: dict):
        self.config = config
        self._validate()
    
    def _validate(self):
        required = ['name', 'type']
        for field in required:
            if field not in self.config:
                raise ValueError(f"Missing required field: {field}")
    
    def process(self) -> dict:
        # Core logic goes here
        result = {
            'status': 'success',
            'topic': 'consistent hashing',
            'data': self.config
        }
        return result


# Usage
example = consistenthashingExample({
    'name': 'my-implementation',
    'type': 'consistent hashing'
})
output = example.process()
print(output)
💡
Key insight: When working with consistent hashing, always start with the simplest possible case that works end-to-end. Complexity is easier to add than simplicity is to recover.

hot spots

hot spots is the practical application of consistent hashing in real projects. Once you understand the underlying model, hot spots becomes the natural next step.

💡
Pro tip: When working with hot spots, always read the official documentation for the exact version you're using. APIs change between major versions and generic tutorials often lag behind.

rebalancing

rebalancing rounds out today's lesson. It connects consistent hashing and hot spots into a complete picture. You'll use all three concepts together in the exercise below.

Common Mistakes on Day 4

📝 Day 4 Exercise
Sharding & Partitioning — Hands-On
  1. Set up your environment for today's topic: install required tools and verify the basics work before writing any logic.
  2. Implement a minimal working version of consistent hashing using the code example in this lesson as your starting point.
  3. Extend your implementation to incorporate hot spots — this is where the two concepts connect.
  4. Test your implementation with both valid and invalid inputs. What happens at the boundaries?
  5. Review your code: is there anything you'd name differently? Any function doing more than one thing? Refactor one thing.

Day 4 Summary

  • consistent hashing is the foundation of today's lesson — understand it before moving on.
  • hot spots is how you apply it in real projects.
  • rebalancing ties the day's concepts together into a complete pattern.
  • Error handling and input validation belong in the first version, not as an afterthought.
  • Read error messages carefully — they usually tell you exactly what's wrong.
Challenge

Extend today's exercise by adding one feature that wasn't in the instructions. Document what you built in a comment at the top of the file. This habit of going one step further is what separates engineers who grow fast from those who stay stuck.

Finished this lesson?