Search Engine · Part 2 (Hard)

Search Engine 2 - Ranking, Autocomplete & Global Scale

Databases · Caching · Sharding · Analytics · Geo Distribution · Search

This challenge builds on Search Engine 1 - Web Crawler & Index. Complete it first for the best experience.

Problem Statement

FindIt has expanded from developer docs to a full-scale web search engine indexing 1 billion pages. The system must now handle:

- Link-based ranking (PageRank) - compute a global authority score for every page based on the web link graph. This is a massive offline computation that runs periodically over billions of nodes and edges.
- Autocomplete / query suggestions - as the user types, suggest completions from a trie of popular queries (updated hourly). Autocomplete must respond in < 50 ms.
- Personalized results - use search history and click-through data to re-rank results per user.
- Spell correction - handle misspelled queries ("javscript tutorial" → "javascript tutorial").
- Multi-region deployment - serve search results from the nearest data center, with index replicas in each region.
- Query throughput - handle 100,000 search queries per second at peak.
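To make the PageRank requirement concrete, here is a toy single-machine power-iteration sketch. The production job is assumed to run the same rank-update distributed (e.g. via MapReduce or Spark) over the full link graph; the graph, damping factor, and iteration count here are illustrative.

```python
# Toy PageRank via power iteration. Illustrative only: the real
# computation runs distributed over billions of nodes and edges.
def pagerank(links, damping=0.85, iterations=50):
    """links: dict mapping page -> list of pages it links to."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Base probability of a random jump to any page.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Split this page's rank evenly across its outlinks.
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its mass evenly so rank is conserved.
                for t in pages:
                    new_rank[t] += damping * rank[page] / n
        rank = new_rank
    return rank

graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(graph)
```

In this toy graph, `c` ends up with the highest score because it receives links from both `a` and `b`, which is exactly the "authority from inbound links" intuition the offline batch job computes at web scale.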

This is a capstone-level challenge combining information retrieval, graph algorithms, ML ranking, and planetary-scale infrastructure.

What You'll Learn

Scale to 1 B pages with PageRank, autocomplete, personalized results, and multi-region serving. Build this architecture under realistic production constraints, then validate tradeoffs in the design lab simulation.


Constraints

- Indexed pages: ~1,000,000,000
- Index size: ~100 TB
- PageRank computation: runs daily (batch)
- Query throughput: ~100,000 QPS
- Search latency (P99): < 300 ms
- Autocomplete latency: < 50 ms
- Regions: 6
- Availability target: 99.99%

Interview-Ready Approach

1) Clarify Scope and SLOs

  • Problem statement: Scale to 1 B pages with PageRank, autocomplete, personalized results, and multi-region serving.
  • Design for the stated peak of ~100,000 QPS, plus 2-3x burst headroom.
  • Indexed pages: ~1,000,000,000
  • Index size: ~100 TB
  • PageRank computation: Runs daily (batch)
  • Query throughput: ~100,000 QPS
  • Search latency (P99): < 300 ms

2) Capacity Planning Method

  • Convert traffic and growth constraints into request rate, storage growth, and concurrency budgets.
  • Keep at least 2-3x safety margin per tier (ingress, compute, storage, async workers).
  • Reserve explicit latency budgets per hop so p95 can be defended in review.
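The capacity-planning steps above can be turned into rough numbers. The per-shard size, per-replica QPS, and latency split below are illustrative assumptions for the sketch, not figures from the spec; only the 100 TB index, 100,000 QPS peak, and 300 ms P99 target come from the constraints.

```python
# Back-of-envelope capacity math from the stated constraints.
# Assumed knobs: 500 GB per index shard, 2,000 QPS per serving
# replica, 3x safety margin (top of the 2-3x rule above).
import math

INDEX_GB = 100 * 1000        # ~100 TB index
PEAK_QPS = 100_000
SHARD_GB = 500               # assumed max index size per shard
QPS_PER_REPLICA = 2_000      # assumed sustainable QPS per replica
MARGIN = 3

shards = math.ceil(INDEX_GB / SHARD_GB)
# Document-partitioned index: every query fans out to all shards,
# so each shard must absorb the full peak QPS (with margin).
replicas_per_shard = math.ceil(PEAK_QPS * MARGIN / QPS_PER_REPLICA)
total_nodes = shards * replicas_per_shard   # split across 6 regions

# One defensible per-hop split of the 300 ms P99 budget.
budget_ms = {"edge + LB": 20, "parse + spellcheck": 30,
             "shard fan-out + merge": 180, "rank + personalize": 50,
             "serialize + respond": 20}
assert sum(budget_ms.values()) == 300
```

Under these assumptions the math lands on 200 shards and 150 replicas per shard (30,000 serving nodes globally, i.e. 5,000 per region), which is the kind of explicit number worth defending in review.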

3) Architecture Decisions

  • Databases: Define a clear system-of-record and design read/write paths separately before adding optimizations.
  • Caching: Put cache on hot read paths first and pick cache-aside or write-through explicitly.
  • Sharding: Choose shard keys around access patterns and growth hotspots, not just data size.
  • Analytics: Maintain separate OLTP and analytics paths; stream events into a warehouse/time-series layer.
  • Geo Distribution: Route users to nearest region/edge while keeping write-consistency boundaries explicit.
  • Search: Use primary store for writes and async index updates for search relevance + scale.
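The cache-aside choice named above can be sketched as follows. This is a minimal in-process stand-in (a dict plays the role of Redis, any mapping plays the role of the system-of-record); the TTL and invalidation hook mirror the staleness-bounding point in the reliability section.

```python
import time

class CacheAside:
    """Cache-aside read path: check cache, fall back to the
    system-of-record on a miss, then populate the cache with a TTL."""

    def __init__(self, store, ttl_seconds=60):
        self.store = store            # system-of-record (dict-like here)
        self.ttl = ttl_seconds
        self._cache = {}              # key -> (value, expires_at)

    def get(self, key):
        hit = self._cache.get(key)
        if hit is not None and hit[1] > time.time():
            return hit[0]                             # cache hit
        value = self.store[key]                       # miss: read DB
        self._cache[key] = (value, time.time() + self.ttl)
        return value

    def invalidate(self, key):
        # Invalidation hook: call on writes to bound staleness.
        self._cache.pop(key, None)

db = {"q:python tutorial": ["result-1", "result-2"]}
cache = CacheAside(db, ttl_seconds=60)
first = cache.get("q:python tutorial")   # miss, populates cache
second = cache.get("q:python tutorial")  # hit
```

The explicit `invalidate` call is the "cache-aside picked explicitly" part: writes go to the store first, then evict, so the next read repopulates from the source of truth.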

4) Reliability and Failure Strategy

  • Use strong write constraints (transactions or conditional writes) and explicit backup/restore strategy.
  • Bound staleness with TTL + invalidation hooks for critical entities.
  • Support rebalancing and hotspot detection from day one.
  • Version event schemas and monitor drop/late-event rates.
  • Design region failover and data residency controls as first-class requirements.

5) Validation Plan

  • Run one peak-load test, one dependency-degradation test, and one failover test.
  • Verify idempotency for all retried writes and async consumers.
  • Track user-facing SLOs first: p95 latency, error rate, and successful throughput.
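The idempotency check above can be made concrete with a deduplicating consumer. This is a sketch under the assumption that every event carries a unique `event_id`; a production version would persist the seen-set durably (with a TTL) rather than in memory.

```python
class IdempotentConsumer:
    """Skip duplicate deliveries so retried events are applied once."""

    def __init__(self, apply_fn):
        self.apply_fn = apply_fn
        self.seen = set()   # production: durable store with TTL, not memory

    def handle(self, event):
        if event["event_id"] in self.seen:
            return False            # duplicate delivery: no-op
        self.apply_fn(event)        # apply side effect exactly once
        self.seen.add(event["event_id"])
        return True

clicks = []
consumer = IdempotentConsumer(lambda e: clicks.append(e["query"]))
event = {"event_id": "evt-1", "query": "javascript tutorial"}
consumer.handle(event)
consumer.handle(event)   # retried delivery: ignored
```

Recording the `event_id` only after `apply_fn` succeeds means a crash between the two leads to a retry, not a lost event, which is the at-least-once-plus-dedupe behavior the validation plan should exercise.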

6) Trade-offs to Call Out in Interviews

  • Databases: SQL gives stronger transactional guarantees; NoSQL often gives better write scaling and flexibility.
  • Caching: Higher hit rate cuts latency/cost, but stale data and invalidation bugs become primary risks.
  • Sharding: Sharding improves horizontal scale but makes cross-shard queries and transactions harder.
  • Analytics: Analytics pipeline unlocks insights, but adds eventual consistency and governance overhead.
  • Geo Distribution: Global latency improves, but cross-region consistency and operations become harder.

Practical Notes

  • PageRank is computed offline using a distributed computation framework (MapReduce / Spark) over the link graph.
  • Autocomplete can use a prefix trie stored in-memory (or Redis) with the top-k completions pre-computed per prefix.
  • Partition the index by document so each query fans out to all shards in parallel, then merge-sort results.
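The autocomplete note above can be sketched as a trie that caches the top-k completions at every prefix node, so a suggestion lookup is a pure pointer walk with no subtree traversal. The queries and counts are made up for illustration; the hourly rebuild mentioned above would regenerate this structure from fresh query logs.

```python
class AutocompleteTrie:
    """Prefix trie with top-k completions pre-computed per node,
    so lookups stay well under the 50 ms budget."""

    def __init__(self, k=5):
        self.k = k
        self.root = {"children": {}, "top": []}  # top: [(count, query)]

    def insert(self, query, count):
        node = self.root
        for ch in query:
            node = node["children"].setdefault(
                ch, {"children": {}, "top": []})
            # Keep only the k most popular completions at this prefix.
            node["top"].append((count, query))
            node["top"].sort(reverse=True)
            del node["top"][self.k:]

    def suggest(self, prefix):
        node = self.root
        for ch in prefix:
            node = node["children"].get(ch)
            if node is None:
                return []            # unseen prefix: no suggestions
        return [q for _, q in node["top"]]

trie = AutocompleteTrie(k=5)
trie.insert("javascript tutorial", 100)
trie.insert("javascript", 90)
trie.insert("java", 80)
suggestions = trie.suggest("jav")
```

Because each node stores its own ranked list, the per-keystroke cost is O(prefix length), which is what makes the in-memory (or Redis-backed) variant viable at 100,000 QPS.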

Reference Solution

Why This Solution Works

Request path: The solution keeps ingress, service logic, and stateful dependencies separated so each layer can scale independently.

Reference flow: Web Clients -> DNS -> Load Balancer -> Core Service -> Primary NoSQL DB -> Replica SQL DB -> Redis Cache -> Event Bus

Design strengths

  • Cache sits on the read path to absorb repeated queries and keep DB pressure stable.
  • Analytics pipeline is separated from OLTP path to avoid reporting workloads impacting transactions.

Interview defense

  • This design makes bottlenecks explicit (ingress, core compute, persistence, async workers).
  • It supports progressive scaling without re-architecting the core request path.
  • It keeps correctness-sensitive state changes in durable systems while offloading background work asynchronously.