Difficulty: Medium · Intermediate

Log Aggregation System

Topics: Databases · Message Queues · Storage · Search · Monitoring

Problem Statement

LogStream is building a centralized log management platform where engineering teams send application logs for search, alerting, and analysis. Features:

- Log ingestion - accept logs via HTTP API, syslog, and agents installed on servers. Each log entry has a timestamp, severity, message, and structured metadata (service name, host, trace ID).
- Full-text search - search across all logs by keyword, severity, service, and time range. Results in < 3 seconds even when scanning billions of entries.
- Live tail - stream new log entries in real time (like `tail -f`), filtered by service or keyword.
- Alerting - define alert rules (e.g., "alert if error count > 100 in 5 minutes for service=payments"). Notify via Slack, email, or PagerDuty.
- Retention & archival - hot storage for 7 days (fast search), warm storage for 30 days (slower search), cold archive for 1 year (restore-on-demand).
- Log patterns - automatically detect and group similar log messages into patterns (e.g., "Connection timeout from [IP]" appears 50,000 times today).
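The alerting rule quoted above ("error count > 100 in 5 minutes") can be evaluated with a sliding window over matching log entries. A minimal sketch, assuming per-rule in-memory state (class and method names are illustrative, not from any specific product):

```python
from collections import deque


class SlidingWindowAlert:
    """Fires when more than `threshold` matching entries arrive within `window_s` seconds."""

    def __init__(self, threshold: int, window_s: float):
        self.threshold = threshold
        self.window_s = window_s
        self.events: deque[float] = deque()  # timestamps of matching entries

    def record(self, ts: float) -> bool:
        """Record one matching entry; return True if the rule now fires."""
        self.events.append(ts)
        # Drop entries that have aged out of the window.
        while self.events and self.events[0] <= ts - self.window_s:
            self.events.popleft()
        return len(self.events) > self.threshold
```

In production, this per-rule state would typically live in the alerting service's stream processor rather than in a single process, but the windowing logic is the same.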

Ingest 1 TB of logs per day from 200 services across 5,000 servers.
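The stated volume implies useful back-of-envelope numbers: average entry size, average ingest rate, and hot-tier storage. A quick check (decimal TB assumed; replication and index overhead excluded):

```python
# Back-of-envelope check of the stated numbers: 1 TB/day, 5 billion entries/day.
SECONDS_PER_DAY = 86_400

bytes_per_day = 1 * 10**12        # ~1 TB/day (decimal TB assumed)
entries_per_day = 5 * 10**9       # ~5,000,000,000 entries/day

avg_entry_bytes = bytes_per_day / entries_per_day        # ≈ 200 bytes/entry
avg_entries_per_sec = entries_per_day / SECONDS_PER_DAY  # ≈ 57,870 entries/s
hot_storage_bytes = bytes_per_day * 7                    # 7-day hot tier ≈ 7 TB raw
```

An average of roughly 58K entries/s also explains the ~80,000 RPS peak target used later: it leaves about 1.4x burst headroom over the mean.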

What You'll Learn

Design a centralized logging system (like Datadog Logs) that ingests, indexes, and searches 1 TB of logs per day. Build this architecture under realistic production constraints, then validate tradeoffs in the design lab simulation.


Constraints

Log volume/day: ~1 TB
Log entries/day: ~5,000,000,000
Services: ~200
Search latency (7-day range): < 3 seconds
Live tail latency: < 2 seconds
Hot retention: 7 days
Availability target: 99.9%
Approach

Interview-Ready Approach

1) Clarify Scope and SLOs

  • Problem statement: Design a centralized logging system (like Datadog Logs) that ingests, indexes, and searches 1 TB of logs per day.
  • Design for a peak load target around 80,000 RPS (including burst headroom).
  • Log volume/day: ~1 TB
  • Log entries/day: ~5,000,000,000
  • Services: ~200
  • Search latency (7-day range): < 3 seconds
  • Live tail latency: < 2 seconds

2) Capacity Planning Method

  • Convert traffic and growth constraints into request rate, storage growth, and concurrency budgets.
  • Keep at least 2-3x safety margin per tier (ingress, compute, storage, async workers).
  • Reserve explicit latency budgets per hop so p95 can be defended in review.
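One way to make the per-hop latency budget concrete for the 3-second search SLO. All values below are assumptions for illustration, not measured numbers; the point is that the budget must sum to the SLO, with the index scan taking the dominant share:

```python
# Illustrative per-hop p95 budget for the < 3 s search SLO (values assumed).
search_budget_ms = {
    "load_balancer": 5,
    "api_service": 45,
    "query_fanout_to_shards": 2400,   # dominant cost: scanning hot indices
    "merge_and_rank": 400,
    "serialization_and_network": 150,
}

# The budget must fit inside the SLO, or a hop has to give time back.
assert sum(search_budget_ms.values()) <= 3000
```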

3) Architecture Decisions

  • Databases: Define a clear system-of-record and design read/write paths separately before adding optimizations.
  • Message Queues: Move slow and retry-heavy work off the request path to async consumers with explicit retry and DLQ policies.
  • Storage: Use object storage for large blobs and keep metadata/authorization separate in the API tier.
  • Search: Use primary store for writes and async index updates for search relevance + scale.
  • Monitoring: Instrument golden signals (latency, traffic, errors, saturation) per tier and per tenant/domain.

4) Reliability and Failure Strategy

  • Use strong write constraints (transactions or conditional writes) and explicit backup/restore strategy.
  • Guarantee idempotent consumers and trace every message with correlation IDs.
  • Enforce lifecycle policies, retention tiers, and checksum validation.
  • Track indexing lag and support reindex from source of truth.
  • Alert on user-impact SLOs, not only infrastructure metrics.
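The idempotent-consumer guarantee above can be sketched simply: each message carries a unique ID (e.g., a correlation ID), and processing is skipped on replay. A minimal sketch; in production the seen-set would live in a durable store (e.g., Redis, or the database itself via a unique constraint), and field names here are illustrative:

```python
# In-memory stand-ins for durable dedupe state and the real side effect.
processed_ids: set[str] = set()
results: list[str] = []


def handle(message: dict) -> None:
    """Process a queue message exactly once, even under redelivery."""
    msg_id = message["id"]
    if msg_id in processed_ids:
        return  # duplicate delivery: safe no-op
    results.append(message["payload"])  # stand-in for indexing/writing
    processed_ids.add(msg_id)


# A retried delivery of the same message leaves exactly one result.
for msg in [{"id": "a1", "payload": "index-entry"},
            {"id": "a1", "payload": "index-entry"}]:
    handle(msg)
```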

5) Validation Plan

  • Run one peak-load test, one dependency-degradation test, and one failover test.
  • Verify idempotency for all retried writes and async consumers.
  • Track user-facing SLOs first: p95 latency, error rate, and successful throughput.

6) Trade-offs to Call Out in Interviews

  • Databases: SQL gives stronger transactional guarantees; NoSQL often gives better write scaling and flexibility.
  • Message Queues: Async pipelines absorb spikes well, but increase eventual-consistency complexity.
  • Storage: Object storage is cheap and durable, but random low-latency reads are weaker than databases/caches.
  • Search: Search index gives rich querying but introduces eventual consistency and index ops overhead.
  • Monitoring: Deep observability speeds incident response but raises ingestion and tooling costs.

Practical Notes

  • Elasticsearch for hot storage (7 days), with index-per-day rotation. Delete old indices automatically.
  • Ingest via Kafka - decouple producers from the indexing pipeline. Kafka also acts as a buffer during traffic spikes.
  • Live tail: subscribe to the Kafka topic with filters, push matching entries via WebSocket to the browser.

Reference Solution

Why This Solution Works

Request path: The solution keeps ingress, service logic, and stateful dependencies separated so each layer can scale independently.

Reference flow: Web Clients -> Load Balancer -> API Service -> Primary SQL DB -> Message Queue -> Background Workers -> Object Storage -> Search Index

Design strengths

  • Async queue/event bus isolates bursty workloads and supports retries without blocking synchronous requests.
  • Monitoring and logs are wired in from day one for rapid incident triage.

Interview defense

  • This design makes bottlenecks explicit (ingress, core compute, persistence, async workers).
  • It supports progressive scaling without re-architecting the core request path.
  • It keeps correctness-sensitive state changes in durable systems while offloading background work asynchronously.