Back to Walmart
Walmart logo

System design

Walmart system design

6 system design prompts sourced from real Walmart loops. Each one ships with scale targets, evaluation rubrics, and a reference architecture diagram.

mid System design

Design a URL shortener (TinyURL)

Design a service that takes a long URL and returns a short alias. Must support billions of redirects per day with low latency, predictable IDs, and analytics on click-through.

Scale targets

Users: 500M MAUQPS: 100k reads / 10k writesLatency: <50ms p99 redirectStorage: ~1TB / year of links

Key areas

Hash vs counter-based ID generationRead-heavy cachingAnalytics pipelineCustom alias collisionsAbuse + spam detection
senior System design

Design a social media news feed

Design the system that powers a personalized timeline. Each user follows others; the home feed must be fresh, ranked, and personalized within hundreds of milliseconds.

Scale targets

Users: 1.5B DAUQPS: 5M feed pulls/secLatency: <300ms p99Storage: ~50PB total media

Key areas

Push vs pull vs hybrid fanoutRanking pipeline (candidate generation → ranking → re-ranking)Cold start for new usersCache hierarchy + edge deliveryMedia CDN + transcoding
senior System design

Design a real-time chat / messenger

Design a 1:1 and group chat service with delivery receipts, presence, and offline message delivery. Messages must arrive in-order and survive disconnects.

Scale targets

Users: 800M MAUQPS: 60M msgs/sec peakLatency: <200ms p99 deliveryStorage: ~100PB messages

Key areas

WebSocket gateway + sticky sessionsMessage ordering + idempotencyPresence + typing indicatorsPush notifications for offline usersEnd-to-end encryption boundaries
staff System design

Design a video streaming platform

Design a Netflix/YouTube-style streaming platform that supports upload, transcoding, adaptive playback, and personalization. Optimize for global low-latency playback and storage cost.

Scale targets

Users: 250M paid subsQPS: 15M concurrent streamsLatency: <2s startup timeStorage: ~exabyte

Key areas

Upload + transcoding pipelineAdaptive bitrate streaming (HLS/DASH)Edge CDN strategyDRM + licensingRecommendation + personalization
senior System design

Design a distributed rate limiter

Design a rate limiter that protects internal services from abusive traffic. Must work across regions, share state between gateway instances, and degrade gracefully if the central store is down.

Scale targets

Users: per-user, per-IP, per-API-keyQPS: 3M req/sec peakLatency: <1ms decisionStorage: counters only — tiny

Key areas

Token bucket vs leaky bucket vs fixed windowSliding window + log trimmingLocal fast path + central reconciliationMulti-region coordinationDegraded mode (fail open vs fail closed)
senior System design

Design a search autocomplete service

Design the typeahead suggestion service for a large product search. Each keystroke must return ranked completions in well under 100ms, with personalization and trending awareness.

Scale targets

Users: 200M DAUQPS: 500k req/secLatency: <50ms p99Storage: ~500GB index

Key areas

Trie / inverted-index structureTop-k ranking + popularity decayPersonalization + history biasIndex updates without downtimeLatency budget per hop