Push vs Pull: The Economics of Scale [ukr]
Most systems start with push because it feels natural — the server knows when something changes, so it notifies clients immediately. But at scale, push embeds a hidden cost multiplier: connected users × open items × update rate. Every millisecond of freshness is paid for by every connected user, even those who aren't watching.
This talk is based on a real-world production case study at DraftKings. We’ll explore how the data delivery model shapes the cost curve, rather than just the latency profile. We walk through:
- the original push-based system and how it accumulated complexity over the years
- the scaling pain points that made it unsustainable under peak load
- the evaluation process that led to short polling as the answer
- a zero-downtime migration strategy across four phases
The results were counter-intuitive: a pull-based architecture with short polling outperformed push on data freshness at peak load, while achieving significant CPU and infrastructure cost reduction. The talk closes with a practical framework for deciding when push wins, when pull wins, and what question to ask first.
- Software architect with 10+ years in architecture and 20+ years in the industry overall, specialising in high-load distributed systems
- Currently at DraftKings Inc., working on core platform architecture in the sports betting domain
- Background spans financial and gaming technology; experienced in systems design, stream processing, and scalability at scale
- Graduate of the National Technical University of Ukraine "Kyiv Polytechnic Institute"