Push vs Pull: The Economics of Scale [ukr]

Talk presentation

Most systems start with push because it feels natural — the server knows when something changes, so it notifies clients immediately. But at scale, push embeds a hidden cost multiplier: connected users × open items × update rate. Every millisecond of freshness is paid for by every connected user, even those who aren't watching.

This talk is based on a real-world production case study at DraftKings. We’ll explore how the data delivery model shapes the cost curve, rather than just the latency profile. We walk through:

the original push-based system and how it accumulated complexity over the years
the scaling pain points that made it unsustainable under peak load
the evaluation process that led to short polling as the answer
a zero-downtime migration strategy across four phases

The results were counter-intuitive: a pull-based architecture with short polling outperformed push on data freshness at peak load, while achieving significant CPU and infrastructure cost reduction. The talk closes with a practical framework for deciding when push wins, when pull wins, and what question to ask first.

Artem Kuzmyk

Software Architect, DraftKings Inc.

Software architect with 10+ years in architecture and 20+ years in the industry overall, specialising in high-load distributed systems
Currently at DraftKings Inc., working on core platform architecture in the sports betting domain
Background spans financial and gaming technology; experienced in systems design, stream processing, and scalability at scale
Graduate of the National Technical University of Ukraine "Kyiv Polytechnic Institute"
LinkedIn