Filter by tag

Agent in the Loop: Architecture for Highload Data Pipeline Recovery [ukr]

A real-world-inspired architecture talk about embedding an AI agent into the operational workflow of a highload data pipeline. We walk through a cascade failure scenario: corrupted data enters the pipeline, Kafka queues get stuck, storage pressure grows, thousands of Kubernetes pods start failing and rescheduling, etcd degrades, and PostgreSQL becomes a secondary pressure point. Then we show how an agent built with AWS Bedrock AgentCore, LangChain, and MCP/Gateway could detect early signals, isolate corrupted messages, suggest human-approved fixes, protect cluster stability, and turn noisy telemetry into actionable recovery steps.

Kyrylo Dubovyk

(AI Solutions Architect at EPAM | Founder “Digital Brain”),

Maksym Borodin

(Systems Architect @ EPAM),
Highload fwdays'26 conference
Sign in
Or by mail
Sign in
Or by mail
Register with email
Register with email
Forgot password?