Computer Use Agents: From SFT to Classic RL [ukr]

Talk presentation

Let’s explore Computer/Browser/Mobile Use agents. We’ll start with the APIs provided by OpenAI and Claude that support such use cases. Then, we’ll recall how LLMs and VLMs are trained, what Reinforcement Learning (RL) is, and how it can be applied in this context. We'll also look into some recent open-source agent models, and discuss how to evaluate these agents effectively.

Maksym Shamrai

Research Scientist at MacPaw

Research Scientist at MacPaw AI Research (AIR), working on applied research in the field of AI
PhD student at the Institute of Mathematics of the National Academy of Sciences of Ukraine, researching optimization methods for neural network models and optimal control theory
Teaches "Computer Vision" to Master's students at Kyiv Academic University (KAU)
Interested in the mathematics behind AI, particularly in Reinforcement Learning
Has experience in both research and production
LinkedIn

Buy tickets for the next conference Fwdays Tech Summit!

Computer Use Agents: From SFT to Classic RL [ukr]

Talk presentation