Scaling RAG Applications to serve millions of users [eng]
Talk presentation
How we managed to grow and scale a RAG application from zero to thousands of users in 7 months. Lessons from technical challenges around managing high load for LLMs, RAGs and Vector databases.