We’re all excited to build and deliver agentic AI services. But what about running at the
exponentially greater scale that agents create? LLMs suffer from high latency and spotty availability. More frequent model training drives more frequent updates to agentic services. Most of all, the LLM cost of running at agentic scale breaks the bank, fast. So, what can you do?
In this session, we’ll dig into how engineering and operations can address:
● Making agentic services fail-proof when their LLMs are not
● Managing a two-order-of-magnitude increase in TPS, including a 2M TPS RAG case study
● Navigating cost vs. quality tradeoffs, with an LLM call costing up to 100,000x more than a database transaction
● Continuously redeploying agents that require frequent retraining
After this session, we invite you to attend part 2 of the discussion: “From Concept to Code: Navigating Agentic AI Services.”
Speaker

Duncan DeVore
Sr. Director & Architect Advocate @Akka
Duncan DeVore is the co-author of Reactive Application Development, a hands-on guide for building reliable enterprise applications using reactive design patterns. An avid Java developer since 2001, he has earned three patents for innovative software design and led the launch of one of the first large-scale distributed reactive applications in 2012. Duncan is an expert in open source development, an enthusiastic proponent of AI, and a regular writer and speaker on both topics.
Session Sponsored By

Akka is used to develop resilient, low-latency, large-scale, cloud-to-edge distributed applications.