Abstract
A “simple” API request rarely stays simple. In distributed systems, one call quickly turns into fan-out across gateways, services, caches, and databases, and your p99 inherits the tail of every hop and every flaky dependency. Worse, it’s often not a clean outage: it’s grey failures and intermittent slowdowns that are hard to reproduce and easy for customers to feel.
In this session, I’ll share a practical playbook for designing sub-100ms APIs when fan-out is unavoidable. We’ll start with latency budgets, so that performance becomes a design constraint rather than a hope. Then we’ll cover the patterns that keep tail latency predictable: safe parallelism, timeouts and retries that don’t amplify failure, idempotency, bulkheads and circuit breakers with fallbacks, and caching strategies that treat invalidation as a correctness problem. We’ll close with trace-driven observability: the minimal signals that let you quickly answer where the milliseconds went, what changed, and whether it’s us or a dependency.
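To give a taste of the latency-budget idea, here is a minimal Go sketch (illustrative only, not code from the session): an overall 100ms request budget is carved into per-hop deadlines, the fan-out runs in parallel, and a retry is allowed only while that hop’s slice of the budget remains. The service names, budget splits, and simulated latencies are invented for the example.

```go
// Hypothetical sketch: one shared 100ms request budget, split into per-hop
// deadlines, with parallel fan-out and retries that cannot exceed the slice.
package main

import (
	"context"
	"errors"
	"fmt"
	"math/rand"
	"time"
)

// callDependency stands in for any downstream hop (cache, service, database).
func callDependency(ctx context.Context, name string, simulated time.Duration) error {
	select {
	case <-time.After(simulated): // simulated downstream latency
		return nil
	case <-ctx.Done(): // this attempt's deadline expired
		return fmt.Errorf("%s: %w", name, ctx.Err())
	}
}

// callWithBudget gives one hop its own slice of the request budget and caps
// it at two attempts, so a retry can never spend more than the slice allows.
func callWithBudget(ctx context.Context, name string, budget time.Duration) error {
	deadline := time.Now().Add(budget)
	var err error
	for attempt := 0; attempt < 2; attempt++ {
		remaining := time.Until(deadline)
		if remaining <= 0 || ctx.Err() != nil {
			break // hop budget or overall request budget is gone
		}
		attemptCtx, cancel := context.WithTimeout(ctx, remaining)
		err = callDependency(attemptCtx, name, time.Duration(rand.Intn(60))*time.Millisecond)
		cancel()
		if err == nil {
			return nil
		}
	}
	if err == nil {
		err = errors.New("no budget left to attempt the call")
	}
	return fmt.Errorf("%s exhausted its %v budget: %w", name, budget, err)
}

func main() {
	start := time.Now()

	// Overall request budget: 100ms, shared by every downstream hop.
	ctx, cancel := context.WithTimeout(context.Background(), 100*time.Millisecond)
	defer cancel()

	// Fan out to two independent dependencies in parallel, each with an
	// explicit slice of the budget instead of an unbounded wait.
	budgets := map[string]time.Duration{
		"profile-svc": 40 * time.Millisecond,
		"pricing-svc": 60 * time.Millisecond,
	}
	results := make(chan error, len(budgets))
	for name, budget := range budgets {
		go func(name string, budget time.Duration) {
			results <- callWithBudget(ctx, name, budget)
		}(name, budget)
	}
	for range budgets {
		if err := <-results; err != nil {
			fmt.Println("degraded:", err) // a real service would serve a fallback here
		}
	}
	fmt.Println("answered in", time.Since(start))
}
```

The point of the sketch is that deadlines and retry caps are all derived from one shared budget, so a slow dependency can only burn its own slice and the request still answers on time, possibly degraded.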
Main takeaways:
- How to budget latency across service boundaries and enforce it with guardrails
- How to combine timeouts, retries, idempotency, and bulkheads without creating new p99 spikes
- How to use traces plus a few key metrics to pinpoint the slow hop fast