You are viewing content from a past/completed QCon

Presentation: Amplifying Sources of Resilience: What Research Says

Track: Architecting for Failure: Chaos, Complexity, and Resilience

Location: Windsor, 5th flr.

Duration: 4:10pm - 5:00pm

Day of week: Wednesday

Share this on:

This presentation is now available to view on InfoQ.com

Watch video with transcript

Abstract

Building robust software systems means anticipating how failures may occur with components and subsystems and developing answers to the question:   

“What is needed for the design of systems that prevents or limits catastrophic failure?”   Investing in, developing, and sustaining the adaptive capacity to cope with unexpected situations is at the core of Resilience Engineering. In the software community, this means developing (continually!) ever-better answers to the question:   

“When our preventative designs fail us, what are ways that teams of engineers successfully anticipate, resolve, and learn from those catastrophes?”

  

The Resilience Engineering community has been studying how people in high-consequence/high-tempo domains answer this latter question. Applying Resilience Engineering thinking and paradigms to the world of software engineering and operations is still in its infancy, but we have some promising routes for making progress. This talk will outline productive avenues to locate, amplify, support, and build this capacity that exists (sometimes invisibly) in the expertise of your organization. Spoiler: looking closely at the origins, handling, and perception of incidents is part of this story.

Speaker: John Allspaw

DevOps/Resilience Engineering Thought Leader, Previously CTO @Etsy & Co-founder of @AdaptiveCLabs

John Allspaw has worked in software systems engineering and operations for over twenty years in many different environments. John’s publications include the books The Art of Capacity Planning (2009) and Web Operations (2010) as well as the forward to “The DevOps Handbook.”  His 2009 Velocity talk with Paul Hammond, “10+ Deploys Per Day: Dev and Ops Cooperation” helped start the DevOps movement.

John served as CTO at Etsy, and holds an MSc in Human Factors and Systems Safety from Lund University.

Find John Allspaw at

Similar Talks

Managing Systems in an Age of Dynamic Complexity

Qcon

Site Reliability Engineer @Slack, Contributor to Seeking SRE, & SRECon Steering Committee

Laura Nolan

Tesla Virtual Power Plant

Qcon

Sr. Staff Software Engineer @Tesla

Colin Breck

Preparing for the Unexpected

Qcon

Principal Engineer @FinancialTimes

Samuel Parkinson

Rethinking the Linux Kernel

Qcon

Founding Member @CiliumProject

Thomas Graf

Learning From Incidents: How Things Went Right

Qcon

Senior Applied Resilience Engineer @Netflix

Jessica DeVita

Tracks

Monday, 2 March

Tuesday, 3 March

Wednesday, 4 March

Download the QCon App!

Available on iOS and Android

The QCon app helps you make the most of your conference experience. Easily browse and follow the conference schedule, star the talks you want to attend, and keep tabs on your personal itinerary. Download the app now for free on iOS and Android.

QCon - iOS QCon - Android
QCon