Incident Management
Past Presentations
Best Practices Building Resilient Systems
Architecting for Failure covers the challenges, both technical and organizational, of constantly improving service delivery at a growing global company with a 24x7x365 service redundancy requirement. The talk focuses on best practices and lessons learned in building resilient systems. Topics...
Chaos Engineering: Why the World Needs More Resilient Systems
There are those of us who are motivated to build resilient systems, improve uptime, move fast, and keep systems reliable. Then there are those of us who feel overwhelmed by our to-do lists and the features or projects we feel we need to get out the door. The world needs more resilient systems...
Rethinking How the Industry Approaches Chaos Engineering
To determine and envision how to achieve the reliability and resilience that drive our businesses forward, organizations must be able to look back at past blunders unobscured by hindsight bias. Resilient organizations don’t take past successes as a reason for confidence. Instead, they...