Track: Architecting for Failure


Day of week:

Building fault tolerate systems that are truly resilient

Track Host:
Peter Morgan
Chief Engineer @SkyBet
Peter was responsible for all development projects at William Hill and has been a strong adopter of distributed architecture, including NoSQL Riak database. Previously, Peter was co-founder and lead Java programmer of jcoverage which subsequently morphed into cobertura and co-founder at EJBHome, acquired by IONA technologies.
10:35am - 11:25am

by Eugene Kirpichov
Cloud Dataflow Sr SE @Google

One of the main causes of performance problems in distributed data processing systems (from the original MapReduce to modern Spark and Flink) is "stragglers." Stragglers are parts of the input that take an unexpectedly long time to process, delaying the completion of the whole job, and wasting resources that stay idle. Stragglers can happen due to imbalance of data distribution or processing complexity, hardware/networking anomalies, and a variety of other...

11:50am - 12:40pm

by Edward Hiley
Principal Engineer @NHSDigital

by Dan Rathbone
Technical Director @InfinityWorks

How NHS Digital have built a highly secure and resilient system for processing patient data, applying techniques more often used in the cloud to bare metal servers. See how we’ve built automated performance tests, an immutable infrastructure and a scalable distributed compute cluster utilising multiple data centres and bare metal servers.

1:40pm - 2:30pm

by Jonas Bonér
Founder & CTO @Lightbend / Creator of Akka

Everyone is talking about microservices, and there is more confusion than ever about what the promise of microservices really means and how to deliver on it. To address this we will explore microservices from first principles, distilling their essence and putting them in their true context: distributed systems.

What many people forget is that microservices are distributed and collaborative by nature and only make sense as systems—one collaborator is no collaborator. It is in between...

2:55pm - 3:45pm

by Greg Murphy
Chief Architect, Infrastructure & Operations @GameSparks

GameSparks is a globally-distributed Backend-as-a-Service platform that serves tens of billions of API requests per month for hundreds of live games which have tens of millions of active users, hundreds of thousands of whom are concurrently connected at any one time. All of our players connect across the public internet – many from mobile devices – and all of our resources run in public clouds, so failure is all around us! Oh, and our customers can write custom...

4:10pm - 5:00pm

Open Space
5:25pm - 6:15pm

by Greg Hawkins
Chief Technology Officer @StarlingBank

"Fail fast and fail often" - not only does the Silicon Valley mantra speak volumes on the relentless pursuit of innovation, it also highlights technology's power of unpredictability. But when creating a bank from scratch, the art is in combining pace of change and stability.

So when your boss thinks you are the next Google, the regulators say "what the cloud" and customers only care about being there 24/7 - what do you do? With...