Presentation: The Future of Distributed Databases Is Relational

Track: Distributed Stateful Systems

Location: Churchill, G flr.

Duration: 4:10pm - 5:00pm

Day of week: Wednesday

Level: Intermediate - Advanced

Share this on:

Abstract

Years ago when working at Amazon on shopping cart infrastructure and the precursor to DynamoDB, my co-founder and I realized that while distributed key value stores were useful for a few use-cases, we missed many of the benefits of relational databases: transactions, joins, and the power of the lingua franca of RDBMS’s: SQL. So we challenged ourselves to modernize the traditional relational database, to take a robust open source relational database and transform it into a distributed database.

This talk is about my team’s journey to create a more modern relational database. I’ll talk about the distributed systems problems we had to solve in order to scale out the Postgres open source database, in order to achieve parallelism and a concomitant increase in performance. I'll describe the architecture of the distributed query planner; how we extend traditional relational algebra operators to plan distributed queries and scale reads. I’ll also describe distributed deadlock detection, and how that enabled us to scale out transactions spanning multiple machines.

Speaker: Sumedh Pathak

VP Engineering & Co-Founder @CitusData

Sumedh is a co-founder and the VP of Engineering at Citus Data, where he leads the effort to make sure the Citus scale-out database is useful for application developers, so they can focus on their application and not their infrastructure. Before Citus, Sumedh worked as a software engineer at Amazon on the shopping cart app, working with distributed data stores like Dynamo and relational data stores such as Oracle. Sumedh has an M.S. in Computer Science from Stanford and a B.S. from University of Minnesota and he's passionate about Postgres and distributed databases.

Find Sumedh Pathak at

Last Year's Tracks

Monday, 5 March

Tuesday, 6 March

Wednesday, 7 March