Presentation: Osiris: When Big Data Is Too Big for HBase

Track: Solutions Track I

Location: Westminster, 4th flr.

Duration: 11:50am - 12:40pm

Day of week: Monday

Abstract

As the #1 job site in the world, Indeed delivers hundreds of millions of searches per day to job seekers. To give our users the best experience possible, we analyze petabytes of data per day for machine learning, A/B testing, and reporting.

Learn how the Search Quality team at Indeed developed Osiris, a horizontally scalable key-value store built on Hadoop. Osiris is flexible enough to be used in everything from big data analysis to latency-sensitive, user-facing applications.

This talk will cover the requirements and scaling challenges we faced that led to the development of Osiris. We'll also discuss the details of how we built Osiris, including its unique key design and highly configurable storage engine -- both of which allow for use in a wide variety of applications. We'll end this talk with specific examples of how Osiris is used at Indeed.

Speaker: Josh Slocum

Software Engineer @Indeed

Josh Slocum is a Software Engineer on the Search Quality team at Indeed. As the principal developer of Osiris, he oversees the development of Indeed's highly flexible and scalable key-value store.
He received a Bachelor of Science in Computer Science from The University of Texas at Austin, where he focused on distributed systems and computer graphics. His computer science passions include data structures and algorithms, computer graphics, video game engines, and distributed systems.
When he's not programming, Josh enjoys basketball, video games, music, drinking craft beer, and spending time with his wife and his dog in Austin, Texas.