Lessons Learned From Building LinkedIn’s AI Data Platform

Taking AI from lab to business is notoriously difficult. It is not just about picking the model flavor of the day. More important is making every step of the process reliable and productive. Across training, experimentation, deployment, validation and everything in between, there are lots of moving pieces.

This talk will provide a high level overview of LinkedIn’s AI ecosystem, and then zoom in on the data platform underneath it: an open source database called Venice which we’ve been running in production for 7 years.

Building a data platform specifically tailored for AI requires careful consideration. Among other things, it must support rapid experimentation, high throughput ingestion, and low latency queries for online inference applications.

You will come out of this session with an understanding of these various challenges, what we did to solve them, and how we pivoted along the way to keep up with changing workloads and requirements.


Speaker

Felix GV

Principal Staff Engineer @LinkedIn

Felix joined LinkedIn's data infrastructure team in 2014, first working on Voldemort, the predecessor of Venice. Over the years, Felix participated in all phases of the development lifecycle of Venice, from requirements gathering and architecture, to implementation, testing, roll out, integration, stabilization, scaling and maintenance.

Date

Tuesday Apr 9 / 05:05PM BST (50 minutes)

Location

Fleming (3rd Fl.)

Topics

AI/ML Data infrastructure architecture Venice

From the same track

Session AI/ML

Mind Your Language Models: An Approach to Architecting Intelligent Systems

Tuesday Apr 9 / 11:45AM BST

As large language models (LLMs) move from the realm of proof-of-concept (POC) into mainstream production, the demand for effective architectural strategies intensifies.

Nischal HP

Vice President of Data Science @Scoutbee, Decade of Experience Building Enterprise AI

Session

Flawed ML Security: Mitigating Security Vulnerabilities in Data & Machine Learning Infrastructure with MLSecOps

Tuesday Apr 9 / 02:45PM BST

The operation and maintenance of large scale production machine learning systems have uncovered new challenges which require fundamentally different approaches from those of traditional software.

Adrian Gonzalez-Martin

Senior MLOps Engineer @Bloomberg

Session

Large Language Models for Code: Exploring the Landscape, Opportunities, and Challenges

Tuesday Apr 9 / 03:55PM BST

In the rapidly evolving landscape of software development, Large Language Models (LLMs) for code have emerged as a groundbreaking tool for code completion, synthesis and analysis.

Loubna Ben Allal

Machine Learning Engineer @Hugging Face

Session

When AIOps Meets MLOps: What Does It Take To Deploy ML Models at Scale

Tuesday Apr 9 / 10:35AM BST

In this talk, we introduce the concept of AIOps, which refers to using AI and data-driven tooling to provision, manage and scale distributed IT infrastructure. We focus particularly on how AIOps can be leveraged to help train and deploy machine learning models and pipelines at scale.

Ghida Ibrahim

Chief Architect, Head of Data @Sector Alarm Group, Ex-Facebook/Meta

Session

Connecting the Dots: Applying Generative AI

Tuesday Apr 9 / 01:35PM BST

Details coming soon.