Speaker: Onur Satici

He / him / his

Staff Engineer @SpiralDB & a Core Maintainer of Vortex (LF AI & Data), Previously Building Distributed Systems @Palantir

Onur is a Staff Engineer at SpiralDB and a core maintainer of Vortex, an open source columnar file format now part of the Linux Foundation (LF AI & Data). He focuses on high-performance data systems, GPU acceleration, and making analytical workloads faster at every layer of the stack.

Find Onur Satici at:

Session

From S3 to GPU in One Copy: Rethinking Data Loading for ML Training

ML training pipelines treat data as static. Teams spend weeks preprocessing datasets into WebDataset or TFRecords, and when they want to experiment with curriculum learning or data mixing, they reprocess everything from scratch.

Read more

Date

Tuesday Mar 17 / 11:45AM GMT ( 50 minutes )

Location

Windsor (5th Fl.)

Topics

Machine Learning Infrastructure GPU File Formats Performance Engineering

Share