Simplifying Real-Time ML Pipelines with Quix Streams: An Open Source Python Library for ML Engineers

As data volume and velocity continue to increase, the need for real-time machine learning (ML) is becoming more pressing. However, building real-time ML pipelines can be complex and time-consuming, requiring expertise in both ML and streaming application development.

This talk will address this problem by introducing Quix Streams, an open-source Python library that makes it easy for data scientists and ML engineers to build real-time ML pipelines without having to learn the intricacies of building a streaming application from scratch.

In this talk, we’ll cover:

  • The growing importance of real-time ML in today's application stack, and the use cases for real-time ML processing.
  • A comparison of different ML architectures (batch, request-response, stream, and hybrid) and their pros and cons
  • The current state of streaming architecture, which is typically Java-based, and the challenges this poses for data scientists and ML engineers who primarily work in Python
  • An overview of Quix Streams and its features, including a demo of how to use it to build real-time ML pipelines

This talk is relevant for data scientists, ML engineers, and software engineers who are looking to adopt new technologies and practices in order to build real-time ML pipelines and stay current in their field.

What's the focus of your work these days?

I work as a technical authority for the engineering team here at Quix and I’m responsible for the direction of the company across the full technical stack. On top of that, I’m often on the road speaking at conferences and meetups about enabling data teams to do stream processing.

What's the motivation for your talk at QCon London 2023?

It’s traditionally been very difficult for data teams to do stream processing. It’s something I saw first-hand working at McLaren F1. Enabling Data Scientists to work directly with the product using ML/AI has the potential to revolutionalize next-gen products. The current streaming stack is not built to help with this transition to empower data teams and that’s the driving motivation behind open-sourcing Quix Streams and this talk.

How would you describe your main persona and target audience for this session?

Senior data scientists, software engineers and ML engineers.

Is there anything specific that you'd like people to walk away with after watching your session?

I'd like them to walk away with an idea of how to build next-gen real-time products powered by ML.


Speaker

Tomáš Neubauer

CTO & Co-Founder @Quix

Tomáš Neubauer is a co-founder and the CTO at Quix, works as a technical authority for the engineering team and is responsible for the direction of the company across the full technical stack. He was previously technical lead at McLaren, where he led architecture uplift for Formula 1 racing real-time telemetry acquisition. He later led platform development outside motorsport, reusing the know-how he gained from racing.

Read more
Find Tomáš Neubauer at:

Date

Wednesday Mar 29 / 04:10PM BST ( 50 minutes )

Location

Mountbatten (6th Fl.)

Topics

python library Machine Learning real time open source

Share

From the same track

Session Machine Learning

Strategy & Principles to Scale and Evolve MLOps @DoorDash

Wednesday Mar 29 / 10:35AM BST

MLOps has become a major enabler to successfully operationalize ML applications and for ML practitioners to realize the power of ML to bring impact to business.  The journey to implementing MLOps will be unique to each company.

Speaker image - Hien Luu
Hien Luu

Sr. Engineering Manager @DoorDash & Author of Beginning Apache Spark 3, Speaker and Conference Committee Chair

Session AI

Responsible AI: From Principle to Practice!

Wednesday Mar 29 / 01:40PM BST

Enabling responsible development of artificial intelligent technologies is one of the major challenges we face as the field moves from research to practice. Researchers and practitioners from different disciplines have highlighted the ethical and legal challenges posed by the use of machine learn

Speaker image - Mehrnoosh Sameki
Mehrnoosh Sameki

Principal PM Manager @Microsoft

Session Digital Twins

Cognitive Digital Twins: A New Era of Intelligent Automation

Wednesday Mar 29 / 11:50AM BST

Traditionally, Digital Twins have been helping businesses make data-driven decisions, increase efficiency, and improve the overall performance of their physical assets.

Speaker image - Yannis Georgas
Yannis Georgas

Intelligent Industry Lead @Capgemini

Session Graphs

Graph Learning at the Scale of Modern Data Warehouses

Wednesday Mar 29 / 02:55PM BST

Data warehouses have become a staple for enterprises, providing a wealth of information that can be harnessed to improve decision-making through the use of machine learning (ML).

Speaker image - Subramanya Dulloor
Subramanya Dulloor

Founding Engineer @Kumo.ai