Presentation: Real-Time Decisions Using ML on the Google Cloud Platform

Track: Distributed Stateful Systems

Location: Mountbatten, 6th flr.

Duration: 2:55pm - 3:45pm

Day of week: Wednesday

Level: Advanced

Share this on:

Abstract

Ocado Technology is providing a full solution to put the world’s retailers online using the cloud, robotics, AI and IoT. Processing tens of thousands of orders every day, we generate millions of events every minute, leading to huge amount of data to be managed. We will present how this Big Data is handled in Google Cloud Platform to build a end-to-end machine learning pipeline: how data is stored and processed in BigQuery, post-processed and copied with Dataflow, then used to train Deep Neural Network models with TensorFlow, how all this is orchestrated using our in-house scheduling software called Query Manager, and how predictions are finally run in real-time using Cloud ML Engine and Datastore.

Question: 

What is the focus of your work today?

Answer: 

Carlos: The team is currently building the new ML-powered fraud application, to be used by fraud agents at Ocado and also other retailers using our Ocado Smart Platform. The team is currently focused on building the pipelines that allow us to integrate the real-time production systems with the “big data” stored in Google Cloud and on adding more features to the ML models that will improve their accuracy.

Przemek: My team is building a machine learning platform on top of Google Cloud to help our data scientists be more productive. We would like them to focus on the things they do best - data exploration - instead of doing purely engineering work. We hope that by giving them a set of proper tools they will become self-sufficient to productionise whatever machine learning models they create.

At the same time, we aim to lower the entry barrier to machine learning for all non-data scientists. We firmly believe that this is the way to be successful in the AI space - to have all engineers incorporate simple ML models into their products instead of just having a handful of state-of-the-art solutions created for and by data scientists.

Question: 

What’s the motivation for this talk? 

Answer: 

Carlos: Most talks about ML are usually given by data scientists who have a research background. This is usually intimidating for Software Engineers, but also, we believe they overlook important aspects such as how to deploy those models to production, build automated pipelines, monitor their accuracy, etc.

Przemek: We’ve heard people say that all processes and tricks known for years in the software industry no longer apply in the machine learning domain and one needs to fundamentally change the way of thinking to be successful in AI. While there may be some truth to it, we still think that the plain, old software engineering methods established over the last decades can enable the success of machine learning projects.

Question: 

How you you describe the persona and level of the target audience?

Answer: 

Carlos: I expect to see mainly software engineers who have had some exposure or interest in machine learning. On the other hand, I also think this is relevant for data scientists who want to learn more about how others have “productionised” ML-based systems. 

Przemek: To add what Carlos has said - we will cover lots of services available in the Google Cloud Platform, so anyone interested in knowing more about GCP will surely benefit from our talk.

Question: 

What do you “that” persona to walk away from your talk knowing that they might not have known 50 minutes before? 

Answer: 

Carlos: We expect they will have a better understanding of components in Google Cloud Platform and what use cases they work best for. Also, we want to give the attendees some first-hand advice based on the lessons we’ve learnt building production ML systems.

Przemek:

  • Make full use of the services available in the cloud, so that your unicorn data scientists can focus on data exploration and building models rather than worrying about underlying infrastructure.
  • Machine learning solutions might serve different business purposes (such as recommendations and fraud detection), but they share a lot in terms of architecture
  • You can do everything using a single Google Cloud Platform stack: data exploration, feature engineering, modelling, training and serving. There’s no need to go anywhere else.
  • The Google Cloud Machine Learning Engine makes neural networks dead easy to use, so go ahead and try it.
Question: 

What trend in the next 12 months would you recommend an early adopter/early majority SWE to pay particular attention to?

Answer: 

Carlos: When it comes to Google services, I think this year we’ll hear more about Spanner, since it was made publicly available last year.

Amazon SageMaker, released at the end of 2017, is Amazon’s bet to become a first-class player in the ML ecosystem and will probably attract data scientists and software engineers, as AWS seems to become more mature in the ML space.

Przemek: Currently, a natural step for a companies that would like to start using machine learning is to try out high-level APIs available in the cloud, like Amazon Rekognition or Google Vision API. Those are great tools as long as you don’t need customizations. If you do, then the only possibility is to hire machine learning experts and create neural nets tailored to your problem. This is of course very expensive and not every company is prepared for such investment.

Recently there’s a new kind of services emerging, that are positioned somewhere in the middle between high-level APIs and low-level neural net frameworks. One example is Google Cloud AutoML, which will train a custom vision model on the data you’ve provided. I believe in the next 12 months we will see more of these specialized “model trainers” in different domains and it’ll be a huge game-changer for smaller companies which couldn’t afford to do ML before.

Speaker: Przemyslaw Pastuszka

ML Engineer @Ocado

Przemek is a software engineer with six years of experience in the Big Data space. He started his career working for a US-based startup called Hadapt (now acquired by Teradata), building a distributed in-house file system which was designed as a highly-performant replacement for HDFS. After joining Ocado Technology, he has helped the company move away from Oracle-centric analytics and transition all data-related operations into the cloud by building Google Cloud Platform tools. Recently, he's been working more closely with data scientists to help them get their work into production. He's also building a machine learning platform which will democratize ML within the company and allow all engineers to build their own models.

Find Przemyslaw Pastuszka at

Speaker: Carlos Garcia

Ocado Smart Platform Fraud Team Lead

Carlos has eight years of experience developing software, most of it in high-traffic applications in the travel industry. Carlos joined Ocado Technology one year ago, as team leader of the Fraud Detection team. He has participated in the definition and implementation of a production-ready architecture of the new Ocado fraud systems. This new system has transformed the way fraud is detected at Ocado, allowing fraud agents to interact with multiple Machine Learning systems in the background.

Find Carlos Garcia at

Last Year's Tracks

Monday, 5 March

Tuesday, 6 March

Wednesday, 7 March