Workshop: Apache Kakfa and KSQL in Action: Let's Build a Streaming Data Pipeline

Location: Henry Moore, 4th flr.

Duration: 1:00pm - 4:00pm

Day of week: Thursday

Level: Beginner

Key Takeaways

  • Brief introduction to Apache Kafka, and streaming concepts

  • Integrating systems with Kafka using Kafka Connect

  • Exploring data with KSQL

  • Using KSQL to filter, join, and aggregate data in Kafka

Prerequisites

A knowledge of SQL is assumed. Participants will benefit from an existing knowledge of the concepts of Kafka (topics, streaming, etc). Participants must have their own laptop and complete the prerequisite steps detailed here: https://github.com/confluentinc/examples/blob/5.0.0-post/ksql-workshop/pre-requisites.adoc.
For reasons of compatibility we strongly recommend the use of Mac or Linux for the workshop.

Have you ever thought that you needed to be a programmer to do stream processing and build streaming data pipelines? Think again! Apache Kafka is a distributed, scalable, and fault-tolerant streaming platform, providing low-latency pub-sub messaging coupled with native storage and stream processing capabilities. Integrating Kafka with RDBMS, NoSQL, and object stores is simple with Kafka Connect, which is part of Apache Kafka. KSQL is the open-source SQL streaming engine for Apache Kafka, and makes it possible to build stream processing applications at scale, written using a familiar SQL interface. 

In this workshop you will learn the architectural reasoning for Apache Kafka and the benefits of real-time integration, and then build a streaming data pipeline using nothing but your bare hands, Kafka Connect, and KSQL. 

Gasp as we filter events in real time! Be amazed at how we can enrich streams of data with data from RDBMS! Be astonished at the power of streaming aggregates for anomaly detection!

Speaker: Robin Moffatt

Developer Advocate @confluentinc

Robin is a Developer Advocate at Confluent, the company founded by the creators of Apache Kafka, as well as an Oracle Groundbreaker Ambassador and ACE Director (Alumnus). His career has always involved data, from the old worlds of COBOL and DB2, through the worlds of Oracle and Hadoop, and into the current world with Kafka. His particular interests are analytics, systems architecture, performance testing and optimization. He blogs at http://cnfl.io/rmoff and http://rmoff.net/ (and previously http://ritt.md/rmoff) and can be found tweeting grumpy geek thoughts as @rmoff. Outside of work he enjoys drinking good beer and eating fried breakfasts, although generally not at the same time.

Find Robin Moffatt at

Tracks