Kafka Summit London 2018

April 23-24, 2018 | London

Building a Queryable Kafka Streams Analytics Engine and Integrating it with Druid

In this session we present our journey of building a queryable, real-time streaming analytics engine with the Kafka Streams API. With reliability and fault tolerance in mind, we built a system that spans data centers, processes millions of events per second, and provides real-time insight into user behavior on the internet. Using tools like KSQL, we generated insights into session-based metrics such as active sessions, current active users, and open orders. We also funneled all our data into Druid to run the low-latency OLAP queries that power our free-form reporting engine. Finally, we highlight the lessons we learnt along the way and describe the metrics that proved important in ensuring the integrity of our Kafka cluster.