Kafka Summit Logo

Kafka Summit New York

Streaming platforms at massive scale.

May 8, 2017 | New York

Data Processing at LinkedIn with Apache Kafka

Video & Slides

Kafka is a cornerstone of LinkedIn’s data infrastructure. It is the replication stream for Espresso; the message transport for Brooklin (our change capture system), Samza and Venice (our derived data serving store). We describe Kafka’s fundamental roles: data storage, movement, processing and analysis; and discuss the requirements to serve these data systems, issues that we hit and how we addressed them.