Operating Kafka and its ecosystem at LinkedIn’s scale presents unique challenges and even seemingly small bugs, monitoring gaps or operational errors can have considerable site-wide impact. We will summarize takeaways from the most significant issues of 2015 and some of the features in Kafka that we are working on that will help eliminate or mitigate such incidents in the future.