What if the host goes down? What if the DC goes down? What if the broker is not responding? What if the client dies before an ack? What if I bounce the cluster? What if we run out of disk? In this talk, we explore these and many other questions that we had to answer on our way to becoming comfortable running critical parts of our infrastructure on Kafka, and we offer our solutions along the way.