As streaming platforms become central to data strategies, companies both small and large are re-thinking their architecture with real-time context at the forefront. Monoliths are evolving into Microservices. Datacenters are moving to the cloud. What was once a ‘batch’ mindset is quickly being replaced with stream processing as the demands of the business impose more and more real-time requirements on developers and architects.
This revolution is transforming industries.
What started at companies like LinkedIn, Uber, Netflix and Yelp has made its way to countless others in a variety of sectors. Today, thousands of companies across the globe build their businesses on top of Apache Kafka®. The developers responsible for this revolution need a place to share their experiences on this journey.
Kafka Summit is the premier event for data architects, engineers, devops professionals, and developers who want to learn about streaming data. It brings the Apache Kafka community together to share best practices, write code, and discuss the future of streaming technologies.
Welcome to Kafka Summit San Francisco 2019!
Established in 1999, the ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors. Our all-volunteer board oversees more than 350 leading Open Source projects, including Apache HTTP Server — the world’s most popular Web server software.
The ASF provides an established framework for intellectual property and financial contributions that simultaneously limits potential legal exposure for our project committers. Through the ASF’s meritocratic process known as “The Apache Way,” more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation’s official user conference, trainings, and expo.
Confluent, founded by the original creators of Apache Kafka®, pioneered the enterprise-ready event streaming platform. With Confluent, organizations benefit from the first event streaming platform built for the enterprise with the ease-of-use, scalability, security and flexibility required by the most discerning global companies to run their business in real time. Companies leading their respective industries have realized success with this new platform paradigm to transform their architectures to streaming from batch processing, spanning on-premises and multi-cloud environments. Backed by Benchmark, Index Ventures and Sequoia, Confluent is headquartered in Palo Alto and London with offices globally. To learn more, please visit www.confluent.io. Download Confluent Platform at www.confluent.io/download.
Aiven provides Kafka as a Service along with seven other managed services across six clouds and their regions, making Aiven the largest provider of managed open source data systems in terms of number of clouds, services, and plan options.
Google Cloud is widely recognized as a global leader in delivering a secure, open, intelligent and transformative enterprise cloud platform. Customers across more than 150 countries trust Google Cloud’s simply engineered set of tools and unparalleled technology to modernize their computing environment for today’s digital world.
Founded in 1975, Microsoft (Nasdaq “MSFT”) is a Cloud first, Mobile first company delivering technologies that help businesses worldwide take advantage of mobile, enterprise social, and cloud computing trends to drive growth. Microsoft helps you to move to the cloud on your terms; getting the most value from your existing IT investments while giving you the flexibility to respond quickly to changing business needs. www.Microsoft.com
Slower is a customer-driven consulting firm that helps our customers achieve successful outcomes in their businesses. Our team consists of a highly talented, unique group of thinkers, makers, and doers. Our offerings in Cloud, Data and A.I. are at the forefront of the people, process and technology opportunities our customers are facing. We believe transforming our customers’ businesses is as important as transforming the business of services delivery. Our team carries a continuous humility and drive to learn, adapt and evolve. All customer possibilities are achievable with Slower thinking.
Attunity, a leading provider of data integration and data management software solutions, enables moving, preparing and analyzing data efficiently to increase business productivity and enable better insights for competitive advantage. The company’s high-performance, easy-to-use software solutions include data replication, data warehouse automation, data usage analytics, test data management, data connectivity, and cloud data delivery. Learn more: www.attunity.com.
Our products and services are designed to spark enthusiasm, improve quality of life, and help conserve natural resources. We want to deliver top quality and reliability. In short: we want to create technology that is “Invented for life.”
Camunda builds software for workflow and decision automation. The company develops the popular open source Camunda platform that supports the BPMN and DMN standards. Many organizations world-wide use Camunda for mission-critical business process automation, including Allianz, AT&T, NASA, T-Mobile and Universal Music. Headquartered in Berlin, Camunda has local presences in San Francisco and Denver and official partnerships with more than 100 IT system integrators in more than 30 countries.
CrowdStrike is the leader in cloud-delivered endpoint protection. Leveraging artificial intelligence (AI), the CrowdStrike Falcon platform offers instant visibility and protection across the enterprise and prevents attacks on endpoints on or off the network. CrowdStrike Falcon deploys in minutes to deliver actionable intelligence and real-time protection from Day One. It seamlessly unifies next-generation AV with best-in-class endpoint detection and response, backed by 24/7 managed hunting. Its cloud infrastructure and single-agent architecture take away complexity and add scalability, manageability, and speed. CrowdStrike Falcon protects customers against all cyber attack types, using sophisticated signatureless AI and Indicator-of-Attack (IOA) based threat prevention to stop known and unknown threats in real time. Powered by the CrowdStrike Threat Graph™, Falcon instantly correlates over 150 billion security events a day from across the globe to immediately prevent and detect threats. There’s much more to the story of how Falcon has redefined endpoint protection, but there’s only one thing to remember about CrowdStrike: We stop breaches.
Datadog is a monitoring and analytics platform for cloud-scale infrastructure and applications. Datadog provides full-stack observability by combining logs, infrastructure metrics and events, application performance metrics and end-to-end tracing. With flexible graphs and dashboards, sophisticated alerting, and machine learning functionality for anomaly and outlier detection, the platform provides actionable insight into dynamic, modern environments. Datadog features 250+ vendor-supported integrations, with simple configuration and built-in template dashboards.
HVR is the leading independent provider of real-time data replication technology powered by log-based Change Data Capture (CDC). Log-based CDC enables customers to stream information from the many places it is stored within their organization, such as Oracle, SQL Server, SAP and more. This gives them the ability to fully optimize their use of Apache Kafka technology for a better business. Learn more: hvr-software.com
IBM Event Streams is an event-streaming platform based on the open-source Apache Kafka® project. Event Streams helps you build intelligent, responsive applications that react to events in real time, to deliver more engaging experiences for your customers.
Event Streams:
– Builds on the popular open source Apache Kafka streaming technology
– Helps you deploy and use Apache Kafka in an intuitive and easy manner
– Is ready for mission-critical workloads with geo-replication to assist with disaster recovery initiatives
– Can connect to your existing IBM MQ backbone, making real-time business critical data available to the next generation of event-driven applications
– Is backed by IBM Worldwide support, enabling you to build intelligent apps on Kafka with the confidence IBM is there for you should something go wrong
Imply provides an enterprise-ready, real-time analytics solution around Apache Druid. Apache Druid is a high performance, open source analytics database built for event-driven datasets. Many major enterprises have deployed Druid and Kafka together to deliver, analyze, and store events immediately after they occur. Druid works out of the box with Kafka, provides exactly-once consumption from Kafka, and is paired with Kafka to build modern end-to-end streaming analytics stacks.
Digital transformation is changing our world. As the leader in Enterprise Cloud Data Management, we’re prepared to help you intelligently lead the way and provide you with the foresight to become more agile, realize new growth opportunities or even create new inventions. We invite you to explore all that Informatica has to offer—and unleash the power of data to drive your next intelligent disruption. Not just once, but again and again.
Lenses.io is a DataOps platform for streaming technologies like Apache Kafka. Lenses® enables a seamless experience for running your data platform on-prem, in the cloud or hybrid, and puts DataOps at the heart of your business operations. It provides self-service data-in-motion control, letting you build and monitor your data flows while security, data governance and data ethics are treated as first-class citizens. As a streaming platform overlay technology, Lenses® integrates with Kubernetes and can run with any distribution of Apache Kafka, including AWS MSK and Azure HDInsight. Wanna give it a try? Find more at https://lenses.io
Lyft was founded in 2012 by Logan Green and John Zimmer to improve people’s lives with the world’s best transportation, and is available to approximately 95 percent of the United States population as well as select cities in Canada. Lyft is committed to effecting positive change for our cities by offsetting carbon emissions from all rides, and by promoting transportation equity through shared rides, bikeshare systems, electric scooters, and public transit partnerships.
MemSQL is The No-Limits Database™, powering modern applications and analytical systems with a cloud-native, massively scalable architecture for maximum ingest and query performance at the highest concurrency. MemSQL envisions a world where every business can make decisions in real time and every experience is optimized through data. Global enterprises use the MemSQL distributed database to easily ingest, process, analyze, and act on data in order to thrive in today’s insight-driven economy. MemSQL is optimized to run on any public cloud or on-premises with commodity hardware. Visit www.memsql.com or follow us @memsql.
Redis Labs, home of Redis, the world’s most popular in-memory database, and provider of Redis Enterprise, delivers superior performance, reliability and flexibility for personalization, machine learning, IoT, search, ecommerce, social and metering solutions worldwide. Modern businesses depend on Redis Labs to deliver instant experiences, reliably and at scale.
Rockset is a serverless search and analytics engine that delivers millisecond-latency SQL over TBs of raw data, without any ETL. Rockset integrates with Kafka to continuously ingest event streams without requiring a schema, while providing full SQL support for filtering, aggregations and joining streaming data with other data sets. Rockset powers data-driven applications and interactive dashboards without requiring users to manage custom pipelines, servers or databases. Try Rockset, and go from useful data to useful applications in minutes, at rockset.com
Scylla is the real-time big data database, with scale-up performance of 1,000,000 OPS per node, scale-out to hundreds of nodes and P99 latency of <1 msec. Fully compatible with Apache Cassandra, Scylla embraces a shared-nothing approach that increases throughput and storage capacity to 10X that of Cassandra. From the team responsible for the KVM hypervisor, Scylla helps organizations realize order-of-magnitude performance improvements, reduce hardware costs and lessen administration. For more information: ScyllaDB.com
Solace provides the only unified advanced event broker technology that supports publish/subscribe, queueing, request/reply, message replay and streaming using open APIs and protocols across hybrid cloud and IoT environments. Established enterprises such as SAP, Barclays and the Royal Bank of Canada as well as high-growth companies such as VoiceBase and Jio use Solace to modernize legacy applications and successfully pursue analytics, hybrid cloud and IoT strategies.
StreamSets transforms how enterprises flow big and fast data from myriad sources into data centers and cloud analytics platforms. Its DataOps platform helps companies build and operate continuous dataflow topologies, combining award-winning open source data movement software with a cloud-native Control Hub. Enterprises use StreamSets to enable cloud analytics, data lakes, Apache Kafka, IoT and cybersecurity. For more information, visit www.streamsets.com.
Tinder is the world’s leading app for meeting new people. Available in 190 countries and 40+ languages, Tinder is a top 5 grossing non-gaming app globally. Kafka at Tinder plays the following critical roles:
1. A central messaging system to power Tinder’s data pipeline that collects, aggregates and transforms billions of events each day for our BI and ML;
2. A robust event processing pipeline that powers critical applications such as payment processing, push notifications, user behavioral classification, abuse detection and much more;
3. A streaming platform to provide consumable, real-time streaming events for change data capture. It enables our backend systems to move toward event-based processing and decouples inter-service dependencies;
4. A highly scaled messaging bus for collecting and transporting logs and observability metrics.