Feb 23, 2017

Building an Open Source Streaming Analytics Stack with Kafka and Druid

Analytics pipelines running purely on batch processing systems can suffer from hours of data lag. Initial attempts to solve this problem often lead to inflexible solutions, where the queries must be known ahead of time, or fragile solutions where the integrity of the data cannot be assured. Combining Kafka, and Druid can guarantee system availability, maintain data integrity, and support fast and flexible queries.