Dec 06, 2023

Druid Operator: Bridging Kubernetes and Apache Druid

Apache Druid is a real-time distributed data store designed for low-latency queries. It can ingest data in real time and make it available for querying as soon as an event occurs. The standard service level agreement (SLA) for running Druid requires continuous data ingestion, uninterrupted data availability, and constant data queryability.

However, running Druid on Kubernetes poses significant challenges. Druid comprises multiple components, each with a specific role. Implementing scalable logic for these components, choosing between StatefulSets and Deployments, and managing external dependencies such as ZooKeeper and object storage all add complexity. Even brief downtime can cause significant latency issues, affecting everything from data ingestion to query responsiveness.

The Druid Operator addresses these challenges by encapsulating Druid-specific logic for Kubernetes deployment. It serves as a bridge: it provides insight into the current state of the Druid cluster and into how that state should be interpreted within the Kubernetes environment.

This talk aims to give Druid engineers and data operations professionals practical insight into using the Druid Kubernetes Operator. The operator enables efficient management of a distributed Druid cluster’s state, facilitates complex scaling operations, supports ordered rolling upgrades, and automates maintenance actions.

See similar videos

Jan 29, 2024

Physical Hardware, Digital Analytics: IoT Challenges, Best Practices, and Solutions

Electric vehicle maker Rivian and German startup Thing-it were kind enough to talk us through how real-time data and analytics play a key part in the evolving landscape of IoT (Internet of Things). The wealth...

Dec 11, 2023

Analyzing streaming data with Apache Druid

Streaming data is not only data in motion—it’s a potential source of valuable insights, ready to be harvested and utilized. The challenge is to analyze streaming data at scale and extract these insights—before...

Dec 06, 2023

Real-Time Analytics in the Real World

Engineering teams increasingly have to deliver insights in real time. But as they aim to reduce latency from event to insight, they also face the challenge of dealing with larger and more complex data and concurrent...
