Dec 06, 2023

Druid Operator: Bridging Kubernetes and Apache Druid

Apache Druid is a real-time distributed data store designed for low-latency queries. It can ingest data in real-time and make it available for querying as soon as an event occurs. The standard service level agreement (SLA) for running Druid requires continuous data ingestion, uninterrupted data availability, and constant data queryability.

However, running Druid on Kubernetes poses significant challenges. Druid comprises multiple components, each with a specific role. Implementing scalable logic for these components, choosing between StatefulSets or Deployments, and managing external dependencies such as ZooKeeper and object storage make it complex. Even a brief downtime can result in significant latency issues, affecting data ingestion to query responsiveness.

The Druid Operator addresses these challenges by encapsulating Druid-specific logic for Kubernetes deployment. It serves as a bridge, providing insights into the current state of the Druid cluster and how to interpret it within the Kubernetes environment.

This talk aims to provide Druid engineers and data operations professionals with valuable insights into leveraging the Druid Kubernetes Operator. It enables efficient management of a distributed Druid cluster’s state, facilitates complex scaling operations, supports ordered rolling upgrades, and automates maintenance actions for seamless operations.

See similar videos

No records found...
Aug 16, 2024

Imply Polaris + Natural Intelligence: Powering Real-time Analytics & Observability

Imply Polaris provides the fastest path to real-time analytics by offering seamless data ingestion, auto-scaling capabilities, built-in visualization, and a secure, fully managed infrastructure. Watch this...

Watch now
Jan 29, 2024

Physical Hardware, Digital Analytics: IoT Challenges, Best Practices, and Solutions

From supporting ad-hoc queries across massive time series datasets to ensuring data freshness and scalability, our panelists will share insights into selecting the right database solutions that can deliver...

Watch now
Dec 11, 2023

Analyzing streaming data with Apache Druid

Streaming data is not only data in motion—it’s a potential source of valuable insights, ready to be harvested and utilized. The challenge is to analyze streaming data at scale and extract these insights—before...

Watch now

Let us help with your analytics apps

Request a Demo