Splunk: Druid on Kubernetes with Druid-operator

October 3, 2020

We went through the journey of deploying Apache Druid clusters on Kubernetes (K8s) and created a druid-operator (https://github.com/druid-io/druid-operator). This talk introduces the Druid Kubernetes operator, how to use it to deploy Druid clusters, and how it works under the hood. We will also share how we use this operator to deploy Druid clusters at Splunk.

Kubernetes is an open-source system for automating deployment, scaling, and management of containerized applications. Druid is a complex, stateful distributed system, and a Druid cluster consists of multiple web services such as Broker, Historical, Coordinator, Overlord, and MiddleManager, each deployed with multiple replicas. Deploying a single web service on K8s requires creating a few K8s resources via YAML files, and that number multiplies with the services inside a Druid cluster. Doing this for multiple Druid clusters (dev, staging, and production environments) makes it even more tedious and error-prone. K8s supports application-specific extensions called “Operators”, which combine Kubernetes and application-specific knowledge into a reusable K8s extension that makes deploying complex applications simple.
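To make the multiplication of resources concrete, here is a minimal Python sketch that counts the hand-written manifests needed without an operator. It is not taken from the talk or from druid-operator: the choice of three resources per service (ConfigMap, Service, StatefulSet), the naming scheme, the image, the port, and the replica count are all illustrative assumptions.

```python
# Illustrative sketch only: resource choices, names, image, port, and replica
# counts are assumptions, not the layout used by druid-operator or Splunk.

SERVICES = ["broker", "historical", "coordinator", "overlord", "middlemanager"]
ENVIRONMENTS = ["dev", "staging", "production"]


def manifests_for(service, env, replicas=2):
    """Build the K8s resources one Druid service would need in one environment."""
    name = f"druid-{env}-{service}"  # hypothetical naming scheme
    labels = {"app": "druid", "component": service, "environment": env}
    config_map = {
        "apiVersion": "v1",
        "kind": "ConfigMap",
        "metadata": {"name": f"{name}-config", "labels": labels},
        "data": {"runtime.properties": f"druid.service=druid/{service}\n"},
    }
    service_obj = {
        "apiVersion": "v1",
        "kind": "Service",
        "metadata": {"name": name, "labels": labels},
        "spec": {"selector": labels, "ports": [{"port": 8088}]},  # placeholder port
    }
    stateful_set = {
        "apiVersion": "apps/v1",
        "kind": "StatefulSet",
        "metadata": {"name": name, "labels": labels},
        "spec": {
            "serviceName": name,
            "replicas": replicas,
            "selector": {"matchLabels": labels},
            "template": {
                "metadata": {"labels": labels},
                "spec": {
                    "containers": [
                        # placeholder image reference, not a real registry
                        {"name": service, "image": "example.invalid/druid:latest"}
                    ]
                },
            },
        },
    }
    return [config_map, service_obj, stateful_set]


def all_manifests():
    """Collect every resource for every service in every environment."""
    return [
        manifest
        for env in ENVIRONMENTS
        for service in SERVICES
        for manifest in manifests_for(service, env)
    ]


if __name__ == "__main__":
    print(f"{len(all_manifests())} resources to write and keep in sync by hand")
```

Under these assumptions, five services across three environments already come to 45 separate resources, which is the maintenance burden an operator is meant to collapse into one declarative cluster description per environment.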

Building Data Applications with Apache Druid

October 7, 2020

One of the most popular use cases for Apache Druid is building data applications.

GameAnalytics: Building a Real-Time Gaming Analytics Service w/ Druid

October 6, 2020

In this talk, you will learn how we managed to migrate our legacy backend system from using an in-house built streaming analytics service to Apache Druid.

Zeotap: Data Modeling in Druid for Non temporal and Nested Data

October 5, 2020

Druid has been the production workhorse for the past 2+ years at Zeotap powering the core Audience planning across our Connect and Targeting products.

Nielsen: Casting the Spell - Druid in Practice

October 4, 2020

At Nielsen Identity, we leverage Druid to provide our customers with real-time analytics tools for various use-cases.

Splunk: Druid on Kubernetes with Druid-operator

October 3, 2020

This talk introduces the Druid Kubernetes operator, how to use it to deploy Druid clusters, and how it works under the hood.

Archmage, Pinterest’s Real-time Analytics Platform on Druid

October 2, 2020

Pinterest shares why they moved from HBase to Druid, along with the architectural design and current use cases of Pinterest's real-time analytics platform.

How TrafficGuard uses Druid to Fight Ad Fraud and Bots

September 3, 2020

In this session, TrafficGuard’s Head of Data Science, Raigon Jolly, will discuss how TrafficGuard uses Imply/Druid

Maximizing Apache Druid performance: Beyond the basics

September 2, 2020

In this talk, Gian Merlino will walk you through some advanced techniques that can provide a multiplier to your Druid performance.

Enterprise Scale Analytics Platform Powered by Druid at Target

September 1, 2020

In this talk we’ll cover why Target chose to create our own analytics platform and specifically how Druid makes this platform successful.

Self Service Analytics at Twitch

August 31, 2020

In this talk, learn how Twitch implemented a common analytics platform for the needs of many different teams.

How Netflix Uses Apache Druid to Ensure a High-Quality Experience

August 30, 2020

Ensuring a consistently great Netflix experience while continuously pushing innovative technology updates is no easy feat.

Apache Druid Vision and Roadmap

April 17, 2020

Gian Merlino, Apache Druid PMC Chair, offers his reflections on the Druid journey to date, plus describes his vision for what Druid will become.

Automating CI/CD for Druid Clusters at Athena Health

April 16, 2020

Athena Health is creating a new performance management application for its clients. One of its key components is Apache Druid.

Apache Druid for Anti-Money Laundering (AML) at DBS Bank

April 15, 2020

The DBS compliance team uses Druid to help with anti-money laundering (AML), allowing them to explore anomalies and apply machine learning.

How Apache Druid Powers Real-Time Analytics at BT

April 14, 2020

BT discusses their Apache Druid journey, which started in early 2019 when they asked Imply to help them with an in-house Network Performance Management project.

Using Druid for Network Monitoring and Trust Analytics at Cisco

April 10, 2020

Cisco covers experiences and insights on how they deploy, monitor, and integrate Apache Druid with their applications.

Apache Druid Fireside Chat (Ask Us Anything)

April 10, 2020

The world’s most adept Apache Druid experts take any and all questions.

Analytics over Terabytes of Data at Twitter using Apache Druid

April 10, 2020

Twitter discusses the architecture of their analytics platform, Apache Druid cluster setup, hardware choices, monitoring and use cases.