Dec 06, 2023

Moving ingestion from 3 hours to 5 minutes – Challenges and Mitigations

This is a real world account from a Druid cluster in production. A story of 48 hours of debugging, learning and understanding Druid better, filing a couple of issues in Druid github and finally a stable production pipeline again thanks to the Druid community. We will discuss the bottlenecks we had in overlord, slot issues for Peons in middle managers, coordinator bottlenecks, how to mitigated task and segment flooding, what configs we changed sprinkled with real world numbers and snapshots from our graphana dashboards. Finally we will list all the latest awesomeness in Druid 25.0 that helped us in this endeavour, how we discovered those midnight and our learnings.

See similar videos

No records found...
Jan 07, 2026

Strategies for Managing Your Splunk Spend at Scale in 2026

Learn how a decoupled architecture for Splunk—powered by Imply Lumi and Federated Search—helps you keep more data searchable, reduce costs, and scale efficiently without changing existing Splunk workflows.

Watch now
Nov 19, 2025

Observability at a Breaking Point: How Decoupling Unlocks Speed, Scale, & Savings

Learn how decoupled observability helps you do more with your Splunk data, reduce costs, and scale efficiently with Federated Search.

Watch now
Oct 22, 2024

Keynote: Powering Event-Driven Data with Apache Druid

The distinction between OLTP and OLAP is becoming less relevant as data architectures shift toward entities and events. In this session, we’ll delve into how Apache Druid’s event-first approach synthesizes...

Watch now

Ready to decouple your observability stack?
No workflow changes. No migrations. More data, less spend.

Request a Demo