Dec 06, 2023

Enhancing Druid’s Analytics with Apache Arrow and Flight SQL

Druid currently has two result formats: JSON and a protobuf-based format that uses Apache Calcite Avatica. Both are row-oriented, while all of Druid’s internal data representations are column-oriented, as expected for high-performance analytics. Transposing columnar data into rows on output leaves performance on the table.
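To make the transposition concrete, here is a minimal sketch in plain Python. It is an illustration only, not Druid's actual code path: the column names, the sample values, and the helpers `columns_to_rows` and `rows_to_columns` are all hypothetical. It shows columnar data being pivoted into rows for a JSON response and then pivoted back by a client that wants columns.

```python
import json

# Hypothetical column-oriented data: one list per column (names are made up).
columns = {
    "__time": [1701820800000, 1701820860000, 1701820920000],
    "channel": ["#en", "#fr", "#en"],
    "added": [17, 4, 230],
}


def columns_to_rows(cols):
    """Pivot column lists into row dicts, as a row-oriented result format requires."""
    names = list(cols)
    return [dict(zip(names, values)) for values in zip(*cols.values())]


def rows_to_columns(rows):
    """Pivot row dicts back into column lists, as a columnar client would on receipt."""
    names = list(rows[0]) if rows else []
    return {name: [row[name] for row in rows] for name in names}


rows = columns_to_rows(columns)      # first transposition (server side)
payload = json.dumps(rows)           # row-oriented wire format
restored = rows_to_columns(json.loads(payload))  # second transposition (client side)
```

Both pivots touch every value once, so the work grows with rows times columns. A columnar output format would let the column buffers travel as they are.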

This talk describes the benefits Apache Druid could gain by supporting Apache Arrow as an output format. Apache Arrow is an in-memory, columnar data format. It is used as a backing memory representation by libraries such as pandas and polars, and it allows zero-copy data exchange across many libraries. The session covers ways to improve Druid’s performance, interoperability and flexibility, and to make it easier to integrate Druid with new data sources and analytics pipelines, by leveraging Arrow, Flight SQL and ADBC (Arrow Database Connectivity).
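The zero-copy idea can be sketched with nothing but the Python standard library. This is an analogy rather than Arrow's API: `array` stands in for a contiguous column buffer, and `memoryview` stands in for a consumer that reads the buffer in place. The column name `added` and its values are hypothetical.

```python
from array import array

# A contiguous buffer of 64-bit integers, standing in for one column.
added = array("q", [17, 4, 230, 8])

# A memoryview exposes the same memory without copying it.
view = memoryview(added)
window = view[1:3]      # slicing a memoryview also shares memory

# Slicing the array itself allocates a new, independent buffer.
copied = added[1:3]

# Mutate the original column after both slices were taken.
added[1] = 99

window_values = window.tolist()   # reflects the change: memory is shared
copied_values = copied.tolist()   # unchanged: this slice is a separate copy
```

The shared view sees the write to the original buffer while the copy does not, which is the property that lets libraries built on a common columnar layout hand data to each other without serializing it.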

Given the competitive landscape of data computation engines right now (Snowflake, BigQuery, Druid, Dremio, DuckDB), embracing support for Arrow will help Druid “keep up with the competition” and open doors for enhanced connectivity and performance!
