Oct 22, 2024

Seamless Ingestion of Delta Lake Tables into Apache Druid for Faster Analytics

Delta Lake is an open-source storage layer that brings reliability and performance to data lakes. In this session, we will explore the fundamentals of Delta Lake and demonstrate how to ingest Delta Lake tables into Apache Druid for sub-second query latency. We’ll also introduce an upcoming feature in Apache Druid: scheduled batch supervisors, which can facilitate continuous data ingestion natively in Druid so that your Druid tables stay up to date with the latest data from Delta Lake or any other input source.

Speakers:
– Abhishek Balaji Radhakrishnan, Staff Software Engineer, Imply
– Venki Korukanti, Software Engineer, Databricks

[Timestamp] Table of Contents:
Introduction
[0:00] Agenda
[1:17] Motivation
[3:21] Overview of Delta Lake
Integrating Delta Lake
[4:10] Delta Lake: How does it work?
[9:36] Advanced Delta Feature: Deletion Vector
[11:52] Delta Kernel
[14:35] Integration Architecture
[15:46] DML Query Example
[17:29] Scheduled Batch Ingestion
[20:05] Demo
[24:34] Roadmap
