Dec 06, 2023

Streaming Ingestion – A Look Under The Hood

I’ve always thought that an intricate understanding of how something works helps you use it more effectively. In this session, I’ll try to convey what I’ve learned from some of Apache Druid’s creators about the inner workings of its streaming ingestion. We’ll start at a high level with how parallelism throughout the data pipeline drives scalability for real-time analytics. Then we’ll open the hood and see what’s inside each parallel ingestion task. We’ll talk threads, buffers, inner architecture, and some of the main configuration parameters that control each task’s ability both to ingest streaming data and to serve queries on that data as it arrives.

