Druid vs Rockset Revisited

Mar 05, 2021
Alan Li

Rockset recently published a blog post comparing the performance of Apache Druid 0.18 with Rockset using the Star Schema Benchmark (SSB). Druid 0.18 is about 9 months out of date at this point, so we wanted to revisit the benchmark using the latest version of Druid (0.20.1), which includes several performance improvements we have made over the last few months.

Our findings are below:

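| Query | Imply 3.3.0 | Imply 2021.02 | Rockset |
|-------|-------------|---------------|---------|
| Q1.1  | 1245        | 462           | 944     |
| Q1.2  | 155         | 154           | 254     |
| Q1.3  | 92          | 160           | 296     |
| Q2.1  | 662         | 139           | 161     |
| Q2.2  | 591         | 102           | 136     |
| Q2.3  | 319         | 79            | 129     |
| Q3.1  | 292         | 333           | 626     |
| Q3.2  | 514         | 102           | 598     |
| Q3.3  | 417         | 91            | 343     |
| Q3.4  | 100         | 72            | 32      |
| Q4.1  | 883         | 179           | 384     |
| Q4.2  | 389         | 109           | 132     |
| Q4.3  | 384         | 87            | 41      |
| Total | 6043        | 2069          | 4076    |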
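
The totals in the last row, and the ratios between systems, can be checked with a few lines of Python. This is only a sketch of the arithmetic: the numbers are copied from the results above, while the names `RESULTS`, `totals`, and `slower_than` are illustrative and not part of any benchmark tooling.

```python
# Timings copied from the results above:
# (query, Imply 3.3.0, Imply 2021.02, Rockset)
RESULTS = [
    ("Q1.1", 1245, 462, 944),
    ("Q1.2", 155, 154, 254),
    ("Q1.3", 92, 160, 296),
    ("Q2.1", 662, 139, 161),
    ("Q2.2", 591, 102, 136),
    ("Q2.3", 319, 79, 129),
    ("Q3.1", 292, 333, 626),
    ("Q3.2", 514, 102, 598),
    ("Q3.3", 417, 91, 343),
    ("Q3.4", 100, 72, 32),
    ("Q4.1", 883, 179, 384),
    ("Q4.2", 389, 109, 132),
    ("Q4.3", 384, 87, 41),
]


def totals(rows):
    """Sum each system's column across all queries."""
    return tuple(sum(row[i] for row in rows) for i in (1, 2, 3))


def slower_than(rows, col, other):
    """Return the queries where column `col` is slower than column `other`."""
    return [row[0] for row in rows if row[col] > row[other]]


old_total, new_total, rockset_total = totals(RESULTS)
print("totals:", old_total, new_total, rockset_total)
print("old Druid / new Druid:", round(old_total / new_total, 2))
print("Rockset / new Druid:", round(rockset_total / new_total, 2))
# Queries where the newer release is still behind Rockset.
print("new Druid slower than Rockset on:", slower_than(RESULTS, 2, 3))
```

Comparing totals hides per-query differences, which is why the sketch also lists the queries where the ordering is reversed.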

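The latest version of Druid is markedly faster: the total across the 13 queries falls from 6043 with the older release to 2069 with the latest one, roughly a 2.9x improvement, and about half of Rockset's total of 4076. Two main things drove most of the change: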

1) Many of the queries in the SSB benchmark use expressions. Older versions of Druid did not implement vectorized expressions, so many of these queries could not vectorize. In the latest version, all of them vectorize.

2) The schema used to ingest data was changed to match Rockset's. Just as Rockset “specified some keys for column-based clustering”, we also used column-based clustering (called partitioning in Druid). No further tuning was done in Druid.
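
One way to check that a query vectorizes is to ask Druid to fail if it cannot. The sketch below builds a request body for Druid's SQL endpoint with the `vectorize` and `vectorizeVirtualColumns` query context parameters set to `force`. It makes several assumptions: the router URL is a placeholder, the table name `lineorder_flat` and the denormalized form of SSB Q1.1 are hypothetical (this post does not give the schema that was used), and you should confirm that both context parameters are available in your Druid version.

```python
import json
import urllib.request

# Placeholder router address (assumption); adjust for your cluster.
DRUID_SQL_URL = "http://localhost:8888/druid/v2/sql"

# SSB Q1.1 written against a hypothetical denormalized table.
# The table and column names are assumptions, not the schema used in this post.
SSB_Q1_1 = """
SELECT SUM(lo_extendedprice * lo_discount) AS revenue
FROM lineorder_flat
WHERE d_year = 1993
  AND lo_discount BETWEEN 1 AND 3
  AND lo_quantity < 25
"""


def build_sql_request(sql, vectorize="force"):
    """Build the JSON body for a Druid SQL query with vectorization settings.

    With "force", a query that cannot be vectorized is expected to fail
    rather than silently fall back to non-vectorized execution.
    """
    return {
        "query": sql.strip(),
        "context": {
            "vectorize": vectorize,
            "vectorizeVirtualColumns": vectorize,
        },
    }


def run_query(sql, url=DRUID_SQL_URL):
    """POST the query to Druid and return the decoded JSON result rows."""
    body = json.dumps(build_sql_request(sql)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `run_query(SSB_Q1_1)` against a running cluster would send the query. `build_sql_request` can be inspected on its own without a server.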

Our next release of Druid will include another set of performance improvements. Stay tuned for more information.
