RixEngine

RixEngine Delivers Real-Time Observability for Global Ad Exchanges

“The transition to Imply Enterprise Hybrid has significantly enhanced our operational efficiency and flexibility for data roll-ups, drill-downs, and multi-dimensional data analysis. Imply has become an indispensable tool for our business and marketing teams, providing data-driven insights to optimize ad strategies and deliver real-time, actionable insights.”
Zeyu Huang, Data Engineer  |    RixEngine

Summary

RixEngine reduced costs by over 50% and halved query latency with real-time observability, giving partners instant, actionable insights.

Highlights

  • Decreased infrastructure costs by 50%+ due to efficient data compression, tiered storage, streamlined cluster management, and a 70% reduction in bandwidth usage
  • Improved query speed from >10 seconds to <5 seconds
  • Saved 8 hours of engineering resources per week
  • Accelerated data analysis across 100+ dimensions with real-time dashboards
  • Lowered operational overhead with simplified ingestion, management, and support

Background

RixEngine, an AI-driven advertising exchange platform incubated by Baidu Global, operates independently as a full-stack SaaS solution for programmatic advertising. This advanced platform requires robust analytics capabilities to facilitate real-time bidding and generate actionable insights for performance optimization.

Prior to adopting Imply Enterprise Hybrid for real-time analytics, RixEngine relied on Apache Druid® and Amazon Redshift as their core database technologies. For visualization, they used Metabase and an internal operations platform.

Challenge

Before switching to Imply Enterprise Hybrid, RixEngine relied heavily on open-source Druid as their primary database. Although Druid provided a performant foundation for real-time analytics, they faced several challenges that prompted them to upgrade to Imply Hybrid as a more robust and scalable solution:

  • Unstable Query Performance: Initially, RixEngine attempted to improve query performance by adding more data nodes, but this approach proved to be costly and only provided marginal improvements. They tried tuning parameters for the Historical and Broker nodes, but couldn’t achieve the desired performance levels.
  • Data Ingestion Bottlenecks: With daily data ingestion volumes reaching hundreds of terabytes, RixEngine frequently experienced data consumption delays. The ingestion pipeline felt overwhelmed, and the service was often fragile. Even minor changes, such as adding a new field, could lead to unexpected issues and disruptions.
  • Operational Complexity: Managing and maintaining the Druid cluster required significant effort, and the system’s limitations made it difficult to scale efficiently or adapt to evolving business needs.
  • Data Silos: Data was spread across various systems, including Kafka streams (ad request and reporting logs, blocked ad request logs, machine monitoring data), AWS S3, Google Cloud Storage (GCS), MySQL databases (operational platform data), and third-party platforms providing anti-fraud information. As each source had different urgency and importance profiles, the distributed landscape made comprehensive analysis difficult, slowed down reporting, and increased operational overhead.

Solution

Overall, Imply Hybrid provided stable and high-performance queries, efficient handling of large-scale data ingestion, and flexible, in-depth data exploration. Imply’s managed services also reduced operational overhead for the RixEngine team, allowing them to focus more on delivering value to the business. Beyond these core capabilities, Imply delivered two key improvements for RixEngine: (1) unified data analysis and (2) flexible data management.

(1) Unified data analysis: Imply helped RixEngine eliminate data siloes by centralizing data into a unified platform and improving data quality through cleansing, transformation, and enrichment. This consolidation replaced a fragmented architecture that spanned across multiple databases, BI tools and self-managed systems ー thereby reducing operational overhead and the risk of data errors. Imply provides a single source of truth via:

  • Real-time Ingestion (Kafka): Imply’s robust Kafka integration allows RixEngine to ingest streaming data with low latency, which is crucial for near real-time dashboards, monitoring, and alerting on critical events.
  • Batch Ingestion (MSQ): For less time-sensitive data in S3 and GCS, RixEngine leverages Imply’s MSQ (Multi-Stage Query engine) for efficient batch ingestion and reindexing. This lends flexibility to load historical data or update existing datasets.
  • Lookups: Imply’s Lookup feature further enriches their data by augmenting event data with contextual information stored in an external system (MySQL).

(2) Flexible data management: Imply Hybrid’s flexibility allows RixEngine to tailor data management to their specific needs for optimal performance. To illustrate:

  • Adaptive Data Compression: For data with low query frequency, they leverage Imply’s auto-compaction feature, minimizing storage costs without impacting performance. Conversely, for frequently queried data, they use MSQ to periodically compress data older than three hours. This ensures optimal query performance for recent data while maintaining cost-effective storage for historical data.
  • Real-time Data Enrichment with Lookups: For contextual data or ID-to-name mappings stored in MySQL, Imply’s lookup functionality enables rapid implementation and enrichment of data, requiring minimal code changes and accelerating time to market for new features.
  • Efficient Data Synchronization with MSQ: As RixEngine maintains a separate copy of critical data in Redshift and other databases, they use Imply’s MSQ to efficiently export key dimensions and metrics from various sources to S3, enabling seamless data synchronization between systems. This streamlines their data pipelines and ensures data consistency across their organization.

Impact

Switching to Imply Enterprise Hybrid helped RixEngine decrease machine costs by over 50%, while reducing query times from over 10 seconds to under 5 seconds. Their ability to reduce the total cost of infrastructure (TCO) by 50% stemmed from data node optimization (50% fewer nodes deployed for equivalent workloads with Imply vs. self-hosted Druid), bandwidth optimization across machines (co-developed a multi-stage aggregation pipeline with Imply, reducing bandwidth usage by 70%), tool consolidation, and operational efficiency gained from data replay incident reduction and lower DevOps overhead.

Additionally, Imply saved RixEngine’s engineering team 8 hours per week by improving data analysis, monitoring, visualization and support processes via:

  • Enhanced Data Analysis: The data cube, alert, and dashboard modules have enriched their data analysis capabilities, enabling quick issue identification across nearly 100 dimensions. This has allowed them to pinpoint root causes, whether they originate from our service, network providers, or upstream/downstream users.
  • Superior Support: The support team’s proactive standby during upgrades, scenario-based testing, and assistance in testing new features like window functions have enhanced RixEngine’s efficiency.
  • Improved Monitoring with Clarity: Clarity has provided better service and query monitoring. RixEngine can observe which dimensions have the poorest query performance and consider adding them as partition keys to optimize performance.
  • User-Friendly UI: The Pivot design in Imply’s UI is more intuitive and user-friendly compared to alternatives like Metabase and Superset, making it easier for users to perform queries and navigate the platform.

Moving forward, RixEngine plans to continue advancing along the path of data- and algorithm-driven development by combining Imply’s data analysis capabilities (e.g. more SQL quantile-related and window functions) with large language models. They’re also investigating how to make their visualizations more accessible via Pivot 一 The overall goal is to guide the team toward deeper data analysis and present it in an easily understandable way through Pivot, lowering the barrier for non-developers to use these functions.

References

See more similar to RixEngine

Metaimpact Delivers Real-Time Visibility into Data Stack Performance

Metaimpact upgraded to Dimension Tables for ~120 ms queries, 2× better memory efficiency, and 10% cost savings, improving infrastructure performance visibility.

Learn More
Adikteev

Adikteev Delivers Real-Time Visibility and Alerts for Campaign Health

Adikteev cut dashboard latency from over 7 minutes to seconds, enabling self-service exploration of campaign telemetry and proactively alerting clients to DAU shifts two days earlier than internal pipelines.

Learn More
Pinterest

Pinterest Delivers Real-Time Advertising Analytics at Massive Scale

Pinterest processes 500,000 ad events per second and answers 99% of advertiser queries in ~250 ms, reducing event-to-ingestion time to under a minute for near-instant advertiser insights.

Learn More
Poshmark

Poshmark Delivers Real-Time Observability for User Behavior & A/B Testing

Poshmark supports 80M users and 60+ data dimensions, cutting dashboards to <5 seconds and enabling A/B test insights in <2 seconds for fast visibility into platform usage.

Learn More
Nielsen Marketing Cloud

Nielsen Marketing Cloud Delivers Real-Time Audience Visibility at Scale

Nielsen enables customers to drill into 80,000 attributes with real-time queries over terabytes of daily data, using distinct-count sketches for fast, accurate insights into audience trends.

Learn More

Atlassian Delivers Real-Time Visibility for Usage Analytics

Atlassian improved analytics with 5× faster performance, ≤100 ms queries, and 5+ years of retained data, giving customers instant visibility into long-term usage trends.

Learn More
Pepsi logo

PepsiCo Delivers Real-Time Operational Visibility Across Sales, Supply Chain & Marketing

PepsiCo enables subsecond queries over massive event data, improving responsiveness in sales, predictive out-of-stock detection, and embedding operational insights into tools with minimal overhead.

Learn More
Paytm

Paytm Delivers Real-Time Visibility into Customer Behavior at Petabyte Scale

Paytm cut infra costs by 50%, boosted performance 10×, and freed 12 weekly engineering hours by enabling real-time behavioral analytics at petabyte scale.

Learn More
Blis

Blis Delivers Real-Time Visibility into Massive Ad Auction Scale

Blis processes hundreds of thousands of ad auction requests per second across 35,000 publishers and 200 million consumers, giving customers instant insights like ad-opportunity counts by place and time.

Learn More
Twitch

Twitch Delivers Real-Time Visibility for Usage and Performance

Twitch processes ~80B events per day and supports ~70,000 queries from 500+ internal users with <500 ms response times, enabling teams to observe usage and performance trends in real time.

Learn More
PayPal

PayPal Delivers Real-Time Visibility into the User Journey

PayPal ingests 5.5B events daily and runs tens of thousands of queries to monitor KPIs and identify pain points, giving teams fast insights to improve user experience.

Learn More
Netflix

Netflix Delivers Real-Time Observability Across Playback Quality

Netflix ingests 2 million events per second and queries 1.5 trillion rows in milliseconds, enabling real-time monitoring of playback quality for a consistent viewing experience.

Learn More
Expedia Logo

Expedia Delivers Real-Time Visibility for Traveler Segmentation

Expedia cut query latency from 24 hours to <5 seconds, enabling dynamic traveler segmentation across multiple datasets and giving marketing and ops teams near-instant insights.

Learn More
Reddit

Reddit Delivers Real-Time Analytics for Ad Campaign Performance

Reddit accelerated query performance and improved availability to 99.9%, enabling advertisers to explore six months of ad event data in real time to optimize campaign outcomes.

Learn More
WalkMe

WalkMe Delivers Real-Time Observability Across Device Performance

WalkMe replaced slow search systems with a real-time platform to monitor billions of client devices, giving teams cost-effective visibility into performance and usage.

Learn More
The Royal and Ancient Golf Club of St Andrews logo, aka. R&A

The R&A Delivers Real-Time Observability During Major Sporting Events

The R&A ingested 250M new rows every 15 minutes and processed 25B rows with subsecond queries—maintaining 100% uptime and enabling teams to respond instantly during live events.

Learn More
Citrix

Citrix Delivers Real-Time Observability for Threat Detection and Environment Health

Citrix ingests 3B events daily, delivers 99.9% uptime, and meets SLOs on 90% of queries—providing proactive insider threat detection and visibility into its environment.

Learn More
Amobee

Amobee Delivers High-Performance Ad Analytics with Real-Time Query Flexibility

Amobee runs subsecond queries over trillions of rows with hundreds of concurrent users, enabling advertisers to explore any market or time period in real time while reducing costs.

Learn More
Salesforce

Salesforce Delivers Edge Observability Across Releases & Performance

Salesforce ingests 5B events per day, cuts storage 47%, and speeds queries by ~30%, enabling release comparisons and performance visibility for 150,000+ customers.

Learn More
Zillow Group

Zillow Group Delivers Real-Time Visibility for Self-Serve Analytics

Zillow enables product and business teams to self-serve ad hoc analytics with 5-minute onboarding and effortless scaling, reducing overhead while accelerating decisions.

Learn More
NTT

NTT Delivers Real-Time Observability for Global IP Traffic

NTT ingests 100k+ events/sec to provide real-time network visibility across its global IP backbone, enabling both technical and non-technical users to explore traffic trends.

Learn More
Ibotta

Ibotta Delivers Real-Time Observability for Fraud & Incident Detection

Ibotta built real-time fraud detection and incident response, cutting costs by 25% and enabling 30× more users with self-service analytics for faster visibility into anomalies.

Learn More
Confluent

Confluent Delivers Real-Time Observability into Streaming Cluster Operations

Confluent ingests 5M+ events/sec and supports hundreds of subsecond queries on high-cardinality metrics, giving real-time visibility into thousands of Kafka clusters.

Learn More
Charter

Charter Communications Delivers Real-Time Visibility into Product Performance

Charter scaled to 100–150k messages/sec, boosted storage 10×, and doubled throughput, enabling real-time visibility into product performance and customer experience.

Learn More
Splunk

Splunk Delivers Real-Time Observability for Data Investigation & Monitoring

Splunk ingests 500M rows per minute and reduced storage 13.5×, enabling users to monitor, investigate, and act on their data in real time with far smaller footprint.

Learn More
Iron Source

IronSource Delivers Real-Time Visibility for Dashboard Performance

IronSource cut query latency from 35s to <5s while handling up to 1.5M events/sec, enabling non-SQL users to interactively explore streaming data via dashboards.

Learn More

Orb Delivers Real-Time Observability into Billing Operations

Orb scaled its billing platform with Kafka event streams, optimized query workloads, and saved 20–30 engineering hours/month while extending visibility into revenue reporting.

Learn More
Target

Target Delivers Real-Time Visibility for Front-Line Business Decisions

Target runs 4M queries per day for 70,000 users with 300–600 ms response times, empowering teams to explore data interactively and act on insights at the edge of operations.

Learn More

Roblox Delivers Real-Time Visibility Across Millions of Gaming Experiences

Roblox runs subsecond queries over high-cardinality gameplay data across 10M+ experiences and 86 TB ingested daily, giving creators fast, actionable insights without high cost or overhead.

Learn More
Yahoo

Yahoo Delivers Real-Time Visibility into User Behavior at Massive Scale

Yahoo processes 100B events daily across 10B sessions, managing 20+ PB of data and enabling both internal users and customers to access behavioral insights on demand.

Learn More
Walmart

Walmart Delivers Real-Time Visibility into Competitor Pricing

Walmart processes over 1B events daily with near subsecond query latency to monitor competitor pricing, enabling teams to respond instantly to market shifts.

Learn More
TrueCar

TrueCar Delivers Real-Time Observability with Dashboards & Anomaly Detection

TrueCar ingests 20M records daily across ~80 TB, delivers <1 ms query times, and provides anomaly detection dashboards with minimal engineering overhead.

Learn More
Rakuten

Rakuten Delivers Real-Time Visibility Across Event Streams

Rakuten supports hundreds of concurrent users and thousands of daily queries over millions of records per second, enabling near real-time insights across business units.

Learn More
GameAnalytics

GameAnalytics Delivers Real-Time Observability for Game Developers

GameAnalytics processes 25B events/day and 100,000 queries/hour with subsecond latency, giving developers real-time observability into player behavior and game performance.

Learn More

With Imply for Druid,
save time and money.

Imply is the easiest way to build with Druid through our cloud service and committer-driven expertise. For existing Apache Druid users, we can guarantee it.

Get started

Ready to decouple your observability stack?
No workflow changes. No migrations. More data, less spend.

Request a Demo