Ingestion from Confluent Cloud and Kafka in Polaris
Apr 20, 2023
Timmy Freese
One of the foundational pillars of real-time data analytics is the ability to move data from one place to another quickly; Apache Kafka® is a widely adopted solution for this functionality. Confluent Cloud provides a fully-managed solution for Kafka. Whether you use Confluent Cloud, self-hosted Kafka, or alternative hosting platforms such as AWS MSK or Aiven MAK, Imply Polaris offers a solution for you to ingest data.
For Confluent Cloud users, Polaris offers a native, pull-based ingestion capability. Users only need to specify a few fields to establish a connection to their Confluent Cloud account and can ingest data in no time—every event enters the database and is queryable with sub-second latency. Additionally, this method offers exactly-once semantics and infers the schema from available data using Druid’s sampler.
For self-hosted and other Kafka solutions, Polaris offers an HTTP-based Kafka Connector which uses our Push API endpoint. This connector will automatically batch and compress your data and handles auth token renewal for you. Additionally, this solution can be used in tandem with Polaris’s PrivateLink offering to provide enhanced security.
Regardless of which solution you choose, you can expect high throughput and low latency results. We provide OLAP functionality with OLTP performance.
A First Look at Lumi Loglake: Query Logs Where They Live
At Databricks Data + AI Summit, we will preview Imply Lumi Loglake, a new step toward a more decoupled model for observability and machine data. The idea is simple: Point Lumi at your logs. Start querying....
Imply Lumi Major Release Preview: Continuing the Journey Towards Decoupled Observability/SIEM
We are getting ready to introduce the next major expansion of Imply Lumi and the observability warehouse. When we introduced the industry’s first observability warehouse, the goal was clear: decouple the...
Imply Lumi's Grafana Loki integration is now in Private Preview. The same logs you've loaded into Lumi for Splunk are now queryable natively in Grafana using LogQL with no second pipeline, no duplicate storage,...