The Imply Blog

We write about our product, technology, and company

Tag: how-to

Design on a dime: How we built a license-key generator using AWS serverless architecture

by Nicholas Lippis · in Solutions · September 29, 2020

Software Engineer Nicholas Lippis explains how his team developed a license-key generation and management service that our employees use to generate secure keys for Imply customers. A lambda (serverless) architecture proved to be the design breakthrough that balanced cost with functionality.

Tutorial: Add BGP Analytics to your Imply netflow analysis

by Eric Graham · Paolo Lucente, NTT · in Solutions · June 8, 2020

Imply is a real-time data platform for self-service analytics. It is well suited to high-performance analytics on event-driven data. One common use case is storing, analyzing, and visualizing different types of networking data (NetFlow v5/v9, sFlow, IPFIX, etc.).

Apache Druid Best Practices - Determining Worker Capacity (slots) for Automatic Compaction

by Venkatraman Poornalingam · in Solutions · May 22, 2020

In Apache Druid, compaction helps manage the segments of a given datasource. Using compaction, we can either merge smaller segments or split large segments to optimize segment size. The first option to consider is whether the segments can be generated at an optimal size in the first place. If that isn’t possible, compaction is required.

Behold the void: Measuring the performance impact of numeric column NULL checks in Apache Druid

by Clint Wylie · in Solutions · April 6, 2020

In this post, I am going to talk a bit about Apache Druid and a recently documented configuration option that enables true NULL values to be stored and queried for better SQL compatibility: druid.generic.useDefaultValueForNull=false. In the process, I will do a deep dive into how it relates to a small sliver of the query processing system as we explore the performance of this feature.

Hadoop Indexing for Apache Druid at Scale - Configuration Best Practices

by Rommel Garcia · in Solutions · July 12, 2019

When Hadoop is pushing data into Druid, Hadoop indexer performance is key, and it becomes challenging at scale. There are quite a few things to consider when running large-scale Hadoop indexing.

Clickstream Analysis - An Open Source Architecture

by Mike McLaughlin · Peter Marshall · in Solutions · June 12, 2019

A triad of open source projects - Divolte, Apache Kafka and Apache Druid - can power real-time collection, streaming and interactive visualisation of clickstreams, so you can investigate and explore what’s happening on your digital channels as easily as looking out of your office window.

Tutorial: Using Apache Druid and Imply With Google Cloud Dataproc For Hadoop Indexing

by Rommel Garcia · in Solutions · June 6, 2019

To help you get to know GCP and Druid, the tutorial below will walk you through how to install and configure Druid to work with Dataproc (GCP’s managed Hadoop offering) for Hadoop Indexing. It will then show you how to ingest and query data.

Tutorial: An End-to-end Streaming Analytics Stack for Juniper Streaming Telemetry

by Eric Graham · in Solutions · May 23, 2019

In this tutorial, we will step through how to set up Imply, Kafka, and Open-NTI to build an end-to-end streaming analytics stack that can handle Juniper Native streaming telemetry data.

Tutorial: An End-to-end Streaming Analytics Stack for syslog Data

by Eric Graham · in Solutions · April 18, 2019

In this tutorial, we will step through how to set up Imply, Kafka, and syslog-ng kafka to build an end-to-end streaming analytics stack that can handle many different forms of log data.

Tutorial: An End-to-end Streaming Analytics Stack for Network Telemetry Data

by Eric Graham · in Solutions · March 26, 2019

In this tutorial, we will step through how to set up Imply, Kafka, and pmacct to build an end-to-end streaming analytics stack that can handle many different forms of networking data.

How to analyze AWS VPC logs with Imply

by Eric Graham · in Solutions · March 14, 2019

Have you ever wanted more visibility into your AWS network traffic? This how-to blog covers how to analyze VPC flow logs with Imply.

Imply lookups for enhanced network flow visibility

by Eric Graham · in Solutions · November 26, 2018

Within Druid there are multiple ways to enhance visibility for existing network flow records. This how-to blog covers one way to do this using Druid lookup tables.

Who is knocking on our door? Analyzing AWS Netflows

by Vadim Ogievetsky · in Solutions · June 27, 2018

We ingested our internal AWS VPC netflows into Imply and found something surprising.
