You're reading from Machine Learning with the Elastic Stack - Second Edition

Product typeBook

Published inMay 2021

Reading LevelBeginner

PublisherPackt

ISBN-139781801070034

Edition2nd Edition

Languages

Python

Tools

Elasticsearch

Concepts

Machine Learning

Authors (3):

Rich Collier

Camilla Montonen

Bahaaldine Azarmi

View More author details

Chapter 3: Anomaly Detection

Anomaly detection was the original capability of Elastic ML and is the most mature, stretching its roots back to the Prelert days (before the acquisition by Elastic in 2016). This technology is robust, easy to use, powerful, and broadly applicable to all kinds of use cases for time series data.

This jam-packed chapter will focus on using Elastic ML to detect anomalies in the occurrence rates of documents/events, rare occurrences of things, and numerical values outside of expected normal operation. We will run through some simple but effective examples that will highlight both the efficacy of Elastic ML and its ease of use.

Specifically, we will cover the following:

Elastic ML job types
Dissecting the detector
Detecting changes in event rates
Detecting changes in metric values
Understanding the advanced detector functions
Splitting analysis along categorical features
Understanding temporal versus population analysis

Technical requirements

The information in this chapter is based on the Elastic Stack as it exists in v7.10. As with all of the chapters, all the example code can be found on GitHub: https://github.com/PacktPublishing/Machine-Learning-with-Elastic-Stack-Second-Edition.

Elastic ML job types

When we start using the Elastic ML UI to configure anomaly detection jobs, we will see that there are five different job wizards that are shown:

Figure 3.1 – The Create job UI showing different configuration wizards

The existence of these different configuration wizards implies that there are different "types" of jobs. In actuality, there is really only one job type—it is just that the anomaly detection job has many options, and many of these wizards make certain aspects of that configuration easier. Everything that you may desire to configure can be done via the Advanced wizard (or the API). In fact, when Elastic ML was first released as beta in v5.4, that was all that existed. Since then, the other wizards have been added for simplicity and usability in specific use cases.

An anomaly detection job has many configuration settings, but the two most important ones are the analysis configuration and the datafeed...

Dissecting the detector

At the heart of the anomaly detection job are the analysis configuration and the detector. The detector has several key components to it:

The function
The field
The partition field
The by field
The over field

We will go through each in turn to fully understand them all. Note that in the next few sections, however, we will often refer to the actual names of settings within the job configuration as if we were using the advanced job editor or the API. Although it is good to fully understand the nomenclature, as you progress through this chapter you will also notice that many of the details of the job configuration are abstracted away from the user or are given more "UI-friendly" labels than the real setting names.

The function

The detector function describes how the data will be aggregated or measured within the analysis interval (bucket span). There are many functions, but they can be classified into the following...

Detecting changes in metric values

Obviously, not all data being emitted from systems will be text or categorical in nature—a vast amount of it is numerical. Detecting changes in metric values over time is perfectly suited for anomaly detection because, as mentioned in Chapter 1, Machine Learning for IT, the historical paradigm of alerting on exceptions in numerical values via static thresholds has been troublesome for decades. Let's explore all that Elastic ML has to offer with respect to the functions that help you detect changes in numerical fields in your data.

Metric functions

Metric functions operate on numerical fields and return numerical values. They are perhaps the easiest of the detector functions to understand.

min, max, mean, median, and metric

These functions do exactly as you would expect: they return the minimum, maximum, average/mean, and median of all of the numerical observations for the field of interest in the bucket span.

The metric...

Understanding the advanced detector functions

In addition to the detector functions mentioned so far, there are also a few other, more advanced functions that allow some very unique capabilities. Some of these functions are only available if the ML job is configured via the advanced job wizard or via the API.

rare

In the context of a stream of temporal information (such as a log file), the notion of something being statistically rare (occurring at a low frequency) is paradoxically both intuitive and hard to understand. If I were asked, for example, to trawl through a log file and find a rare message, I might be tempted to label the first novel message that I saw as a rare one. But what if practically every message was novel? Are they all rare? Or is nothing rare?

In order to define rarity to be useful in the context of a stream of events in time, we need to agree that the declaration of something as being rare must take into account the context in which it exists. If there...

Splitting analysis along categorical features

We have seen the power of anomaly detection jobs in uncovering interesting anomalies in a single time series dataset. However, there are a few mechanisms by which the analysis can be split along a categorical field to invoke a parallel analysis across tens, hundreds, and even multiple thousands of unique entities.

Setting the split field

When using some of the job wizards (such as the Multi-metric and Population wizards), you will see an option to split the analysis:

Figure 3.23 – Splitting on a categorical field

Here, in Figure 3.23, which uses the Multi-metric wizard to build a job against the kibana_sample_data_ecommerce index, we see that the high sum function on the taxful_total_price field is being split per instance on the field called category.keyword (plus turning the Sparse data option on). In other words, the analysis will be done for every category of items in this e-commerce store (men...

Understanding temporal versus population analysis

We learned back in Chapter 1, Machine Learning for IT, that there are effectively two ways to consider something as anomalous:

Whether or not something changes drastically with respect to its own behavior over time
Whether or not something is drastically different when compared to its peers in an otherwise homogeneous population

By default, the former (which we'll simply call temporal analysis) is the mode used unless the over_field_name setting is specified in the detector config.

Population analysis can be very useful in finding outliers in a variety of important use cases. For example, perhaps we want to find machines that are logging more (or less) than similarly configured machines in the following scenarios:

Incorrect configuration changes that have caused more errors to suddenly occur in the log file for the system or application.
A system that might be compromised by malware may actually...

Categorization analysis of unstructured messages

Imagine that you are troubleshooting a problem by looking at a particular log file. You see a line in the log that looks like the following:

   18/05/2020 15:16:00 DB Not Updated [Master] Table

Unless you have some intimate knowledge about the inner workings of the application that created this log, you may not know whether the message is important. Having the database be Not Updated possibly sounds like a negative situation. However, if you knew that the application routinely writes this message, day in and day out, several hundred times per hour, then you would naturally realize that this message is benign and should possibly be ignored, because clearly the application works fine every day despite this message being written to the log file.

The problem, obviously, is one of human interpretation. Inspection of the text of the message and the reading of a negative phrase (Not Updated) potentially biases a person...

Managing Elastic ML via the API

As with just about everything in the Elastic Stack, ML can also be completely automated via API calls—including job configuration, execution, and result gathering. Actually, all interactions you have in the Kibana UI leverage the ML API behind the scenes. You could, for example, completely write your own UI if there were specific workflows or visualizations that you wanted.

Note

For more in-depth information about the anomaly detection APIs, please refer to elastic.co/guide/en/machine-learning/current/ml-api-quickref.html. The data frame analytics part of Elastic ML has a completely separate API, which will be discussed in Chapters 9 to 13.

We won't go into each API call, but we would like to highlight some parts that are worth a detour.

The obvious first API to mention is the job creation API, which allows the creation of the ML job configuration. For example, if you wanted to recreate the population analysis job shown in Figure...

Summary

We've seen that Elastic ML can highlight variations in volume, diversity, and uniqueness in metrics and log messages, including those that need some categorization first. Also, we've shown that population analysis can be an extremely interesting alternative to temporal anomaly detection when the focus is more on finding the most unusual entities. These techniques help solve the challenges we described before, where a human might struggle to recognize what is truly unusual and worthy of attention and investigation.

The skills learned in this chapter will be helpful in subsequent chapters, where we will see how ML assists in the process of getting to the root cause of complex IT problems, identifying application performance slowdowns, or when ML can assist in the identification of malware and/or malicious activity.

In the next chapter, we'll see how the expressive time series models built by anomaly detection jobs can be leveraged to forecast trends of your...

The rest of the chapter is locked

You have been reading a chapter from

Machine Learning with the Elastic Stack - Second Edition

Published in: May 2021Publisher: PacktISBN-13: 9781801070034

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (3)

Rich Collier

Rich Collier is a solutions architect at Elastic. Joining the Elastic team from the Prelert acquisition, Rich has over 20 years' experience as a solutions architect and pre-sales systems engineer for software, hardware, and service-based solutions. Rich's technical specialties include big data analytics, machine learning, anomaly detection, threat detection, security operations, application performance management, web applications, and contact center technologies. Rich is based in Boston, Massachusetts.
Read more about Rich Collier

Camilla Montonen

Camilla Montonen is a Senior Machine Learning Engineer at Elastic.
Read more about Camilla Montonen

Bahaaldine Azarmi

Bahaaldine Azarmi, Global VP Customer Engineering at Elastic, guides companies as they leverage data architecture, distributed systems, machine learning, and generative AI. He leads the customer engineering team, focusing on cloud consumption, and is passionate about sharing knowledge to build and inspire a community skilled in AI.
Read more about Bahaaldine Azarmi

Other recommended products

Related to this chapter

Machine Learning with the Elastic Stack

Elastic has announced the integration of Prelert machine learning technology within its ecosystem allowing real-time generation of business insights from the Elasticsearch data without it leaving the cluster at all. This book will demonstrate these unique features and teach you to perform machine learning on the Elastic Stack without any hassle.

BookJan 2019304 pages

Learning Kibana 7

This book will introduce you to Kibana 7, and will show you how it fits into the Elastic stack. You will build a pure metric analytics architecture and visualize it using Timelion. You will also learn how to build relationships between documents using Graph visualization. You will also learn to build powerful Elastic dashboards using Kibana.

BookJul 2019280 pages

Mastering Kibana 6.x

Mastering Kibana 6.x provides a rundown explanation required for data visualization and analysis such as X-Pack features, Beats, and machine learning. You will be expert in creating analytics-driven visualizations from a web application. You will be a maestro in creating custom monitoring dashboard using Beats with various examples

BookJul 2018376 pages

Advanced Elasticsearch 7.0

Advanced Elasticsearch 7.0, will help the readers to leverage new features and Core APIs of Elasticsearch to perform advanced search operations. This book covers data modeling, aggregations, pipeline processing, and data Analytics using Elasticsearch

BookAug 2019560 pages

Threat Hunting with Elastic Stack

Elastic security offers enhanced threat hunting capabilities to build active defense strategies. Complete with practical examples and tips, this easy-to-follow guide will help you enhance your security skills by leveraging the Elastic Stack for security monitoring, incident response, intelligence analysis, or threat hunting.

BookJul 2021392 pages

Learning Kibana 5.0

BookFeb 2017284 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages