You're reading from Machine Learning with the Elastic Stack - Second Edition

Product typeBook

Published inMay 2021

Reading LevelBeginner

PublisherPackt

ISBN-139781801070034

Edition2nd Edition

Languages

Python

Tools

Elasticsearch

Concepts

Machine Learning

Authors (3):

Rich Collier

Camilla Montonen

Bahaaldine Azarmi

View More author details

Chapter 4: Forecasting

Forecasting is a natural extension of the time series modeling of Elastic ML. Since very expressive models are built behind the scenes and describe how data has behaved historically, it is therefore possible to project that information forward in time and predict how something should behave at a future time.

We will spend time learning the concepts behind forecasting, as well as stepping through some practical examples.

Specifically, this chapter will cover the following topics:

Contrasting forecasting with prophesying
Forecasting use cases
Forecasting theory of operation
Single time series forecasting
Looking at forecasting results
Multiple time series forecasting

Technical requirements

The information and examples demonstrated in this chapter are relevant as of v7.11 of the Elastic Stack.

Contrasting forecasting with prophesying

Past performance is not indicative of future results. This disclaimer is used by financial companies when they reference the performance of products such as mutual funds. But this disclaimer is a bit of an odd contradiction, because the past is all that we have to work with. If the companies that comprise the mutual fund have had consistently positive quarterly results for the last eight quarters straight, does that guarantee that they will also have a positive set of results for the next eight quarters and that their public valuation will continue to rise? Probability could be on the side of that being the case, but that might not be the whole story. And, before we get too wishful in thinking that Elastic ML’s ability to forecast is our key to making a fortune in the stock market, we should be realistic about one key caveat—there are always uncontrollable factors.

The reason financial companies use the preceding disclaimer...

Forecasting use cases

In the context of Elastic ML, there are really just two—somewhat similar—use cases in which someone would use forecasting. These are outlined here:

Value-focused: This is where you extrapolate a time series into the future to understand a probable future value. This would be akin to answering questions such as: “How many widgets will I sell per day 2 months from now?”
Time-focused: This is where you understand the likely time at which an expected value is to be reached. This would answer questions similar to: “Do I expect to reach 80% utilization in the next week?”

The differences between these two use cases might not just be how a question is asked (how the data is searched), but also how you interpret the output. However, before we delve into a few examples of how to use the forecasting feature, let’s take a little time to discuss how it works logistically.

Forecasting theory of operation

The first thing to realize is that the act of invoking a forecast on data is that it is an extension of an existing Anomaly Detection job. In other words, you need to have an Anomaly Detection job configured, and that job needs to have analyzed historical data before you can forecast on that data. This is because the forecasting process uses the models that are created by the Anomaly Detection job. To forecast the data, you need to follow the same steps that were used to create an Anomaly Detection job as described in other chapters. If anomalies were generated by the execution of that job, you can disregard them if your only purpose is to execute forecasting. Once the job has learned on some historical data, the model or models (if the job is configured to analyze data from more than one time series) associated with that job are current and up to date, as represented in the following diagram:

Figure 4.1 – A symbolic representation...

Single time series forecasting

To illustrate the procedure of forecasting, we will start with a dataset that is a single time series. While this dataset is generic, you could imagine that it could represent a system performance metric, the number of transactions processed by a system, or even sales revenue data. The important aspect of this dataset is that it contains several distinct time-based trends—a daily trend, a weekly trend, and an overall increasing trend. Elastic ML will discover all three trends and will effectively predict those into the future. It is good to note that the dataset also contains some anomalies, but (of course) future anomalies cannot be predicted as they are surprise events by definition. Since our discussion here is purely focused on forecasting, we will ignore the existence of any anomalies found in our dataset while building the models for forecasting.

With that said, let’s jump into an example by using the forecast_example dataset from...

Looking at forecast results

Now that we have run a forecast, we can look in more depth at the results that are generated by the forecasting process. We can view the results of a previously created forecast at any time in the UI via one of two methods. The first way is to click the Forecast button in Single Metric Viewer to reveal a list of previous forecasts, like so:

Figure 4.20 – Viewing previously created forecasts from Single Metric Viewer

Alternatively, you can view them in the Job Management page under the Forecasts tab for that job, as illustrated in the following screenshot:

Figure 4.21 – Viewing previously created forecasts from the Job Management page

Note

Forecast results built in Kibana have a default lifespan of 14 days. After that, the forecast results are deleted permanently. If a different expiration duration is desired, then the forecast will have to be invoked via the _forecast API endpoint, which...

Multiple time series forecasting

To invoke forecasting on multiple time series, you simply just need an ML job that is modeling multiple time series. Let’s assume that we have an ML job that has analyzed web requests per country. In fact, using the built-in sample web logs (kibana_sample_data_logs) we used in Chapter 3, Anomaly Detection, we could easily create a multi-metric job that counts events, split on the source country code of the request (the field is called geo.src), as illustrated in the following screenshot:

Figure 4.26 – Creating a multi-metric job for forecasting

There are 183 unique source countries in this dataset. After creating and running this Anomaly Detection job in order to build baseline models for all 183 countries, we are now in a position to invoke a forecast. If we approach the invocation of a forecast in the same way as we did before (via Single Metric Viewer), we might erroneously think that a forecast will only be...

Summary

Elastic ML has an additional feature over and above anomaly detection: the ability to take and extrapolate time series models into the future for forecasting purposes. With use cases that include advanced breach detection and capacity planning, this feature alleviates the human burden of manually charting, tracking, and predicting where things are going in the future, based upon how they have behaved in the past.

In the next chapter, we’ll go deeper into the results that anomaly detection and forecasting give us, and we will set up a better understanding of how to leverage those results for dashboards and proactive alerts.

The rest of the chapter is locked

You have been reading a chapter from

Machine Learning with the Elastic Stack - Second Edition

Published in: May 2021Publisher: PacktISBN-13: 9781801070034

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (3)

Rich Collier

Rich Collier is a solutions architect at Elastic. Joining the Elastic team from the Prelert acquisition, Rich has over 20 years' experience as a solutions architect and pre-sales systems engineer for software, hardware, and service-based solutions. Rich's technical specialties include big data analytics, machine learning, anomaly detection, threat detection, security operations, application performance management, web applications, and contact center technologies. Rich is based in Boston, Massachusetts.
Read more about Rich Collier

Camilla Montonen

Camilla Montonen is a Senior Machine Learning Engineer at Elastic.
Read more about Camilla Montonen

Bahaaldine Azarmi

Bahaaldine Azarmi, Global VP Customer Engineering at Elastic, guides companies as they leverage data architecture, distributed systems, machine learning, and generative AI. He leads the customer engineering team, focusing on cloud consumption, and is passionate about sharing knowledge to build and inspire a community skilled in AI.
Read more about Bahaaldine Azarmi

Other recommended products

Related to this chapter

Machine Learning with the Elastic Stack

Elastic has announced the integration of Prelert machine learning technology within its ecosystem allowing real-time generation of business insights from the Elasticsearch data without it leaving the cluster at all. This book will demonstrate these unique features and teach you to perform machine learning on the Elastic Stack without any hassle.

BookJan 2019304 pages

Learning Kibana 7

This book will introduce you to Kibana 7, and will show you how it fits into the Elastic stack. You will build a pure metric analytics architecture and visualize it using Timelion. You will also learn how to build relationships between documents using Graph visualization. You will also learn to build powerful Elastic dashboards using Kibana.

BookJul 2019280 pages

Mastering Kibana 6.x

Mastering Kibana 6.x provides a rundown explanation required for data visualization and analysis such as X-Pack features, Beats, and machine learning. You will be expert in creating analytics-driven visualizations from a web application. You will be a maestro in creating custom monitoring dashboard using Beats with various examples

BookJul 2018376 pages

Advanced Elasticsearch 7.0

Advanced Elasticsearch 7.0, will help the readers to leverage new features and Core APIs of Elasticsearch to perform advanced search operations. This book covers data modeling, aggregations, pipeline processing, and data Analytics using Elasticsearch

BookAug 2019560 pages

Threat Hunting with Elastic Stack

Elastic security offers enhanced threat hunting capabilities to build active defense strategies. Complete with practical examples and tips, this easy-to-follow guide will help you enhance your security skills by leveraging the Elastic Stack for security monitoring, incident response, intelligence analysis, or threat hunting.

BookJul 2021392 pages

Learning Kibana 5.0

BookFeb 2017284 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages