You're reading from Elasticsearch 8.x Cookbook - Fifth Edition

Product type Book

Published in May 2022

Publisher Packt

ISBN-13 9781801079815

Pages 750 pages

Edition 5th Edition

Languages

Concepts

Enterprise Search

Author (1):

Alberto Paro

Table of Contents (20) Chapters

Preface

Chapter 1: Getting Started

Chapter 2: Managing Mappings

Chapter 3: Basic Operations

Chapter 4: Exploring Search Capabilities

Chapter 5: Text and Numeric Queries

Chapter 6: Relationships and Geo Queries

Chapter 7: Aggregations

Chapter 8: Scripting in Elasticsearch

Chapter 9: Managing Clusters

Chapter 10: Backups and Restoring Data

Chapter 11: User Interfaces

Chapter 12: Using the Ingest Module

Chapter 13: Java Integration

Chapter 14: Scala Integration

Chapter 15: Python Integration

Chapter 16: Plugin Development

Chapter 17: Big Data Integration

Chapter 18: X-Pack

Other Books You May Enjoy

Using the Histogram field type

Histograms are a common data type for analytics and machine learning analysis. We can store Histograms in the form of values and counts; they are not indexed, but they can be used in aggregations.

The histogram field type is a special mapping that's available in X-Pack that is commonly used to store the results of Histogram aggregations in Elasticsearch for further processing, such as to compare the aggregation results at different times.

Getting ready

You will need an up-and-running Elasticsearch installation, as described in the Downloading and installing Elasticsearch recipe of Chapter 1, Getting Started.

To execute the commands in this recipe, you can use any HTTP client, such as curl (https://curl.haxx.se/), Postman (https://www.getpostman.com/), or similar. I suggest using the Kibana console, which provides code completion and better character escaping for Elasticsearch.

How to do it…

In this recipe, we will simulate a common use case of Histogram data that is stored in Elasticsearch. Here, we will use a Histogram that specifies the millimeters of rain divided by year for our advanced analytics solution. To achieve this, follow these steps:

First, let's create an index for the Histogram by using the following mapping:

PUT test-histo
{ "mappings": {
    "properties": {
      "histogram": { "type": "histogram" },
      "model": { "type": "keyword" } } } }

Now, we can store a document to test the mapping:

POST test-histo/_doc/1
{ "model":"show_level", "histogram" : { "values" : [2016, 2017, 2018, 2019, 2020, 2021],  "counts" : [283, 337, 323, 312, 236, 232] } }

How it works…

The histogram field type specializes in storing Histogram data. I must be provided as a JSON object composed of the values and counts fields with the same cardinality of items. The only supported aggregations are the following ones. We will look at these in more detail in Chapter 7, Aggregations:

Metric aggregations such as min, max, sum, value_count, and avg
The percentiles and percentile_ranks aggregations
The boxplot aggregation
The histogram aggregation

The data is not indexed, but you can also check the existence of a document by populating this field with the exist query.

You're reading from Elasticsearch 8.x Cookbook - Fifth Edition

Table of Contents (20) Chapters

Using the Histogram field type

Getting ready

How to do it…

How it works…

See also

Authors (1)

Personalised recommendations for you