You're reading from Elasticsearch 8.x Cookbook - Fifth Edition

Product typeBook

Published inMay 2022

PublisherPackt

ISBN-139781801079815

Edition5th Edition

Tools

Elasticsearch Elasticsearch

Concepts

Enterprise Search

Author (1)

Alberto Paro

Chapter 13: Java Integration

Elasticsearch functionalities can be easily integrated into any Java application in a couple of ways, via a REST API or via native APIs. In Java, it's easy to call a REST HTTP interface with one of the many libraries available, such as the Apache HttpComponents client (see http://hc.apache.org/ for more information). In this field, there's no such thing as the most used library; typically, developers choose the library that best suits their preferences or one that they know very well. From Elasticsearch 6.x onward, Elastic has provided a battle low-/high-level HTTP for clients to use. In version 8.x, Elastic released a modern/functional/strongly typed client and in this chapter, we will mainly use this version for all the examples that are provided.

Each Java Virtual Machine (JVM) language can also use the native protocol to integrate Elasticsearch with their applications; however, we will not cover this because it has fallen out of use from...

Creating a standard Java HTTP client

An HTTP client is one of the easiest clients to create. It's very handy because it allows for the calling, not only of the internal methods as the native protocol does, but also of third-party calls implemented in plugins that can only be called via HTTP.

Getting ready

You need an up-and-running Elasticsearch installation, as we described in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

To correctly execute the following commands, you will need an index populated with the ch04/populate_kibana.txt commands that are available in the online code.

A Maven tool or an Integrated Development Environment (IDE) that natively supports it for Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed. Elasticsearch code is targeting Java 17, so it's best practice to have installed JDK 17 or above.

The code for this recipe is in the chapter_13/http_java_client directory...

Creating a low-level Elasticsearch client

There are two official Elasticsearch clients: the low-level one and the new typed one available from Elasticsearch 8.x (https://github.com/elastic/elasticsearch-java). The low-level one is used to communicate with Elasticsearch, and its main features are as follows:

Minimal dependencies
Load balancing across all available nodes
Failover in the case of node failures and upon specific response codes
Failed connection penalization (whether a failed node is retried depends on how many consecutive times it failed; the more failed attempts, the longer the client will wait before trying that same node again)
Persistent connections
Trace logging of requests and responses
Optional automatic discovery of cluster nodes

Getting ready

You need an up-and-running Elasticsearch installation, which can be obtained as described in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

To...

Using the Elasticsearch official Java client

The official Java client is built on top of a low-level one and provides strong typed communication with Elasticsearch.

Initially released with Elasticsearch in the latest versions of 7.15 or above, this client is the official Java client for Elasticsearch 8.x. It provides many extra functionalities, such as the following:

Integration from application classes to JSON instances via an object mapper such as Jackson
Request/response marshaling/unmarshaling that provides stronger typed programming
Support for both synchronous and asynchronous calls
Use of fluent builders and functional patterns to allow writing concise yet readable code when creating complex nested structures
Built on top of previous low-level client

Getting ready

You need an up-and-running Elasticsearch installation, as we described in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

To correctly execute...

Managing indices

In the previous recipe, we learned how to initialize a client to send calls to an Elasticsearch cluster. In this recipe, we will learn how to manage indices via client calls.

Getting ready

You need an up-and-running Elasticsearch installation, which we described how to get in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool or an IDE that natively supports it for Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code for this recipe is in the ch13/elasticsearch-java-client directory and the referred class is IndicesOperations.

How to do it...

An Elasticsearch client maps all index operations under the indices object of the client, such as create, delete, exists, open, close, and optimize. The following steps retrieve a client and execute the main operations on the indices:

First, we import the required classes, as shown in the following code:
```
import...
```

Managing mappings

After creating an index, the next step is to add some mappings to it. We have already seen how to add a mapping via the REST API in Chapter 3, Basic Operations. In this recipe, we will look at how to manage mappings via a native client.

Getting ready

You need an up-and-running Elasticsearch installation, which we described how to get in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool or an IDE that natively supports it for Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code for this recipe is in the ch13/elasticsearch-java-client directory and the referred class is MappingOperations.

How to do it...

In the following steps, we add a mapping to a myindex index via the native client:

Import the required classes using the following code:

import co.elastic.clients.elasticsearch.ElasticsearchClient;
import java.io.IOException;
import java.security.KeyManagementException...

Managing documents

The native APIs for managing documents (index, delete, and update) are the most important after the search APIs. In this recipe, we will learn how to use them. In the next recipe, we will proceed to bulk actions to improve performance.

Getting ready

You need an up-and-running Elasticsearch installation, which we described how to get in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool, or an IDE that natively supports it for Java programming such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code for this recipe is in the ch13/elasticsearch-java-client directory and the referred class is DocumentOperations.

How to do it...

For managing documents, we will perform the following steps:

We'll need to import the required classes to execute all the document CRUD operations via the high-level client, as follows:
```
import co.elastic.clients.elasticsearch.ElasticsearchClient...
```

Managing bulk actions

Executing automatic operations on items via a single call will often be the cause of a bottleneck if you need to index or delete thousands/millions of records. The best practice, in this case, is to execute a bulk action.

We have discussed bulk actions via the REST API in the Speeding up atomic operations (bulk) recipe in Chapter 3, Basic Operations.

Getting ready

You need an up-and-running Elasticsearch installation, which you can get using the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool or an IDE that natively supports it for Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code of this recipe is in the ch13/elasticsearch-java-client directory and the referred class is BulkOperations.

How to do it...

To manage a bulk action, we will perform these steps:

We'll need to import the required classes to execute bulk actions via the high...

Building a query

Before a search, a query must be built. Elasticsearch provides several ways to build these queries. In this recipe, we will learn how to create a query object via QueryBuilder and simple strings.

Getting ready

You need an up-and-running Elasticsearch installation, which you can get as described in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool or an IDE that natively supports it for Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code for this recipe is in the ch13/elasticsearch-java-client directory and the referred class is QueryCreation.

How to do it...

To create a query, we will perform the following steps:

We need to import SearchRequest using the following code:

import co.elastic.clients.elasticsearch.core.SearchRequest;

Next, we'll create a query using SearchRequest, as follows:

SearchRequest searchRequest = new SearchRequest.Builder...

Executing a standard search

In the previous recipe, we learned how to build queries. In this recipe, we will execute a query to retrieve some documents.

Getting ready

You need an up-and-running Elasticsearch installation, as we described in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool or an IDE that natively supports it for Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code for this recipe is in the ch13/elasticsearch-java-client directory and the referred class is the QueryExample.

How to do it...

To execute a standard query, we will perform the following steps:

We need to import SearchRequest.QueryBuilder to create the query, as follows:
```
import co.elastic.clients.elasticsearch.core.SearchRequest;
```

We can create an index and populate it with some data, as follows:

String index = "mytest";
QueryHelper qh = new QueryHelper();
qh.populateData(index...

Executing a search with aggregations

The previous recipe can be extended to support aggregations in order to retrieve analytics on indexed data.

Getting ready

You need an up-and-running Elasticsearch installation, which you can get as described in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool or an IDE that natively supports it for Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code for this recipe is in the ch13/elasticsearch-java-client directory, and the referred class is AggregationExample.

How to do it...

To execute a search with aggregations, we will perform the following steps:

We need to import the necessary classes for the aggregations using the following code:

import co.elastic.clients.elasticsearch.ElasticsearchClient;
import co.elastic.clients.elasticsearch._types.aggregations.StringTermsAggregate;
import co.elastic.clients.elasticsearch._types.aggregations...

Executing a scroll search

Pagination with a standard query works very well if you are matching documents with documents that do not change too often; otherwise, performing pagination with live data returns unpredictable results. To bypass this problem, Elasticsearch provides an extra parameter in the query: scroll.

Getting ready

You need an up-and-running Elasticsearch installation, which you can get as described in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool, or an IDE that natively supports it for Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code for this recipe is in the ch13/elasticsearch-java-client directory and the referred class is ScrollQueryExample.

How to do it...

The search is done as it was shown in the Execute a standard search recipe in Chapter 4, Exploring Search Capabilities. The main difference is the use of a scroll timeout, which allows the resulting...

Integrating with DeepLearning4j

DeepLearning4J (DL4J) is one of the most used open source libraries in machine learning. It can be found at https://deeplearning4j.org/.

The best description for this library is available on its website, which says—Deeplearning4j is the first commercial-grade, open-source, distributed deep learning library written for Java and Scala. Integrated with Hadoop and Apache Spark, DL4J brings AI to business environments for use on distributed GPUs and CPUs.

In this recipe, we will see how it's possible to use Elasticsearch as a source for data to be trained in a machine learning algorithm.

Getting ready

You need an up-and-running Elasticsearch installation, as we described in the Downloading and installing Elasticsearch recipe in Chapter 1, Getting Started.

A Maven tool or an IDE that natively supports Java programming, such as Visual Studio Code, Eclipse, or IntelliJ IDEA, must be installed.

The code for this recipe is in the...

The rest of the chapter is locked

You have been reading a chapter from

Elasticsearch 8.x Cookbook - Fifth Edition

Published in: May 2022Publisher: PacktISBN-13: 9781801079815

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Author (1)

Alberto Paro

Alberto Paro is an engineer, manager, and software developer. He currently works as technology architecture delivery associate director of the Accenture Cloud First data and AI team in Italy. He loves to study emerging solutions and applications, mainly related to cloud and big data processing, NoSQL, Natural language processing (NLP), software development, and machine learning. In 2000, he graduated in computer science engineering from Politecnico di Milano. Then, he worked with many companies, mainly using Scala/Java and Python on knowledge management solutions and advanced data mining products, using state-of-the-art big data software. A lot of his time is spent teaching how to effectively use big data solutions, NoSQL data stores, and related technologies.
Read more about Alberto Paro

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages