Packt+ | Advance your knowledge in tech

You're reading from ElasticSearch Cookbook

Product typeBook

Published inDec 2013

Reading LevelBeginner

PublisherPackt

ISBN-139781782166627

Edition1st Edition

Languages

Java

Tools

Elasticsearch

Concepts

Enterprise Search

Author (1)

Alberto Paro

Chapter 10. Java Integration

In this chapter, we will cover the following topics:

Creating an HTTP client
Creating a native client
Managing indices with the native client
Managing mappings
Managing documents
Managing bulk action
Creating a query
Executing a standard search
Executing a facet search
Executing a scroll/scan search

Introduction

ElasticSearch functionalities can be easily integrated in every Java application in several ways, both via REST API then native ones.

With the use of Java, it's easy to call a REST HTTP interface with one of the many libraries available, such as Apache HTTPComponents Client (http://hc.apache.org/). In this field, there is no library which is used the most; typically developers choose the library that best suits their taste or that they know very well.

Every JVM language can also use the Native protocol to integrate ElasticSearch in their products. The Native protocol, discussed in Chapter 1, Getting Started, is one of the fastest protocols available to communicate with ElasticSearch due to many factors, such as its binary nature, the fast native serializer/deserializer of the data, the asynchronous approach for communicating and the hop reduction (native client is able to communicate directly with the node that contains the data without executing a double hop needed in REST calls...

Creating an HTTP client

An HTTP Client is one of the easiest clients to create. It's very handy because it allows calling not only the internal methods as the Native protocol does, but also the third-party calls implemented in plugins that can be called only via HTTP.

Getting ready

You need a working ElasticSearch cluster and Maven installed. The code of this recipe is in the chapter_10/http_client directory present in the code bundle available on Packt's website.

How to do it...

For creating an HTTP client, we will perform the steps given as follows:

For these examples, we have chosen the Apache HttpComponents that is one of the most famous libraries to execute HTTP calls. This library is available in the main Maven repository search.Maven.org. To enable the compilation in your Maven pom.xml project, just add:
```
<dependency>
  <groupId>org.apache.httpcomponents</groupId>
  <artifactId>httpclient</artifactId>
  <version>4.3</version>
</dependency>
```
If...

Creating a native client

To create a native client to communicate with an ElasticSearch server, there are two ways:

Creating an embedded node (a node that doesn't contain data, but it works as arbiter) and getting the client from it. This node will appear in the cluster state nodes and it's able to use discovery capabilities of ElasticSearch to join the cluster (so no node address is required to connect to a cluster). This client is able to reduce the node routing due to knowledge of cluster topology.
Creating a transport client, which is a standard client that requires the address and port of nodes to connect.

In this recipe, we will see how to create these clients.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven.

The code of this recipe is in chapter_10/nativeclient in the code bundle of this book provided on Packt's website.

How to do it...

To create a native client, we will perform the steps given as follows:

Before starting, we must be sure that Maven loads...

Managing indices with the native client

In the previous recipe we have seen how to initialize a client to send calls to an ElasticSearch cluster. In this recipe, we will see how to manage indices via client calls.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven.

The code of this recipe is in chapter_10/nativeclient in the code bundle, which can be downloaded from Packt's website, and the referred class is IndicesOperations.

How to do it...

ElasticSearch client maps all indices operations under the admin.indices object of the client. Here, there are all the indices operation (create, delete, exists, open, close, optimize, and so on). In the following example, we will only see the most used calls on indices.

The following code retrieves a client and executes the main operation on indices:

import org.elasticsearch.action.admin.indices.exists.indices.IndicesExistsResponse;
import org.elasticsearch.client.Client;

public class IndicesOperations {
  private final Client...

Managing mappings

After creating an Index the next step is to add some mapping to it. We have already seen how to put a mapping via REST API in Chapter 4, Standard Operations. In this recipe, we will see how to manage mappings via native client.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven.

The code of this recipe is in chapter_10/nativeclient in the code bundle of this book, available on Packt's website, and the referred class is MappingsOperations.

How to do it...

In the following code, we add a mytype mapping to a myindex via native client:

importorg.elasticsearch.action.admin.indices.mapping.put.PutMappingResponse;
import org.elasticsearch.client.Client;
import org.elasticsearch.common.xcontent.XContentBuilder;

import java.io.IOException;

import static org.elasticsearch.common.xcontent.XContentFactory.jsonBuilder;

public class MappingOperations {

  public static void main( String[] args )
  {
    String index="mytest";
    String type="mytype";
  ...

Managing documents

The native APIs for managing document (index, delete, and update) are the most important after the search ones. In this recipe, we will see how to use them. In the next one we will evolve in executing bulk actions to improve performances.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven.

The code of this recipe is in chapter_10/nativeclient in the code bundle of this book available on Packt's website, and the referred class is DocumentOperations.

How to do it...

For managing documents, we will perform the steps given as follows:

We'll execute all the document with CRUD operations (CReate, Update, Delete) via native client:

import org.elasticsearch.action.delete.DeleteResponse;
import org.elasticsearch.action.get.GetResponse;
import org.elasticsearch.action.index.IndexResponse;
import org.elasticsearch.action.update.UpdateResponse;
import org.elasticsearch.client.Client;
import org.elasticsearch.common.xcontent.XContentFactory;

import java.io...

Managing bulk action

Executing atomic operation on items via single call is often a bottleneck if you need to index or delete thousands/millions of records: the best practice in this case is to execute a bulk action. We discussed bulk action via REST API in the Speeding up atomic operations (bulk) recipe in Chapter 4, Standard Operations.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven.

The code of this recipe is in chapter_10/nativeclient in the code bundle of this book available on Packt's website and the referred class is BulkOperations.

How to do it...

For managing a bulk action, we will perform the steps given as follows:

We'll execute a bulk action adding 1000 elements, updating them and deleting them:

import org.elasticsearch.action.bulk.BulkRequestBuilder;
import org.elasticsearch.client.Client;
import org.elasticsearch.common.xcontent.XContentFactory;

import java.io.IOException;

public class BulkOperations {
  public static void main( String[] args...

Creating a query

Before search, a query must be built: ElasticSearch provides several ways to build these queries. In this recipe, will see how to create a query object via QueryBuilder and via simple strings.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven. The code of this recipe is in chapter_10/nativeclient in the code bundle of this book available on Packt's website and the referred class is QueryCreation.

How to do it...

For creating a query, we will perform the steps given as follows:

There are several ways to define a query in ElasticSearch; they are interoperable.
Generally a query can be defined as a:
- QueryBuilder: This is a helper to build a query.
- XContentBuilder: This is a helper to create JSON code. We discussed it in the Managing mapping recipe in this chapter. The JSON code to be generated is similar to the previous REST, but converted in programmatic code.
- Array of bytes or string: In this case, it's usually the JSON to be executed as we have...

Executing a standard search

In the previous recipe, we saw how to build queries. In this recipe, we can execute this query to retrieve some documents.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven.

The code of this recipe is in chapter_10/nativeclient in the code bundle of this book available on Packt's website and the referred class is QueryExample.

How to do it...

For executing a standard query, we will perform the steps given as follows:

After having created a query, to execute it is enough using the prepareQuery call and pass to it your query object. Here, there is a complete example:

import org.elasticsearch.action.search.SearchResponse;
import org.elasticsearch.client.Client;
import org.elasticsearch.index.query.QueryBuilder;
import org.elasticsearch.search.SearchHit;
import static org.elasticsearch.index.query.FilterBuilders.*;
import static org.elasticsearch.index.query.QueryBuilders.*;

public class QueryExample {
  public static void main(String[]...

Executing a facet search

The previous recipe can be extended to support facet and to retrieve analytics on indexed data.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven.

The code of this recipe is in chapter_10/nativeclient in the code bundle of this book available on Packt's website and the referred class is FacetExample.

How to do it...

For executing a facet search, we will perform the steps given as follows:

We'll calculate two different facets (term and statistical):

import org.elasticsearch.action.search.SearchResponse;
import org.elasticsearch.client.Client;
import org.elasticsearch.search.facet.FacetBuilder;
import org.elasticsearch.search.facet.statistical.StatisticalFacet;
import org.elasticsearch.search.facet.terms.TermsFacet;

import static org.elasticsearch.index.query.QueryBuilders.*;
import static org.elasticsearch.search.facet.FacetBuilders.*;

public class FacetExample {
  public static void main(String[] args) {
    …
    Client client=qh.getClient...

Executing a scroll/scan search

The standard query works very well if you need to provide results in which documents do not change too often. Otherwise, doing pagination with live data brings a strange behavior to the returned results. To bypass this problem, ElasticSearch provides an extra parameter in the query: the scroll.

Getting ready

You need a working ElasticSearch cluster and a working copy of Maven.

The code of this recipe is in chapter_10/nativeclient in the code bundle of this book available on Packt's website and the referred class is ScrollScanQueryExample.

How to do it...

The search is done as in the previous recipe. The big difference is a setScroll timeout, which allows storing in memory the resultant IDs for a query for a defined timeout.

We can change the code of the previous recipe by using scroll in the following way:

import org.elasticsearch.action.search.SearchResponse;
import org.elasticsearch.action.search.SearchType;
import org.elasticsearch.client.Client;
import org.elasticsearch...

The rest of the chapter is locked

You have been reading a chapter from

ElasticSearch Cookbook

Published in: Dec 2013Publisher: PacktISBN-13: 9781782166627

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Author (1)

Alberto Paro

Alberto Paro is an engineer, manager, and software developer. He currently works as technology architecture delivery associate director of the Accenture Cloud First data and AI team in Italy. He loves to study emerging solutions and applications, mainly related to cloud and big data processing, NoSQL, Natural language processing (NLP), software development, and machine learning. In 2000, he graduated in computer science engineering from Politecnico di Milano. Then, he worked with many companies, mainly using Scala/Java and Python on knowledge management solutions and advanced data mining products, using state-of-the-art big data software. A lot of his time is spent teaching how to effectively use big data solutions, NoSQL data stores, and related technologies.
Read more about Alberto Paro

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages