Chapter 8. Elasticsearch APIs

In Chapter 2, Stepping into Elasticsearch, we learned about the underlying technology, how Elasticsearch works, and the APIs it offers. In Chapter 4, Kibana Interface, we learned how to use the Console and worked with aggregations in Kibana.

This chapter covers the remaining APIs, and we will use the Console to send API requests to Elasticsearch. We'll cover the following topics:

  • Cluster APIs

  • Cat APIs

  • Modules

  • Ingest nodes

  • Elasticsearch clients

  • Java APIs

For a quick refresher on the Console, refer to the Exploring dev tools section in Chapter 4, Kibana Interface.

Note

For development and learning purposes, we assume that Kibana is installed locally.

The cluster APIs


These APIs give us information about the cluster state, health, and statistics, as well as node statistics, node information, and so on.

Cluster health

To check cluster health, we can use the _cluster/health endpoint, as shown in the following example:

GET /_cluster/health/library

Here, GET is the verb, _cluster/health is the endpoint, and library is the index. The call returns information about nodes, data nodes, shards, pending tasks, and the status of the specified index, or of the whole cluster if no index is given:

The Response pane shows the status of the Elasticsearch cluster as green, along with other values such as node counts, shards, and pending tasks.
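For reference, the JSON behind such a response has roughly the following shape (the values here are illustrative and will differ on your cluster):

{
  "cluster_name": "elasticsearch",
  "status": "green",
  "timed_out": false,
  "number_of_nodes": 2,
  "number_of_data_nodes": 2,
  "active_primary_shards": 5,
  "active_shards": 10,
  "relocating_shards": 0,
  "initializing_shards": 0,
  "unassigned_shards": 0,
  "delayed_unassigned_shards": 0,
  "number_of_pending_tasks": 0,
  "number_of_in_flight_fetch": 0,
  "task_max_waiting_in_queue_millis": 0,
  "active_shards_percent_as_number": 100.0
}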

Let's have a look at what other values of the Elasticsearch status denote:

  • Red: One or more primary shards are not allocated

  • Yellow: All primary shards are allocated, but one or more replica shards are not

  • Green: All primary and replica shards are allocated and the cluster is fully operational
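These statuses are also handy for readiness checks: the health endpoint accepts wait_for_status and timeout parameters, so a request blocks until the desired status is reached or the timeout expires. For example:

GET /_cluster/health?wait_for_status=yellow&timeout=30s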

Tip

If replicas...

The cat APIs


These APIs print information about nodes, indices, fields, tasks, and plugins in a human-readable, tabular format rather than JSON, which makes the output easy to read directly in the Console.

All of these commands are used with the GET verb (via curl or the Console). By default, the commands list only the data, without headers. To print headers, we can add v to the query parameters:

GET /_cat/health?v

The preceding command prints the column headers, unlike the plain form:

GET /_cat/health

We can also specify which headers to show by supplying the comma-separated values for the h query parameter.
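For instance, to print just the cluster name, status, and total node count from the health endpoint (a hedged example; the column choice here is arbitrary):

GET /_cat/health?v&h=cluster,status,node.total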

Let's see the endpoints available to operate on:

  • _cat/indices: This shows data about indices such as health, status, index name, primary and replica shard counts, document count, and store size:

            GET /_cat/indices?v&h=health,status,index,docs.count,store.size
    

Here is what it looks like in the Console:

As we can see, it shows health and stats for each index.

  • _cat/master: This shows the node ID, IP address, and node name of the...

Elasticsearch modules


Every great project has a number of modules to support what it offers, and Elasticsearch is no exception. These modules are configured through settings that are either static, set in elasticsearch.yml, or dynamic, updated at runtime using the cluster API. Let's look at the different modules.

Cluster module

The cluster module decides how shards are allocated to nodes and takes care of moving shards around to keep the cluster balanced. This process is known as shard allocation. The module has a number of settings, which can be applied dynamically using the cluster API; they control shard allocation among the nodes in a cluster as well as within a node.
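As a brief illustration (the value shown is only an example), such a setting can be applied at runtime through the cluster settings API; a transient setting lasts until a full cluster restart, while a persistent one survives restarts:

PUT /_cluster/settings
{
  "transient": {
    "cluster.routing.allocation.enable": "none"
  }
}

Setting the value back to "all" re-enables shard allocation.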

Discovery module

The discovery module discovers the other nodes on the network that belong to a specified cluster. In the elasticsearch.yml configuration file, the cluster name setting decides which cluster a node will be part of. The default name is elasticsearch:

cluster.name ...
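A minimal elasticsearch.yml sketch of the discovery-related settings might look like the following (the cluster name and hosts are placeholders, not values from this book):

cluster.name: my-cluster
discovery.zen.ping.unicast.hosts: ["node1.example.com", "node2.example.com"]
discovery.zen.minimum_master_nodes: 2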

Ingest nodes


As we learned in the previous section, ingest nodes preprocess documents before they are indexed. For bulk requests and index operations, the ingest node intercepts the request and applies the required processing to each document. One example of such a processor is the date processor, which parses dates in fields. Another is the convert processor, which converts a field value to a target type, for example, a string to an integer. The full list of available processors is documented at https://www.elastic.co/guide/en/elasticsearch/reference/5.1/ingest-processors.html.
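As a hedged sketch (the pipeline name and field names below are invented for illustration), a pipeline combining these two processors can be registered through the ingest API as follows:

PUT /_ingest/pipeline/parse-and-convert
{
  "description": "Parse a date field and convert a string field to an integer",
  "processors": [
    {
      "date": {
        "field": "release_date",
        "formats": ["yyyy-MM-dd"]
      }
    },
    {
      "convert": {
        "field": "rating",
        "type": "integer"
      }
    }
  ]
}

Documents indexed with ?pipeline=parse-and-convert on the request will then pass through these processors before being stored.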

These nodes are helpful when heavy preprocessing is needed and we do not want data or master nodes to spend resources on it. Dedicated ingest nodes can reduce that load significantly, and it is best to set node.ingest to false on data and master nodes.
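For example, a dedicated ingest node can be declared in its elasticsearch.yml roughly as follows (a sketch; adapt the roles to your topology):

node.master: false
node.data: false
node.ingest: true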

To understand how ingest nodes work with pipelines, let's follow these steps. Let's take the example of our library index and type movies added...

Elasticsearch clients


Earlier in this chapter, we learned that Elasticsearch nodes support both the transport and HTTP protocols. Thanks to this flexibility, Elasticsearch can be managed by clients written in other programming languages. There are a number of clients for performing operations on clusters and nodes; they can connect to a node or cluster to manage indices, operate on documents, and run searches.

Supported clients

A few clients are officially supported by Elastic. These clients are essentially APIs that you can use from your own applications written in the respective programming languages. For example, if you are developing a Java web application that integrates with Elasticsearch and you want to offer index management through an admin panel, you can use the Java API to connect to the cluster and nodes and perform the necessary operations. The following is a list of all the officially supported clients:

  • Java API

  • JavaScript...

Java API


The Java client uses the transport layer and supports all kinds of operations: we can run searches, index documents, and delete or get documents, as well as perform administrative tasks on the cluster. We can also perform operations in bulk.

To use the Java API in our application, we need a few JAR files as dependencies. For a Maven project, we can add the dependency to our pom.xml as follows:

<dependency> 
        <groupId>org.elasticsearch</groupId> 
        <artifactId>elasticsearch</artifactId> 
        <version>${elasticsearch.version}</version> 
</dependency> 

To include the JAR files directly in the project, we can also download them from the repository at https://repo.maven.apache.org/maven2/org/elasticsearch/elasticsearch, selecting the version we want for our application.

One thing to note here is that the client version should be the same as the version of Elasticsearch being used. For example, if...
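To connect the Maven setup above to working code, here is a minimal, hedged sketch of the transport client in action with Elasticsearch 5.x; note that the client classes ship in the separate org.elasticsearch.client:transport artifact, and the cluster name, index, type, and document ID below are placeholders:

import java.net.InetAddress;

import org.elasticsearch.action.get.GetResponse;
import org.elasticsearch.client.transport.TransportClient;
import org.elasticsearch.common.settings.Settings;
import org.elasticsearch.common.transport.InetSocketTransportAddress;
import org.elasticsearch.transport.client.PreBuiltTransportClient;

public class EsJavaApiSketch {
    public static void main(String[] args) throws Exception {
        // The cluster name must match cluster.name of the target cluster
        Settings settings = Settings.builder()
                .put("cluster.name", "elasticsearch")
                .build();

        // The transport client talks to port 9300 (transport), not 9200 (HTTP)
        TransportClient client = new PreBuiltTransportClient(settings)
                .addTransportAddress(new InetSocketTransportAddress(
                        InetAddress.getByName("localhost"), 9300));

        // Fetch a single document; index, type, and id are placeholders
        GetResponse response = client.prepareGet("library", "books", "1").get();
        System.out.println(response.getSourceAsString());

        client.close();
    }
}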

Elasticsearch plugins


As we learned in the Extending Elasticsearch section of Chapter 7, Customizing Elastic Stack, earlier versions of Elasticsearch (before 5.x) offered a number of plugins divided into three types: Java, Site, and Mixed. Site and Mixed plugins are now deprecated, and only Java plugins are supported. These plugins contain only JAR files and must be installed on every node.

Elastic.co categorizes plugins as core plugins, which are developed and maintained officially, and community plugins, which are developed and maintained by the community. To use a plugin, we install it into Elasticsearch with the elasticsearch-plugin utility. Core plugins are released alongside Elasticsearch and share its version number.

In this section, we will get familiar with a few of the interesting plugins. Core plugins can be installed just by using the name...
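For instance, a core plugin such as analysis-icu is installed by passing its name to the utility from the Elasticsearch home directory:

bin/elasticsearch-plugin install analysis-icu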

Summary


This chapter concludes our coverage of the Elasticsearch APIs. It is a vast topic and not all of the APIs could be covered, but we now have the gist of how these APIs work and how they help us manage the Elasticsearch cluster, nodes, and indices, or even search for documents. When working with Kibana, the same tasks can be performed using the Console. Many REST-based clients, for numerous languages and platforms, have been built for Elasticsearch on top of the HTTP protocol, and we have been using that approach since Chapter 2, Stepping into Elasticsearch. This chapter also covered the other side of the story: using the Transport Client with the help of the Java API.

The next chapter focuses on customizing the Elastic Stack using plugins. Plugins give us a good amount of control over functionality and the liberty to implement what is missing or adapt what is present to our needs. We will learn how to create new plugins and customize the stack.
