Mastering Elasticsearch - Second Edition

Further your knowledge of the Elasticsearch server by learning more about its internals, querying, and data handling

Mastering Elasticsearch - Second Edition

This ebook is included in a Mapt subscription
Rafał Kuć, Marek Rogoziński

1 customer reviews
Further your knowledge of the Elasticsearch server by learning more about its internals, querying, and data handling
$0.00
$16.50
$54.99
$29.99p/m after trial
RRP $32.99
RRP $54.99
Subscription
eBook
Print + eBook
Start 30 Day Trial
Subscribe and access every Packt eBook & Video.
 
  • 4,000+ eBooks & Videos
  • 40+ New titles a month
  • 1 Free eBook/Video to keep every month
Start Free Trial
 
Preview in Mapt

Book Details

ISBN 139781783553792
Paperback434 pages

Book Description

Elasticsearch is a modern, fast, distributed, scalable, fault tolerant, and open source search and analytics engine. Elasticsearch leverages the capabilities of Apache Lucene, providing a new level of control over how you can index and search even huge sets of data.

This book covers intermediate and advanced functionalities of Elasticsearch and walks you through its internals including caches, the Apache Lucene library, and its monitoring capabilities. You'll learn about practical usage of Elasticsearch configuration parameters and how to use the monitoring API.

With this book, you'll delve into Elasticsearch's query rewrite, query template, bulk operation, document grouping, and function score queries. You will also learn how to improve user search experience, index distribution, segment statistics, and merging. By the end of the book, you will be able to enhance Elasticsearch's performance and create your own Elasticsearch plugins.

Table of Contents

Chapter 1: Introduction to Elasticsearch
Introducing Apache Lucene
Introducing Elasticsearch
The story
Summary
Chapter 2: Power User Query DSL
Default Apache Lucene scoring explained
Query rewrite explained
Query templates
Handling filters and why it matters
Choosing the right query for the job
Summary
Chapter 3: Not Only Full Text Search
Query rescoring
Controlling multimatching
Significant terms aggregation
Documents grouping
Relations between documents
Scripting changes between Elasticsearch versions
Summary
Chapter 4: Improving the User Search Experience
Correcting user spelling mistakes
Improving the query relevance
Summary
Chapter 5: The Index Distribution Architecture
Choosing the right amount of shards and replicas
Routing explained
Altering the default shard allocation behavior
Query execution preference
Summary
Chapter 6: Low-level Index Control
Altering Apache Lucene scoring
Choosing the right directory implementation – the store module
NRT, flush, refresh, and transaction log
Segment merging under control
When it is too much for I/O – throttling explained
Understanding Elasticsearch caching
Summary
Chapter 7: Elasticsearch Administration
Discovery and recovery modules
The human-friendly status API – using the Cat API
Backing up
Federated search
Summary
Chapter 8: Improving Performance
Using doc values to optimize your queries
Knowing about garbage collector
Benchmarking queries
Very hot threads
Scaling Elasticsearch
Summary
Chapter 9: Developing Elasticsearch Plugins
Creating the Apache Maven project structure
Understanding the basics
Creating custom REST action
Creating the custom analysis plugin
Summary

What You Will Learn

  • Understand Apache Lucene and Elasticsearch's design and architecture
  • Use and configure different scoring models to alter the default scoring mechanism
  • Choose the appropriate amount of shards and replicas for your deployment
  • Improve user search experience by utilizing Elasticsearch functionality
  • Control segment merging and learn why Elasticsearch uses merging
  • Develop custom Elasticsearch plugins and cover detailed examples of how to extend Elasticsearch by writing your own plugins
  • Apply your knowledge to create scalable, efficient, and fault tolerant clusters and monitor your cluster by using and understanding the Elasticsearch API

Authors

Table of Contents

Chapter 1: Introduction to Elasticsearch
Introducing Apache Lucene
Introducing Elasticsearch
The story
Summary
Chapter 2: Power User Query DSL
Default Apache Lucene scoring explained
Query rewrite explained
Query templates
Handling filters and why it matters
Choosing the right query for the job
Summary
Chapter 3: Not Only Full Text Search
Query rescoring
Controlling multimatching
Significant terms aggregation
Documents grouping
Relations between documents
Scripting changes between Elasticsearch versions
Summary
Chapter 4: Improving the User Search Experience
Correcting user spelling mistakes
Improving the query relevance
Summary
Chapter 5: The Index Distribution Architecture
Choosing the right amount of shards and replicas
Routing explained
Altering the default shard allocation behavior
Query execution preference
Summary
Chapter 6: Low-level Index Control
Altering Apache Lucene scoring
Choosing the right directory implementation – the store module
NRT, flush, refresh, and transaction log
Segment merging under control
When it is too much for I/O – throttling explained
Understanding Elasticsearch caching
Summary
Chapter 7: Elasticsearch Administration
Discovery and recovery modules
The human-friendly status API – using the Cat API
Backing up
Federated search
Summary
Chapter 8: Improving Performance
Using doc values to optimize your queries
Knowing about garbage collector
Benchmarking queries
Very hot threads
Scaling Elasticsearch
Summary
Chapter 9: Developing Elasticsearch Plugins
Creating the Apache Maven project structure
Understanding the basics
Creating custom REST action
Creating the custom analysis plugin
Summary

Book Details

ISBN 139781783553792
Paperback434 pages
Read More
From 1 reviews

Read More Reviews