Reader small image

You're reading from  Elasticsearch 7 Quick Start Guide

Product typeBook
Published inOct 2019
PublisherPackt
ISBN-139781789803327
Edition1st Edition
Right arrow
Authors (2):
Anurag Srivastava
Anurag Srivastava
author image
Anurag Srivastava

Anurag Srivastava is a senior technical lead in a multinational software company. He has more than 12 years' experience in web-based application development. He is proficient in designing architecture for scalable and highly available applications. He has handled development teams and multiple clients from all over the globe over the past 10 years of his professional career. He has significant experience with the Elastic Stack (Elasticsearch, Logstash, and Kibana) for creating dashboards using system metrics data, log data, application data, and relational databases. He has authored three other booksMastering Kibana 6.x, and Kibana 7 Quick Start Guide, and Learning Kibana 7 - Second Edition, all published by Packt.
Read more about Anurag Srivastava

Douglas Miller
Douglas Miller
author image
Douglas Miller

Douglas Miller is an expert in helping fast-growing companies to improve performance and stability, and in building search platforms using Elasticsearch. Clients (including Walgreens, Nike, Boeing, and Dish Networks) have seen sales increase, fast performance times, and lower overall costs in terms of the total costs of ownership for their Elasticsearch clusters.
Read more about Douglas Miller

View More author details
Right arrow

Anatomy of an analyzer

An analyzer is a package that contains three building blocks: character filters, tokenizers, and token filters. A user can create a custom analyzer by using these or other building blocks to create the functionality needed. Allow me to elaborate more on what these building blocks are:

  • Character filters convert text into a stream of characters. They can transform the stream by adding, removing, or changing the format of the characters. For example, a character filer can change the & character to the word and. An analyzer may have no character filters, or many, but they are always applied in order.
  • Tokenizers receive the stream of characters and break it down into tokens. The output will then be a stream of tokens. For example, a whitespace tokenizer breaks the text using whitespaces: Hello World! into [hello, world]. It also records the order of the...
lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Elasticsearch 7 Quick Start Guide
Published in: Oct 2019Publisher: PacktISBN-13: 9781789803327

Authors (2)

author image
Anurag Srivastava

Anurag Srivastava is a senior technical lead in a multinational software company. He has more than 12 years' experience in web-based application development. He is proficient in designing architecture for scalable and highly available applications. He has handled development teams and multiple clients from all over the globe over the past 10 years of his professional career. He has significant experience with the Elastic Stack (Elasticsearch, Logstash, and Kibana) for creating dashboards using system metrics data, log data, application data, and relational databases. He has authored three other booksMastering Kibana 6.x, and Kibana 7 Quick Start Guide, and Learning Kibana 7 - Second Edition, all published by Packt.
Read more about Anurag Srivastava

author image
Douglas Miller

Douglas Miller is an expert in helping fast-growing companies to improve performance and stability, and in building search platforms using Elasticsearch. Clients (including Walgreens, Nike, Boeing, and Dish Networks) have seen sales increase, fast performance times, and lower overall costs in terms of the total costs of ownership for their Elasticsearch clusters.
Read more about Douglas Miller