Reader small image

You're reading from  Learning Elasticsearch

Product typeBook
Published inJun 2017
PublisherPackt
ISBN-139781787128453
Edition1st Edition
Right arrow
Author (1)
Abhishek Andhavarapu
Abhishek Andhavarapu
author image
Abhishek Andhavarapu

Abhishek Andhavarapu is a software engineer at eBay who enjoys working on highly scalable distributed systems. He has a master's degree in Distributed Computing and has worked on multiple enterprise Elasticsearch applications, which are currently serving hundreds of millions of requests per day. He began his journey with Elasticsearch in 2012 to build an analytics engine to power dashboards and quickly realized that Elasticsearch is like nothing out there for search and analytics. He has been a strong advocate since then and wrote this book to share the practical knowledge he gained along the way.
Read more about Abhishek Andhavarapu

Right arrow

Doc values

Before we jump into doc values, let's quickly refresh what an inverted index is and why it is needed. Let's says we have the following documents:

  • Doc 1: Apple
  • Doc2: Apple
  • Doc3: Samsung

The inverted index for the preceding documents looks like the following:

Term Doc ID
Apple 1, 2
Samsung 3

To find all the products manufactured by Apple, we would simply use a match query as shown here:

{
"query": {
"match": {
"manufacturer": "Apple"
}
}
}

With the help of inverted index, we can quickly look up all the documents associated with term Apple. But if you want to sort or run the aggregation using the inverted index, we have to go through the entire terms list and collect the document IDs, which is practically not possible. To solve this problem, doc values are introduced. Doc values for the preceding...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Learning Elasticsearch
Published in: Jun 2017Publisher: PacktISBN-13: 9781787128453

Author (1)

author image
Abhishek Andhavarapu

Abhishek Andhavarapu is a software engineer at eBay who enjoys working on highly scalable distributed systems. He has a master's degree in Distributed Computing and has worked on multiple enterprise Elasticsearch applications, which are currently serving hundreds of millions of requests per day. He began his journey with Elasticsearch in 2012 to build an analytics engine to power dashboards and quickly realized that Elasticsearch is like nothing out there for search and analytics. He has been a strong advocate since then and wrote this book to share the practical knowledge he gained along the way.
Read more about Abhishek Andhavarapu