Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Elasticsearch 7 Quick Start Guide

You're reading from  Elasticsearch 7 Quick Start Guide

Product type Book
Published in Oct 2019
Publisher Packt
ISBN-13 9781789803327
Pages 186 pages
Edition 1st Edition
Languages
Authors (2):
Anurag Srivastava Anurag Srivastava
Profile icon Anurag Srivastava
Douglas Miller Douglas Miller
Profile icon Douglas Miller
View More author details

Data sparsity

In previous versions of Elasticsearch, the sparsity of documents was to be avoided because of Lucene's structure. This structure identifies documents internally with document IDs, which are then used for communication between the internal APIs of Lucene. Lucene retrieves values of the norm from the document ID, generated by a search query, by reading the byte at the index of the document ID.

Lucene is a full-featured text search engine that is written in Java, and Elasticsearch is built on top of Lucene.

This is, at the same time, both very efficient and time-intensive, because Lucene can quickly access the norm values and the documents that have no value and use one byte of storage for each. This means, though, that if an index has x documents, the norms require x bytes of storage per field. This not only affects the sparsity requirements, but also the indexing...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}