Reader small image

You're reading from  Learning Elasticsearch

Product typeBook
Published inJun 2017
PublisherPackt
ISBN-139781787128453
Edition1st Edition
Right arrow
Author (1)
Abhishek Andhavarapu
Abhishek Andhavarapu
author image
Abhishek Andhavarapu

Abhishek Andhavarapu is a software engineer at eBay who enjoys working on highly scalable distributed systems. He has a master's degree in Distributed Computing and has worked on multiple enterprise Elasticsearch applications, which are currently serving hundreds of millions of requests per day. He began his journey with Elasticsearch in 2012 to build an analytics engine to power dashboards and quickly realized that Elasticsearch is like nothing out there for search and analytics. He has been a strong advocate since then and wrote this book to share the practical knowledge he gained along the way.
Read more about Abhishek Andhavarapu

Right arrow

Translog

In the What happens when you index a document section, we discussed that when you index a document, the refresh process creates a new segment. Since writing the segment to the disk on every refresh is very expensive, the segment is only written to in-memory file system cache. When certain conditions are met, a process known as Lucene commit writes all the files (segments) in the memory to a physical disk.

If a node crashes before the files in memory are persisted to the physical disk, the data in the file system cache is lost, and any uncommitted changes are also lost. But Lucene commit is very expensive and cannot be done after every operation. To solve this problem, Elasticsearch introduced transaction log, which is a write-ahead log:

During an index or delete operation, the document is written to both the memory buffer and the transaction log. The index/delete operation...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Learning Elasticsearch
Published in: Jun 2017Publisher: PacktISBN-13: 9781787128453

Author (1)

author image
Abhishek Andhavarapu

Abhishek Andhavarapu is a software engineer at eBay who enjoys working on highly scalable distributed systems. He has a master's degree in Distributed Computing and has worked on multiple enterprise Elasticsearch applications, which are currently serving hundreds of millions of requests per day. He began his journey with Elasticsearch in 2012 to build an analytics engine to power dashboards and quickly realized that Elasticsearch is like nothing out there for search and analytics. He has been a strong advocate since then and wrote this book to share the practical knowledge he gained along the way.
Read more about Abhishek Andhavarapu