Apache Solr High Performance


Apache Solr High Performance
eBook: $20.99
Formats: PDF, PacktLib, ePub and Mobi formats
$17.84
save 15%!
Print + free eBook + free PacktLib access to the book: $55.98    Print cover: $34.99
$34.99
save 37%!
Free Shipping!
UK, US, Europe and selected countries in Asia.
Also available on:
Overview
Table of Contents
Author
Support
Sample Chapters
  • Achieve high scores by boosting query time and index time, implementing boost queries and functions using the Dismax query parser and formulae.
  • Set up and use SolrCloud for distributed indexing and searching, and implement distributed search using Shards
  • Use GeoSpatial search, handling homophones, and ignoring listed words from being indexed and searched

Book Details

Language : English
Paperback : 124 pages [ 235mm x 191mm ]
Release Date : March 2014
ISBN : 1782164820
ISBN 13 : 9781782164821
Author(s) : Surendra Mohan
Topics and Technologies : All Books, Big Data and Business Intelligence, Open Source


Table of Contents

Preface
Chapter 1: Installing Solr
Chapter 2: Boost Your Search
Chapter 3: Performance Optimization
Chapter 4: Additional Performance Optimization Techniques
Chapter 5: Troubleshooting
Chapter 6: Performance Optimization with ZooKeeper
Appendix: Resources
Index
  • Chapter 2: Boost Your Search
    • Scoring
      • Query-time and index-time boosting
        • Index-time boosting
        • Query-time boosting
      • Troubleshoot queries and scores
    • The dismax query parser
      • Lucene DisjunctionMaxQuery
      • Autophrase boosting
        • Configuring autophrase boosting
        • Configuring the phrase slop
        • Boosting a partial phrase
      • Boost queries
      • Boost functions
        • Boost addition and multiplication
    • Function queries
      • Field references
      • Function references
      • Mathematical operations
      • The ord() and rord() functions
      • Other functions
      • Boosting the function query
      • Logarithm
      • Reciprocal
      • Linear
      • Inverse reciprocal
    • Summary
  • Chapter 3: Performance Optimization
    • Solr performance factors
    • Solr caching
      • Document caching
      • Query result caching
      • Filter caching
      • Result pages caching
    • Using SolrCloud
      • Creating a SolrCloud cluster
      • Multiple collections within a cluster
      • Managing a SolrCloud cluster
      • Distributed indexing and searching
      • Stopping automatic document distribution
    • Near real-time search
    • Summary
  • Chapter 5: Troubleshooting
    • Dealing with the corrupt index
    • Reducing the file count in the index
    • Dealing with the locked index
    • Truncating the index size
    • Dealing with a huge count of open files
    • Dealing with out-of-memory issues
    • Dealing with an infinite loop exception in shards
    • Dealing with expensive garbage collection
    • Bulk updating a single field without full indexation
    • Summary
  • Chapter 6: Performance Optimization with ZooKeeper
    • Getting familiar with ZooKeeper
      • Prerequisites for a distributed server
      • Aid your distributed system using ZooKeeper
      • Setting an ideal node count for ZooKeeper
    • Setting up, configuring, and deploying ZooKeeper
      • Setting up ZooKeeper
      • Configuring ZooKeeper
      • Deploying ZooKeeper
    • Applications of ZooKeeper
    • Summary

Surendra Mohan

Surendra Mohan, who has served a few top-notch software organizations in varied roles, is currently a freelance software consultant. He has been working on various cutting-edge technologies such as Drupal, Moodle, Apache Solr, ElasticSearch, and Node.js for the past 10 years. He also delivers technical talks at various community events like Drupal Meetups and Drupal Camps. To find out more about him, his write-ups, technical blogs, and much more, check out http://www.surendramohan.info/.

He has also authored books such as Administrating Solr and Apache Solr High Performance published by Packt Publishing and has reviewed other technical books such as Drupal 7 Multi Site Configuration, Drupal Search Engine Optimization, titles on Drupal commerce, ElasticSearch, Drupal-related video tutorials, title on OpsView, and many more.

Additionally, he writes technical blogs and articles with SitePoint.com. His published blogs and articles can be found at http://www.sitepoint.com/author/smohan/.

Sorry, we don't have any reviews for this title yet.

Code Downloads

Download the code and support files for this book.


Submit Errata

Please let us know if you have found any errors not listed on this list by completing our errata submission form. Our editors will check them and add them to this list. Thank you.

Sample chapters

You can view our sample chapters and prefaces of this title on PacktLib or download sample chapters in PDF format.

Frequently bought together

Apache Solr High Performance +    Creating E-Learning Games with Unity =
50% Off
the second eBook
Price for both: $30.90

Buy both these recommended eBooks together and get 50% off the cheapest eBook.

What you will learn from this book

  • Boost your search based on scores, the DisMax query parser, and function queries.
  • Explore performance metrics along with implementing different Solr caching like Document, query result, filter, and whole result page caching.
  • Index and search across shards and near real-time searching.
  • Get to grips with additional performance optimization activities like fetching documents similar to the ones queried, searching homophones, or filtering searches on the basis of specific key words.
  • Troubleshoot the common problems like corrupt and locked indexes, memory, expensive garbage collection, and infinite loop exception when using multiple server environment efficiently
  • Set up, configure, and deploy various applications of ZooKeeper to optimize Solr’s performance

In Detail

Apache Solr is one of the most popular open source search servers available on the web. However, simply setting up Apache Solr is not enough to ensure the success of your web product. To maximize efficiency, you need to use techniques to boost Solr performance in order to return relevant results faster. You need to implement robust techniques that focus on optimizing the performance of your Solr instances and also troubleshoot issues that are prone to arise while maintaining Solr.

Apache Solr High Performance is a practical guide that will help you explore and take full advantage of the robust nature of Apache Solr so as to achieve optimized Solr instances, especially in terms of performance.

You will learn everything you need to know in order to achieve a high performing Solr instance or set of instances, as well as how to troubleshoot the common problems you are prone to face while working with single or multiple Solr servers.

This book offers you an introduction by explaining the prerequisites of Apache Solr and installing it, while also integrating it with the required additional components, and gradually progresses into features that make Solr flexible enough to achieve high performance ratings in various circumstances. Moving forward, the book will cover several clear and highly practical concepts that will help you further optimize your Solr instances’ performance both on single as well as multiple servers, and learn how to troubleshoot common problems that are prone to arise while using your Solr instance. By the end of the book you will also learn how to set up, configure, and deploy ZooKeeper along with learning more about other applications of ZooKeeper.

You will also learn how to handle data in multiple server environments, searches based on specific geographical co-ordinates, different caching techniques, and various algorithms and formulae that enable better performance; and many more.

Approach

This book is an easy-to-follow guide, full of hands-on, real-world examples. Each topic is explained and demonstrated in a specific and user-friendly flow, from search optimization using Solr to Deployment of Zookeeper applications.

Who this book is for

This book is ideal for Apache Solr developers who want to learn different techniques to optimize Solr's performance with utmost efficiency, along with effectively troubleshooting the problems that usually occur while trying to boost performance. Familiarity with search servers and database querying is expected.

Code Download and Errata
Packt Anytime, Anywhere
Register Books
Print Upgrades
eBook Downloads
Video Support
Contact Us
Awards Voting Nominations Previous Winners
Judges Open Source CMS Hall Of Fame CMS Most Promising Open Source Project Open Source E-Commerce Applications Open Source JavaScript Library Open Source Graphics Software
Resources
Open Source CMS Hall Of Fame CMS Most Promising Open Source Project Open Source E-Commerce Applications Open Source JavaScript Library Open Source Graphics Software