Apache Solr Search Patterns

Leverage the power of Apache Solr to power up your business by navigating your users to their data quickly and efficiently

Apache Solr Search Patterns

This ebook is included in a Mapt subscription
Jayant Kumar

1 customer reviews
Leverage the power of Apache Solr to power up your business by navigating your users to their data quickly and efficiently
$0.00
$39.99
$49.99
$29.99p/m after trial
RRP $39.99
RRP $49.99
Subscription
eBook
Print + eBook
Start 30 Day Trial
Subscribe and access every Packt eBook & Video.
 
  • 4,000+ eBooks & Videos
  • 40+ New titles a month
  • 1 Free eBook/Video to keep every month
Start Free Trial
 
Preview in Mapt

Book Details

ISBN 139781783981847
Paperback316 pages

Book Description

Apache Solr is an open source search platform built on a Java library called Lucene. It serves as a search platform for many websites, as it has the capability of indexing and searching multiple websites to fetch desired results.

We begin with a brief introduction of analyzers and tokenizers to understand the challenges associated with implementing large-scale indexing and multilingual search functionality. We then move on to working with custom queries and understanding how filters work internally. While doing so, we also create our own query language or Solr plugin that does proximity searches. Furthermore, we discuss how Solr can be used for real-time analytics and tackle problems faced during its implementation in e-commerce search. We then dive deep into the spatial features such as indexing strategies and search/filtering strategies for a spatial search. We also do an in-depth analysis of problems faced in an ad serving platform and how Solr can be used to solve these problems.

Table of Contents

Chapter 1: Solr Indexing Internals
The job site problem statement – Solr indexing fundamentals
Working of analyzers, tokenizers, and filters
Handling a multilingual search
Measuring the quality of search results
The e-commerce problem statement
The job site problem statement
Challenges of large-scale indexing
The SolrCloud solution
Summary
Chapter 2: Customizing the Solr Scoring Algorithm
Relevance calculation
Building a custom scorer
Drawbacks of the TF-IDF model
The information gain model
Implementing the information gain model
Options to TF-IDF similarity
Summary
Chapter 3: Solr Internals and Custom Queries
Working of a scorer on an inverted index
Working of OR and AND clauses
The eDisMax query parser
Using BRS queries instead of DisMax
Building a custom query parser
Summary
Chapter 4: Solr for Big Data
Introduction to big data
Getting data points using facets
Radius faceting for location-based data
Data analysis using pivot faceting
Graphs for analytics
Summary
Chapter 5: Solr in E-commerce
Designing an e-commerce search
Handling unclean data
Handling variations in the product
Sorting
Problems and solutions of flash sale searches
Faceting with the option of multi-select
Faceting with hierarchical taxonomy
Faceting with size
Implementing semantic search
Optimizations
Summary
Chapter 6: Solr for Spatial Search
Features of spatial search
Lucene 4 spatial module
Indexing for spatial search
Searching and filtering on a spatial index
Distance sort and relevancy boost
Advanced concepts
Summary
Chapter 7: Using Solr in an Advertising System
Ad system functionalities
Architecture of an ad distribution system
Requirements of an ad distribution system
Performance improvements
Merging Solr with Redis
Summary
Chapter 8: AJAX Solr
The purpose of AJAX Solr
The AJAX Solr architecture
Working with AJAX Solr
Performance tuning
Summary
Chapter 9: SolrCloud
The SolrCloud architecture
Centralized configuration
Setting up SolrCloud
Distributed indexing and search
Routing documents to a particular shard
Adding more nodes to the SolrCloud
Fault tolerance and high availability in SolrCloud
Advanced sharding with SolrCloud
Asynchronous calls
Migrating documents to another collection
Sizing and monitoring of SolrCloud
Using SolrCloud as a NoSQL database
Summary
Chapter 10: Text Tagging with Lucene FST
An overview of FST and text tagging
Implementation of FST in Lucene
Text tagging algorithms
Using Solr for text tagging
Implementing a text tagger using Solr
Summary

What You Will Learn

  • Customize the Solr scoring algorithm to get better and more relevant search results
  • Use Solr with big data for analytical purposes
  • Get insights into Solr internals—indexing and search
  • Setting up and scaling with Solr cloud
  • Implement spatial search with Solr
  • Understand Finite State Transducers (FST) and implement text tagging using FST
  • Breeze through the strategies used in executing search using Solr in e-commerce, advertising, and real estate websites
  • Learn more about how to use Solr with AJAX

Authors

Table of Contents

Chapter 1: Solr Indexing Internals
The job site problem statement – Solr indexing fundamentals
Working of analyzers, tokenizers, and filters
Handling a multilingual search
Measuring the quality of search results
The e-commerce problem statement
The job site problem statement
Challenges of large-scale indexing
The SolrCloud solution
Summary
Chapter 2: Customizing the Solr Scoring Algorithm
Relevance calculation
Building a custom scorer
Drawbacks of the TF-IDF model
The information gain model
Implementing the information gain model
Options to TF-IDF similarity
Summary
Chapter 3: Solr Internals and Custom Queries
Working of a scorer on an inverted index
Working of OR and AND clauses
The eDisMax query parser
Using BRS queries instead of DisMax
Building a custom query parser
Summary
Chapter 4: Solr for Big Data
Introduction to big data
Getting data points using facets
Radius faceting for location-based data
Data analysis using pivot faceting
Graphs for analytics
Summary
Chapter 5: Solr in E-commerce
Designing an e-commerce search
Handling unclean data
Handling variations in the product
Sorting
Problems and solutions of flash sale searches
Faceting with the option of multi-select
Faceting with hierarchical taxonomy
Faceting with size
Implementing semantic search
Optimizations
Summary
Chapter 6: Solr for Spatial Search
Features of spatial search
Lucene 4 spatial module
Indexing for spatial search
Searching and filtering on a spatial index
Distance sort and relevancy boost
Advanced concepts
Summary
Chapter 7: Using Solr in an Advertising System
Ad system functionalities
Architecture of an ad distribution system
Requirements of an ad distribution system
Performance improvements
Merging Solr with Redis
Summary
Chapter 8: AJAX Solr
The purpose of AJAX Solr
The AJAX Solr architecture
Working with AJAX Solr
Performance tuning
Summary
Chapter 9: SolrCloud
The SolrCloud architecture
Centralized configuration
Setting up SolrCloud
Distributed indexing and search
Routing documents to a particular shard
Adding more nodes to the SolrCloud
Fault tolerance and high availability in SolrCloud
Advanced sharding with SolrCloud
Asynchronous calls
Migrating documents to another collection
Sizing and monitoring of SolrCloud
Using SolrCloud as a NoSQL database
Summary
Chapter 10: Text Tagging with Lucene FST
An overview of FST and text tagging
Implementation of FST in Lucene
Text tagging algorithms
Using Solr for text tagging
Implementing a text tagger using Solr
Summary

Book Details

ISBN 139781783981847
Paperback316 pages
Read More
From 1 reviews

Read More Reviews