Apache Solr Enterprise Search Server - Third Edition

Enhance your search experience with Apache Solr 4: harness faceted navigation, result highlighting, and relevancy-ranked sorting with ease

David Smiley et al.

Book Details

ISBN 13: 9781782161363
Paperback: 432 pages

Book Description

Apache Solr is a widely popular open source enterprise search server that delivers powerful search and faceted navigation, features that are elusive with databases. Solr supports complex search criteria, faceting, result highlighting, query completion, query spell-checking, relevancy tuning, geospatial searches, and much more.

This book is a comprehensive resource for just about everything Solr has to offer, and it will take you from first exposure to development and deployment in no time. Even if you wish to use Solr 5, you should find the information to be just as applicable due to Solr's high regard for backward compatibility. The book includes some useful information specific to Solr 5.

Table of Contents

Chapter 1: Quick Starting Solr
An introduction to Solr
A few differences between Solr 4 and Solr 5
Getting started
A quick tour of Solr
Configuration files
What's next?
Resources outside this book
Summary
Chapter 2: Schema Design
Is Solr schemaless?
MusicBrainz.org
One combined index or separate indices
Schema design
The schema.xml file
Summary
Chapter 3: Text Analysis
Configuring field types
Character filters
Tokenization
Filtering
The multilingual search
Summary
Chapter 4: Indexing Data
Communicating with Solr
Solr's Update-XML format
Commit, optimize, and rollback the transaction log
Atomic updates and optimistic concurrency
Sending CSV-formatted data to Solr
The DataImportHandler framework
Indexing documents with Solr Cell
Update request processors
Summary
Chapter 5: Searching
Your first search – a walk-through
Solr's generic XML structured data representation
Solr's XML response format
Understanding request handlers
Query parameters
Query parsers and local-params
Query syntax (the lucene query parser)
The DisMax query parser – part 1
Filtering
Sorting
Joining
Spatial search
Summary
Chapter 6: Search Relevancy
Scoring
The DisMax query parser – part 2
Functions and function queries
Summary
Chapter 7: Faceting
A quick example – faceting release types
Field requirements
Types of faceting
Faceting field values
Faceting numeric and date ranges
Facet queries
Building a filter query from a facet
Pivot faceting
Excluding filters – multiselect faceting
Summary
Chapter 8: Search Components
About components
The highlight component
The SpellCheck component
Query complete/suggest
The QueryElevation component
The MoreLikeThis component
The Stats component
The Clustering component
Collapsing and expanding
The TermVector component
Summary
Chapter 9: Integrating Solr
Working with the included examples
Solritas – the integrated search UI
SolrJ – Solr's Java client API
Using JavaScript/AJAX with Solr
Using XSLT to transform XML search results
Accessing Solr from PHP applications
Ruby on Rails integrations
Nutch for crawling web pages
Solr and Hadoop
ManifoldCF – a connector framework
Document-level security
Summary
Chapter 10: Scaling Solr
Tuning complex systems is hard
Use SolrMeter to test Solr performance
Optimizing a single Solr server – scale up
Configuring Solr for near real-time search
Use SolrCloud to go big – scale wide
Summary
Chapter 11: Deployment
Deployment methodology for Solr
Installing Solr into a Servlet container
Configuring logging
A RequestHandler per search interface
Leveraging Solr cores
Setting up ZooKeeper for SolrCloud
Monitoring Solr performance
Securing Solr from prying eyes
Summary

What You Will Learn

  • Design a schema to include text indexing details such as tokenization, stemming, and synonyms
  • Import data from databases using various formats including CSV and XML and extract text from different document formats
  • Search using Solr's rich query syntax, perform geospatial searches, "join" relationally, and influence relevancy order
  • Build a query auto-complete/suggester capability with knowledge of the fundamental types of suggestion and ways to implement them
  • Enhance standard searches with faceting for navigation or analytics
  • Deploy Solr to production taking into account logging, security, and monitoring
  • Integrate a host of technologies with Solr including web crawlers, Hadoop, Java, JavaScript, Ruby, PHP, Drupal, and others
  • Tune Solr and use SolrCloud for horizontal scalability
