Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Elasticsearch 5.x Cookbook - Third Edition

You're reading from  Elasticsearch 5.x Cookbook - Third Edition

Product type Book
Published in Feb 2017
Publisher
ISBN-13 9781786465580
Pages 696 pages
Edition 3rd Edition
Languages
Author (1):
Alberto Paro Alberto Paro
Profile icon Alberto Paro

Table of Contents (25) Chapters

Credits
About the Author
About the Reviewer
www.PacktPub.com
Customer Feedback
Dedication
Preface
Getting Started Downloading and Setup Managing Mappings Basic Operations Search Text and Numeric Queries Relationships and Geo Queries Aggregations Scripting Managing Clusters and Nodes Backup and Restore User Interfaces Ingest Java Integration Scala Integration Python Integration Plugin Development Big Data Integration

Indexing data with Apache Pig


Apache Pig (https://pig.apache.org/) is a tool frequently used to store/manipulate data in datastores. It can be very handy if you need to import some CSV in Elasticsearch in a very fast way.

Getting ready

You need an up-and-running Elasticsearch installation as we described in Downloading and installing Elasticsearch recipe in Chapter 2, Downloading and Setup.

You need a working Pig installation. Depending on your operating system you should follow the instruction at http://pig.apache.org/docs/r0.16.0/start.html.

If you are using Mac OS X with Homebrew you can install it with brew install pig.

How to do it...

We want read a CSV and write the data in Elasticsearch. We will perform the steps given as follows:

  1. We will download a CSV dataset from geonames site: all the geoname locations of Great Britain. We can fast download them and unzip them via:

            wget http://download.geonames.org/export/dump/GB.zip 
            unzip GB.zip 
    
  2. We can write es.pig that contains...

lock icon The rest of the chapter is locked
arrow left Previous Chapter
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}