Free Sample
+ Collection

Pentaho for Big Data Analytics

Manoj R Patil, Feris Thia

With your knowledge of Java and this guide, you can take the analysis of your big data to new levels using Pentaho. Covers all the essentials tools, techniques, tips, and tricks in one handy volume.
RRP $23.99
RRP $39.99
Print + eBook

Want this title & more?

$12.99 p/month

Subscribe to PacktLib

Enjoy full and instant access to over 2000 books and videos – you’ll find everything you need to stay ahead of the curve and make sure you can always get the job done.

Book Details

ISBN 139781783282159
Paperback118 pages

About This Book

  • A guide to using Pentaho Business Analytics for Big Data analysis
  • Learn Pentaho’s visualization and reporting tools with practical examples and tips
  • Precise insights into churning big data into meaningful knowledge with Pentaho

Who This Book Is For

This book is for developers, system administrators, and business intelligence professionals looking to learn how to get more out of their data through Pentaho. In order to best engage with the examples, some knowledge of Java will be required.

Table of Contents

Chapter 1: The Rise of Pentaho Analytics along with Big Data
Pentaho BI Suite – components
Edge over competitors
Chapter 2: Setting Up the Ground
Pentaho BI Server and the development platform
Prerequisites/system requirements
Obtaining Pentaho BI Server (Community Edition)
The JAVA_HOME and JRE_HOME environment variables
Running Pentaho BI Server
Pentaho User Console (PUC)
Pentaho Action Sequence and solution
The JPivot component example
The message template component example
The embedded HSQLDB database server
Pentaho Marketplace
Saiku installation
Pentaho Administration Console (PAC)
Creating data connections
Chapter 3: Churning Big Data with Pentaho
An overview of Big Data and Hadoop
The Hadoop architecture
Pentaho Data Integration (PDI)
Importing data to Hive
Putting a data file into HDFS
Loading data from HDFS into Hive (job orchestration)
Chapter 4: Pentaho Business Analytics Tools
The business analytics life cycle
Preparing data
Pentaho Reporting
Data visualization and dashboard building
Chapter 5: Visualization of Big Data
Data visualization
Data source preparation
Visualizing data using CTools
CSS styling

What You Will Learn

  • Get to grips with the Pentaho suite
  • Explore the basics of Big Data and its business context
  • Set up a Pentaho business analytics server
  • Consume Big Data on HDFS platform using Pentaho Data Integration
  • Create visualization with Pentaho's tools
  • Distinguish signal from noise with Pentaho's Data Analytics capabilities
  • Design and set up your own Pentaho dashboard
  • Move from data to analytics in just a few steps with Community Dashboard Framework (CDF)

In Detail

Pentaho accelerates the realization of value from big data with the most complete solution for big data analytics and data integration. The real power of big data analytics is the abstraction between data and analytics. Data can be distributed across the cluster in various formats, and the analytics platform should have the capability to talk to different heterogeneous data stores and fetch the filtered data to enrich its value.

Pentaho Big Data Analytics is a practical, hands-on guide that provides you with clear, step-by-step exercises for using Pentaho to take advantage of big data systems, where data beats algorithm, and gives you a good grounding in using Pentaho Business Analytics’ capabilities.

This book looks at the key ingredients of the Pentaho Business Analytics platform. We will see how to prepare the Pentaho BI environment, and get to grips with the big data ecosystem through. The book provides a clear guide to the essential tools of Pentaho Business Analytics, providing familiarity with both the various design tools for setting up reports, and the visualization tools necessary for complete data analysis.


Read More