Getting Started with Greenplum for Big Data Analytics

A hands-on guide on how to execute an analytics project from conceptualization to operationalization using Greenplum

Sunila Gollapudi

Book Details

ISBN-13: 9781782177043
Paperback: 172 pages

Book Description

Organizations use data and analytics to gain a competitive advantage, and are quickly becoming more data driven as a result. With the advent of Big Data, existing Data Warehousing and Business Intelligence solutions are becoming obsolete, and the need for new, agile platforms that address every aspect of Big Data has become unavoidable. From loading and integrating data to presenting analytical visualizations and reports, new Big Data platforms like Greenplum do it all. What now needs tuning is the mindset of the user who puts these solutions to work.

"Getting Started with Greenplum for Big Data Analytics" is a practical, hands-on guide to learning and implementing Big Data Analytics using the Greenplum Integrated Analytics Platform. From processing structured and unstructured data to presenting the results/insights to key business stakeholders, this book explains it all.

"Getting Started with Greenplum for Big Data Analytics" discusses the key characteristics of Big Data and its impact on current Data Warehousing platforms. It will take you through the standard Data Science project lifecycle and will lay down the key requirements for an integrated analytics platform. It then explores the various software and appliance components of Greenplum and discusses the relevance of each component at every level in the Data Science lifecycle.

You will also learn Big Data architectural patterns and recap some key advanced analytics techniques in detail. The book covers programming with R and its integration with Greenplum for implementing analytics. Additionally, you will explore MADlib and advanced SQL techniques in Greenplum for analytics. The book also elaborates on the physical architecture of Greenplum, with guidance on handling high availability, backup, and recovery.

What You Will Learn

  • Load data from multiple data sources using the built-in ELT / ETL
  • Learn Parallel Processing / MPP / MapReduce techniques
  • Program with R and MADlib
  • Understand backup and recovery implementation in Greenplum
  • Optimize data processing and querying using optimal distribution and partitioning strategies
  • Exchange data between the Greenplum Database and Hadoop
  • Handle high-availability requirements on Greenplum
  • Integrate ETL, reporting, and visualization tools
