Data Lake for Enterprises

A practical guide to implementing your enterprise data lake using Lambda Architecture as the base

Tomcy John, Pankaj Misra


Book Details

ISBN 13: 9781787281349
Paperback: 596 pages

Book Description

"Data Lake" has recently become a prominent term in the big data industry. Data scientists can use a data lake to derive meaningful insights that businesses can use to redefine or transform the way they operate. Lambda Architecture is also emerging as one of the prominent patterns in the big data landscape: it not only helps derive useful information from historical data but also correlates it with real-time data, enabling businesses to make critical decisions. This book brings these two important topics, data lakes and Lambda Architecture, together.
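As a rough illustration of how a Lambda Architecture combines historical and real-time data, here is a minimal Python sketch. It is not taken from the book: the function names (`batch_view`, `speed_view`, `serve`) and the sample events are hypothetical, and in-memory counters stand in for technologies such as Hadoop and Spark Streaming.

```python
from collections import Counter

# Hypothetical event format: one dict per event, e.g. {"user": "alice"}.

def batch_view(historical_events):
    """Batch layer: recompute per-user totals from the full historical dataset."""
    return Counter(event["user"] for event in historical_events)

def speed_view(recent_events):
    """Speed layer: count only the events that arrived after the last batch run."""
    return Counter(event["user"] for event in recent_events)

def serve(batch, speed, user):
    """Serving layer: answer a query by merging the batch and real-time views."""
    return batch[user] + speed[user]

# Illustrative data only.
historical = [{"user": "alice"}, {"user": "bob"}, {"user": "alice"}]
recent = [{"user": "alice"}]

batch = batch_view(historical)
speed = speed_view(recent)
print(serve(batch, speed, "alice"))  # prints 3 (2 historical + 1 recent)
```

The point of the split is that the batch view can be slow but complete, while the speed view covers only the gap since the last batch run; a query sees both.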

This book is divided into three main sections. The first introduces the concept of data lakes, explains their importance in enterprises, and gets you up to speed with the Lambda Architecture. The second delves into the principal components of building a data lake using the Lambda Architecture, and introduces popular big data technologies such as Apache Hadoop, Spark, Sqoop, Flume, and Elasticsearch. The third is a highly practical demonstration that puts it all together: it shows how an enterprise data lake can be implemented, along with several real-world use cases, and how other peripheral components can be added to the lake to make it more efficient.

By the end of this book, you will be able to choose the right big data technologies and apply Lambda Architecture patterns to build your enterprise data lake.

What You Will Learn

  • Build an enterprise-level data lake using the relevant big data technologies
  • Understand the core of the Lambda architecture and how to apply it in an enterprise
  • Learn the technical details around Sqoop and its functionalities
  • Integrate Kafka with Hadoop components to acquire enterprise data
  • Use Flume with streaming technologies for stream-based processing
  • Understand stream-based processing with reference to Apache Spark Streaming
  • Incorporate Hadoop components and know the advantages they provide for enterprise data lakes
  • Build fast, streaming, and high-performance applications using Elasticsearch
  • Make your data ingestion process consistent across various data formats with configurability
  • Process your data to derive intelligence using machine learning algorithms
