Apache Flume: Distributed Log Collection for Hadoop - Second Edition

Design and implement a series of Flume agents to send streamed data into Hadoop

Steve Hoffman

Book Details

ISBN 13: 9781784392178
Paperback: 178 pages

Book Description

Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It is used to stream logs from application servers to HDFS for ad hoc analysis.

This book starts with an architectural overview of Flume and its logical components. It explores channels, sinks, and sink processors, followed by sources and channel selectors. By the end of this book, you will be fully equipped to construct a series of Flume agents to dynamically transport your stream data and logs from your systems into Hadoop.

This step-by-step book guides you through the architecture and components of Flume, covering different approaches that are then pulled together in a real-world, end-to-end use case. It progresses gradually from the simplest features to the most advanced.

What You Will Learn

  • Understand the Flume architecture and learn how to download and install open source Flume from Apache
  • Follow a detailed example of transporting weblogs in Near Real Time (NRT) to Kibana/Elasticsearch and archiving them in HDFS
  • Learn tips and tricks for transporting logs and data in your production environment
  • Understand and configure the Hadoop Distributed File System (HDFS) Sink
  • Use a morphline-backed Sink to feed data into Solr
  • Create redundant data flows using sink groups
  • Configure and use various sources to ingest data
  • Inspect data records and move them between multiple destinations based on payload content
  • Transform data en route to Hadoop and monitor your data flows
