Apache Flume: Distributed Log Collection for Hadoop - Second Edition

Design and implement a series of Flume agents to send streamed data into Hadoop

Steve Hoffman

Book Details

ISBN-13: 9781784392178
Paperback: 178 pages

Book Description

Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It is used to stream logs from application servers to HDFS for ad hoc analysis.

This book starts with an architectural overview of Flume and its logical components. It then explores channels, sinks, and sink processors, followed by sources and channel selectors. By the end of this book, you will be fully equipped to construct a series of Flume agents that dynamically transport your stream data and logs from your systems into Hadoop.

This step-by-step book guides you through the architecture and components of Flume, covering different approaches that are then pulled together in a real-world, end-to-end use case, progressing gradually from the simplest to the most advanced features.

What You Will Learn

  • Understand the Flume architecture and learn how to download and install open source Flume from Apache
  • Follow a detailed example of transporting weblogs in near real time (NRT) to Kibana/Elasticsearch and archiving them in HDFS
  • Learn tips and tricks for transporting logs and data in your production environment
  • Understand and configure the Hadoop Distributed File System (HDFS) Sink
  • Use a morphline-backed Sink to feed data into Solr
  • Create redundant data flows using sink groups
  • Configure and use various sources to ingest data
  • Inspect data records and move them between multiple destinations based on payload content
  • Transform data en route to Hadoop and monitor your data flows
