Apache Accumulo for Developers

Discover how to build Accumulo, Hadoop, and ZooKeeper clusters from scratch on both Windows and Linux. With this book’s examples-based approach, you’ll learn the painless way through clear instructions and real-world exercises.

Guðmundur Jón Halldórsson

Book Details

ISBN 13: 9781783285990
Paperback: 120 pages

Book Description

Accumulo is a sorted, distributed key/value store designed to handle large amounts of data. It is highly robust and scalable, and its performance makes it well suited to real-time data storage. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, ZooKeeper, and Thrift.

Apache Accumulo for Developers is your guide to building an Accumulo cluster, whether single-node or multi-node, on-site or in the cloud. Accumulo has been proven to handle petabytes of data with cell-level security and real-time analysis, and this step-by-step guide shows you how to take full advantage of that power.

Apache Accumulo for Developers looks at the process of setting up three systems (Hadoop, ZooKeeper, and Accumulo) and then configuring, monitoring, and securing them.

You will learn to connect Accumulo to both Hadoop and ZooKeeper. You will also learn how to monitor a single-node or multi-node cluster to find performance bottlenecks, and then integrate it with Amazon EC2, Google Cloud Platform, Rackspace, and Windows Azure. The cloud integration chapters also focus on scripting.

You will also learn to troubleshoot clusters with monitoring tools, and use Accumulo cell-level security to secure your data.

Table of Contents

Chapter 1: Building an Accumulo Cluster from Scratch
  • Necessary requirements
  • Setting up Cygwin
  • Setting up Hadoop
  • Setting up ZooKeeper
  • Setting up and configuring Accumulo
  • Starting the Accumulo cluster
  • Connecting to the Accumulo cluster using Java
  • Summary
Chapter 2: Monitoring and Managing Accumulo
  • Monitoring
  • Elasticity
  • Failover
  • Resource management
  • Summary
Chapter 3: Integrating Accumulo into Various Cloud Platforms
  • Amazon EC2
  • Google Cloud Platform
  • Rackspace
  • Windows Azure
  • Summary
Chapter 4: Optimizing Accumulo Performance
  • Prerequisites
  • Hadoop performance
  • ZooKeeper performance
  • Accumulo performance
  • Summary
Chapter 5: Security
  • Visibility
  • Security expression
  • Authorization
  • User authorizations
  • Handling secure authorization
  • Query Services Layer
  • Summary

What You Will Learn

  • Set up Hadoop, ZooKeeper, and Accumulo
  • Monitor clusters, covering both performance and application logs
  • Secure your data in Accumulo
  • Optimize Hadoop, ZooKeeper, and Accumulo performance
  • Integrate with various cloud platforms
  • Use the Accumulo command-line shell
  • Employ Ganglia to monitor the cluster and Graylog2 to monitor application logs
  • Understand what tools are needed to optimize Accumulo performance
