Learning Cassandra for Administrators

Understand the immense capabilities of Cassandra in managing large amounts of data and learn how to ensure that data is always available. This practical, hands-on guide takes you through every stage from installation to performance tuning.

Learning Cassandra for Administrators

Vijay Parthasarathy

1 customer reviews
Understand the immense capabilities of Cassandra in managing large amounts of data and learn how to ensure that data is always available. This practical, hands-on guide takes you through every stage from installation to performance tuning.
Mapt Subscription
FREE
$29.99/m after trial
eBook
$9.00
RRP $17.99
Print + eBook
$29.99
RRP $29.99
What do I get with a Mapt Pro subscription?
  • Unlimited access to all Packt’s 5,000+ eBooks and Videos
  • Early Access content, Progress Tracking, and Assessments
  • 1 Free eBook or Video to download and keep every month after trial
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
$0.00
$9.00
$29.99
$29.99p/m after trial
RRP $17.99
RRP $29.99
Subscription
eBook
Print + eBook
Start 30 Day Trial
Subscribe and access every Packt eBook & Video.
 
  • 5,000+ eBooks & Videos
  • 50+ New titles a month
  • 1 Free eBook/Video to keep every month
Start Free Trial
 
Code Files
Preview in Mapt

Book Details

ISBN 139781782168171
Paperback120 pages

Book Description

Apache Cassandra is a massively scalable open source NoSQL database. Cassandra is perfect for managing large amounts of structured, semi-structured, and unstructured data across multiple data centers and the cloud. Cassandra delivers linear scalability and performance across many commodity servers with no single point of failure.

This book starts by explaining how to derive the solution, basic concepts, and CAP theorem. You will learn how to install and configure a Cassandra cluster as well as tune the cluster for performance. After reading the book, you should be able to understand why the system works in a particular way, and you will also be able to find patterns (and/or use cases) and anti-patterns which would potentially cause performance degradation. Furthermore, the book explains how to configure Hadoop, vnodes, multi-DC clusters, enabling trace, enabling various security features, and querying data from Cassandra.

Starting with explaining about the trade-offs, we gradually learn about setting up and configuring high performance clusters. This book will help the administrators understand the system better by understanding various components in Cassandra’s architecture and hence be more productive in operating the cluster. This book talks about the use cases and problems, anti-patterns, and potential practical solutions as opposed to raw techniques. You will learn about kernel and JVM tuning parameters that can be adjusted to get the maximum use out of system resources.

Table of Contents

Chapter 1: Basic Concepts and Architecture
CAP theorem
BigTable / Log-structured data model
Partitioning and replication Dynamo style
Summary
Chapter 2: Installing Cassandra
Memory, CPU, and network requirements
Cassandra in-memory data structures
Downloading/choosing binaries to install
Cassandra on EC2 instance
Create a keyspace
Summary
Chapter 3: Inserting Data and Manipulating Data
Querying data
Tracing
Data modeling
Summary
Chapter 4: Administration and Large Deployments
Manual repair
Bootstrapping
Monitoring tools
Summary
Chapter 5: Performance Tuning
vmstat
iostat
dstat
Garbage collection
Tuning memtables
Compaction tuning
Compression
Summary
Chapter 6: Analytics
Hadoop integration
Summary
Chapter 7: Security and Troubleshooting
Encryption
Audit
Things to look out for
Summary

What You Will Learn

  • Explore trade-offs and basic concepts
  • Install Cassandra, choose hardware, and configure the cluster
  • Query and insert data and CQL
  • Get to grips with performance tuning
  • Find out about Hadoop integration and evolving apps
  • Discover anti-patterns and how to secure your cluster

Authors

Table of Contents

Chapter 1: Basic Concepts and Architecture
CAP theorem
BigTable / Log-structured data model
Partitioning and replication Dynamo style
Summary
Chapter 2: Installing Cassandra
Memory, CPU, and network requirements
Cassandra in-memory data structures
Downloading/choosing binaries to install
Cassandra on EC2 instance
Create a keyspace
Summary
Chapter 3: Inserting Data and Manipulating Data
Querying data
Tracing
Data modeling
Summary
Chapter 4: Administration and Large Deployments
Manual repair
Bootstrapping
Monitoring tools
Summary
Chapter 5: Performance Tuning
vmstat
iostat
dstat
Garbage collection
Tuning memtables
Compaction tuning
Compression
Summary
Chapter 6: Analytics
Hadoop integration
Summary
Chapter 7: Security and Troubleshooting
Encryption
Audit
Things to look out for
Summary

Book Details

ISBN 139781782168171
Paperback120 pages
Read More
From 1 reviews

Read More Reviews

Recommended for You