Packt+ | Advance your knowledge in tech

You're reading from Mastering Mesos

Product typeBook

Published inMay 2016

PublisherPackt

ISBN-139781785886249

Edition1st Edition

Tools

Mesos

Concepts

Data Processing

Authors (2):

Dipa Dubhashi

Akhil Das

View More author details

Chapter 9. Mesos Big Data Frameworks 2

This chapter is a guide to deploying important big data storage frameworks, such as Cassandra, the Elasticsearch-Logstash-Kibana (ELK) stack, and Kafka, on Mesos.

Cassandra on Mesos

This section will introduce Cassandra and explain how to set up Cassandra on Mesos while also discussing the problems commonly encountered during the setup process.

Introduction to Cassandra

Cassandra is an open source, scalable NoSQL database that is fully distributed with no single point of failure and is highly performant for most standard use cases. It is both horizontally as well as vertically scalable. Horizontal scalability or scale-out solution involves adding more nodes with commodity hardware to the existing cluster while vertical scalability or scale-up solution means adding more CPU and memory resources to a node with specialized hardware.

Cassandra was developed by Facebook engineers to address the inbox search use case and was inspired by Google Bigtable, which served as the foundation for its storage model, and Amazon DynamoDB, which was the foundation of its distribution model. It was open sourced in 2008 and became an Apache top-level project in early 2010...

The Elasticsearch-Logstash-Kibana (ELK) stack on Mesos

This section will introduce the Elasticsearch-Logstash-Kibana (ELK) stack and explain how to set it up on Mesos while also discussing the problems commonly encountered during the setup process.

Introduction to Elasticsearch, Logstash, and Kibana

The ELK stack, a combination of Elasticsearch, Logstash, and Kibana, is an end-to-end solution for log analytics. Elasticsearch provides search capabilities, Logstash is a log management software, while Kibana serves as the visualization layer. The stack is commercially backed by a company called Elastic.

Elasticsearch

Elasticsearch is a Lucene-based open source distributed search engine designed for high scalability and fast search query response time. It simplifies the usage of Lucene, a highly performant search engine library, by providing a powerful REST API on top. Some of the important concepts in Elasticsearch are highlighted as follows:

Document: This is a JSON object stored in an index...

Kafka on Mesos

This section will introduce Kafka and explain how to set it up on Mesos while also discussing the problems commonly encountered during the setup process.

Introduction to Kafka

Kafka is a distributed publish-subscribe messaging system designed for speed, scalability, reliability, and durability. Some of the key terms used in Kafka are given as follows:

Topics: These are the categories where message feeds are maintained by Kafka
Producers: These are the upstream processes that send messages to a particular Kafka topic
Consumers: These are the downstream processes that listen to the incoming messages in a topic and process them as per requirements
Broker: Each node in a Kafka cluster is called a broker

Take a look at the following high-level diagram of Kafka (source: http://kafka.apache.org/documentation.html#introduction):

A partitioned log is maintained by the Kafka cluster for every topic, which looks similar to the following (source: http://kafka.apache.org/documentation.html...

Summary

This chapter introduced the reader to some important big data storage frameworks such as Cassandra, the ELK stack, and Kafka and covered topics such as the setup, configuration, and management of these frameworks on a distributed infrastructure using Mesos.

I hope that this book has armed you with all the resources that you require to effectively manage the complexities of today's modern datacenter requirements. By following the detailed step-by-step guides to deploy a Mesos cluster using the DevOps tool of your choice, you should now be in a position to handle the system administration requirements of your organization smoothly.

The rest of the chapter is locked

You have been reading a chapter from

Mastering Mesos

Published in: May 2016Publisher: PacktISBN-13: 9781785886249

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (2)

Dipa Dubhashi

Dipa Dubhashi is an alumnus of the prestigious Indian Institute of Technology and heads product management at Sigmoid. His prior experience includes consulting with ZS Associates besides founding his own start-up. Dipa specializes in envisioning enterprise big data products, developing their roadmaps, and managing their development to solve customer use cases across multiple industries. He advises several leading start-ups as well as Fortune 500 companies about architecting and implementing their next-generation big data solutions. Dipa has also developed a course on Apache Spark for a leading online education portal and is a regular speaker at big data meetups and conferences.
Read more about Dipa Dubhashi

Akhil Das

Akhil Das is a senior software developer at Sigmoid primarily focusing on distributed computing, real-time analytics, performance optimization, and application scaling problems using a wide variety of technologies such as Apache Spark and Mesos, among others. He contributes actively to the Apache Spark project and is a regular speaker at big data conferences and meetups, MesosCon 2015 being the most recent one.
Read more about Akhil Das

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages