You're reading from Mastering Mesos
This section introduces Cassandra, explains how to set it up on Mesos, and discusses the problems commonly encountered during the setup process.
Cassandra is an open source, scalable NoSQL database that is fully distributed, has no single point of failure, and is highly performant for most standard use cases. It scales both horizontally and vertically. Horizontal scaling, or scaling out, means adding more nodes built on commodity hardware to the existing cluster, while vertical scaling, or scaling up, means adding more CPU and memory resources to a node, typically with specialized hardware.
Cassandra was developed by Facebook engineers to address the inbox search use case. It was inspired by Google Bigtable, which served as the foundation for its storage model, and Amazon Dynamo, which was the foundation of its distribution model. It was open sourced in 2008 and became an Apache top-level project in early 2010...
This section introduces the Elasticsearch-Logstash-Kibana (ELK) stack, explains how to set it up on Mesos, and discusses the problems commonly encountered during the setup process.
The ELK stack, a combination of Elasticsearch, Logstash, and Kibana, is an end-to-end solution for log analytics. Elasticsearch provides the search capabilities, Logstash handles log management, and Kibana serves as the visualization layer. The stack is commercially backed by Elastic.
Elasticsearch is a Lucene-based open source distributed search engine designed for high scalability and fast search query response time. It simplifies the usage of Lucene, a highly performant search engine library, by providing a powerful REST API on top. Some of the important concepts in Elasticsearch are highlighted as follows:
This section introduces Kafka, explains how to set it up on Mesos, and discusses the problems commonly encountered during the setup process.
Kafka is a distributed publish-subscribe messaging system designed for speed, scalability, reliability, and durability. Some of the key terms used in Kafka are given as follows:
Take a look at the following high-level diagram of Kafka (source: http://kafka.apache.org/documentation.html#introduction):
A partitioned log is maintained by the Kafka cluster for every topic, which looks similar to the following (source: http://kafka.apache.org/documentation.html...
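The partitioned-log idea can also be pictured in code. The following is a minimal in-memory sketch, not Kafka's implementation: the class name `PartitionedLog`, its methods, and the CRC32-based key-to-partition mapping are all assumptions made for illustration (Kafka's own default partitioner uses a different hash). It shows only the essentials: each partition is an append-only sequence, every record gets a sequential offset within its partition, and records with the same key land in the same partition.

```python
import zlib
from typing import List, Tuple


class PartitionedLog:
    """Toy model of one topic's partitioned log (illustrative only)."""

    def __init__(self, num_partitions: int) -> None:
        if num_partitions < 1:
            raise ValueError("num_partitions must be at least 1")
        # One append-only list of records per partition.
        self.partitions: List[List[bytes]] = [[] for _ in range(num_partitions)]

    def partition_for(self, key: bytes) -> int:
        # Assumption: CRC32 modulo the partition count stands in for
        # whatever hash the real partitioner uses.
        return zlib.crc32(key) % len(self.partitions)

    def append(self, key: bytes, value: bytes) -> Tuple[int, int]:
        """Append a record and return (partition, offset)."""
        partition = self.partition_for(key)
        self.partitions[partition].append(value)
        # The offset is the record's position within its partition.
        return partition, len(self.partitions[partition]) - 1

    def read(self, partition: int, offset: int, max_records: int = 10) -> List[bytes]:
        """Return up to max_records records starting at the given offset."""
        return self.partitions[partition][offset:offset + max_records]
```

In this sketch a consumer would simply remember the last offset it read for each partition and call `read` again from the next one, which mirrors how ordering is guaranteed within a partition but not across partitions.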
This chapter introduced several important big data storage frameworks, namely Cassandra, the ELK stack, and Kafka, and covered their setup, configuration, and management on a distributed infrastructure using Mesos.
I hope that this book has armed you with all the resources that you require to effectively manage the complexities of the modern datacenter. By following the detailed step-by-step guides to deploy a Mesos cluster using the DevOps tool of your choice, you should now be in a position to handle the system administration requirements of your organization smoothly.