Packt+ | Advance your knowledge in tech

You're reading from Practical Site Reliability Engineering

Product typeBook

Published inNov 2018

PublisherPackt

ISBN-139781788839563

Edition1st Edition

Tools

Kubernetes Docker

Concepts

Configuration Management

Authors (3):

Pethuru Raj Chelliah

Shreyash Naithani

Shailender Singh

View More author details

Chapter 10. Containers, Kubernetes, and Istio Monitoring

In the cloud world, we need to carry out monitoring to observe the progress and quality of our services and applications over a period of time. Monitoring allows us to keep our applications under systematic review. If something breaks, we want to know what it is and what caused it to malfunction. Monitoring helps us to investigate the failure points in our services. We can make sure that we detect these services early on using anomaly detection. White-box monitoring can help us work out which services are failing and why, and also how to debug them. It can also provide future trends, which means it can detect potential future failures. Here, we will be focusing only on tools that enable us to monitor either our application or our infrastructure:

Monitoring the application: It is very important that the features or services that are being developed are monitored. There should be a proper time-series graph and a dashboard.
Monitoring the...

Prometheus

Prometheus is an open source monitoring tool that was originally built by SoundCloud in 2012, inspired by Google's BrogMon. It is written in GoLang. According to the New Stack Survey of 2017, Prometheus is one of the most widely used tools for monitoring Kubernetes clusters. What makes Prometheus different than other open source monitoring systems is that it has a simple, text-based format, making it easy to get metrics from other systems. It also has a multidimensional data model and a rich and concise query language. Using Prometheus, we can monitor all levels, nodes, container-scheduling systems, and also routers and switches. If we are dealing with large applications and a fast-moving infrastructure, this means that the jobs that we run change rapidly and we have to deploy them around 100 times a day. In this case, Prometheus will be very useful, as it has the ability to discover services. If we have a dynamic infrastructure, we can use Prometheus to detect early failures...

Grafana

Grafana is a widely used open source tool that is used to monitor services and applications by visualizing time-series data. It can tell us how our services or servers are doing by showing us production business metrics. It can carry out both infrastructure monitoring and application monitoring. The official definition of Grafana is as follows:

"It is the analytics platform for all your metrics. Grafana allows you to query, visualize, alert, on and understand your metrics no matter where they are stored. Create, explore, and share dashboards with your team, and foster a data-driven culture."

One of the main reasons why we would use Grafana over Prometheus is to get perfect visualization and dashboard editing. Using Grafana, it is very easy to create a dashboard and customize it. With Prometheus, on the other hand, we would need to make use of console templates to do this, which makes it a little harder to use. Other features of Grafana include the following:

Advanced graphing
Powerful...

Summary

Monitoring is not a one-time task. We should be regularly measuring what's going on with our Kubernetes pods or our microservices. Monitoring plays a crucial role in the microservice system, as we need to monitor all endpoints in our microservices. To achieve a higher quality product, we should be able to detect failures before our customer does. We should enable anomaly detection and notify our operation team to troubleshoot the problem. We have to set up the necessary monitoring and alerts on both the infrastructure side and the application side.In this chapter, we saw how to use Prometheus and Grafana metrics to create powerful dashboards and alerts.

In the next chapter, we will talk about post-production activities and best practices for ensuring and enhancing the IT reliability.

The rest of the chapter is locked

You have been reading a chapter from

Practical Site Reliability Engineering

Published in: Nov 2018Publisher: PacktISBN-13: 9781788839563

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €14.99/month. Cancel anytime

Authors (3)

Pethuru Raj Chelliah

Pethuru Raj Chelliah (PhD) works as the chief architect at the Site Reliability Engineering Center of Excellence, Reliance Jio Infocomm Ltd. (RJIL), Bangalore. Previously, he worked as a cloud infrastructure architect at the IBM Global Cloud Center of Excellence, IBM India, Bangalore, for four years. He also had an extended stint as a TOGAF-certified enterprise architecture consultant in Wipro Consulting services division and as a lead architect in the corporate research division of Robert Bosch, Bangalore. He has more than 17 years of IT industry experience.
Read more about Pethuru Raj Chelliah

Shreyash Naithani

Shreyash Naithani is currently a site reliability engineer at Microsoft R&D. Prior to Microsoft, he worked with both start-ups and mid-level companies. He completed his PG Diploma from the Centre for Development of Advanced Computing, Bengaluru, India, and is a computer science graduate from Punjab Technical University, India. In a short span of time, he has had the opportunity to work as a DevOps engineer with Python/C#, and as a tools developer, site/service reliability engineer, and Unix system administrator. During his leisure time, he loves to travel and binge watch series.
Read more about Shreyash Naithani

Shailender Singh

Shailender Singh is a principal site reliability engineer and a solution architect with around 11 year's IT experience who holds two master's degrees in IT and computer application. He has worked as a C developer on the Linux platform. He had exposure to almost all infrastructure technologies from hybrid to cloud-hosted environments. In the past, he has worked with companies including Mckinsey, HP, HCL, Revionics and Avalara and these days he tends to use AWS, K8s, Terraform, Packer, Jenkins, Ansible, and OpenShift.
Read more about Shailender Singh

Other recommended products

Related to this chapter

Hands-On RESTful API Design Patterns and Best Practices

REST architecture (style) is a pivot of distributed systems, simplify data integration amongst modern and legacy applications leverages through the RESTful paradigm. This book is fully loaded with many RESTful API patterns, samples, hands-on implementations and also discuss the capabilities of many REST API frameworks for Java, Scala, Python and Go

BookJan 2019378 pages

Architectural Patterns

Enterprise Architecture (EA) is typically an aggregate of the business, application, data, and infrastructure architectures of any forward-looking enterprise. Due to constant changes and rising complexities in the business and technology landscapes, producing sophisticated architectures is on the rise. Architectural patterns are gaining a lot of attention these days.

BookDec 2017468 pages

Learning Docker

Docker is an open source containerization engine that offers a simple and faster way for developing and running software. It helps enable flexibility and portability on where the application can run, whether on premises, public cloud, or private cloud. This book will show you the new features of Docker and help you get started with Docker by building and deploying a simple application.

BookMay 2017300 pages

Mastering Service Mesh

Service Mesh helps overcome the operational challenges of connecting, securing, controlling, and observing modern microservices deployment. This book shows you exactly how to use a Service Mesh architecture to manage and operationalize your microservices-based applications.

BookMar 2020626 pages

Hands-On Microservices - Monitoring and Testing

Microservices are the newest way of developing web applications. Once you've started down the microservice path, how do you make sure that your applications are still fully tested? This book focuses on the number of approaches for managing the additional testing complexity of multiple independently deployable components.

BookOct 2018160 pages5

Java EE 8 Microservices

There is a shift from monolithic applications to microservice-based ones as cloud-based applications are increasingly in demand. With this book, you will get to know Java EE 8's components and how they are used to implement microservices.

BookDec 2018260 pages

Getting Started with Kubernetes

Kubernetes has continued to grow and achieve broad adoption across various industries, helping you to orchestrate and automate container deployments on a massive scale. This book will give you a complete understanding of Kubernetes and how to get a cluster up and running.

BookOct 2018470 pages

Spring 5.0 Microservices

The Spring Framework is an application framework and inversion of the control container for the Java platform. Spring 5.0 is due to arrive with a myriad of new and exciting features. Written to the latest specifications of Spring, this book will help you implement the microservice architecture in Spring Framework, Spring Boot, and Spring Cloud.

BookJul 2017414 pages

Microservices with Azure

Microsoft Azure is rapidly evolving and is widely used as a platform on which you can build Microservices that can be deployed on heterogeneous environments using Microsoft Azure Service Fabric. This book will help you understand the concepts of the Microservice application architecture and help you build highly maintainable and scalable enterprise-grade applications using Microsoft Azure Service Fabric.

BookJun 2017360 pages

TypeScript Microservices

Microservices has evolved as one of the most tangible solutions to make effective and scalable applications. Due to its evolution from ES5 to ES6 stack, Typescript has become one of the most de facto solutions. This book will help you leverage microservices’ power to build robust architecture using reactive programming and Typescript in Node.js.

BookMay 2018404 pages

Architecting Cloud Computing Solutions

Cloud adoption is a core component of digital transformation. Scaling the IT environment, making it resilient, and reducing costs are what organizations want. Architecting Cloud Computing Solutions presents and explains the critical Cloud solution design considerations and technology decisions required to choose and deploy the right Cloud service and deployment models based on your business and technology service requirements.

BookMay 2018378 pages

Cloud-Native Applications in Java

Businesses today are evolving so rapidly that they are resorting to the elasticity of the cloud to provide a platform to build and deploy their highly scalable applications. This means developers now are faced with the challenge of building build applications that are native to the cloud. For this, they need to be aware of the environment, tools, and resources they’re coding against. If you’re a Java developers who wants to build secure, resilient, robust, and scalable applications that are targeted for cloud-based deployment, this is the book for you.

BookFeb 2018406 pages

Personalised recommendations for you

Based on your interests and search pattern

Designing and Implementing Microsoft Azure Networking Solutions

Designing and Implementing Microsoft Azure Networking Solutions Exam Ref AZ-700 is an all-encompassing guide to the AZ-700 exam and contains all the information you need to succeed in the world of virtual networking with Azure. With this book, you will be fully prepared for the exam and the world of cloud networking.

BookAug 2023524 pages

Microsoft 365 Security, Compliance, and Identity Administration

The Microsoft 365 Security, Compliance, and Identity Administration is a comprehensive guide that helps you employ Microsoft 365's robust suite of features and empowers you to optimize your administrative tasks.

BookAug 2023630 pages

Zero Trust Overview and Playbook Introduction

Get started on Zero Trust with this step-by-step playbook and learn everything you need to know for a successful Zero Trust journey with tailored guidance for every role, covering strategy, operations, architecture, implementation, and measuring success. This book will become an indispensable reference for everyone in your organization.

BookOct 2023240 pages

The Self-Taught Cloud Computing Engineer

This self-study book helps you master multiple clouds, including AWS, Azure, and GCP, and serves as a roadmap to becoming a certified cloud computing expert. The book will guide you to develop a professional cloud career by helping you build a broad cloud knowledge base, developing hands-on cloud computing skills, and getting cloud certified.

BookSep 2023472 pages

Technology Operating Models for Cloud and Edge

This book will help you build and create ownership of a technology operating model, as well as connect your leadership with engineering and operations, keeping your internal and external customers in mind. It provides practical tips on why, where, and how to make the cloud and edge platform paradigm sing for you, your team, and your organization.

BookAug 2023228 pages

Azure Architecture Explained

Azure is the preferred platform to build mission-critical and secure apps. This book provides comprehensive coverage of essential Azure products, services, and solutions vital for every solution architect's success. Elevate your knowledge and master the critical components of Azure to excel in your role with Azure Architecture Explained.

BookSep 2023446 pages

Pentesting Active Directory and Windows-based Infrastructure

This practical guide helps you explore the pentesting of Microsoft infrastructure in detail, and enhances your offensive skillset by showing you the different ways to perform security assessment. This book will help blue teamers and IT engineers get up to speed with possible security issues they may encounter in their Windows environments.

BookNov 2023360 pages

Practical Ansible

In Practical Ansible, you'll work with the latest release of Ansible and learn to solve complex issues quickly with the help of task-oriented scenarios. You'll start by installing and configuring Ansible to automate monotonous and repetitive IT tasks and get to grips with concepts such as playbooks, inventories, plugins, collections, and network modules.

BookSep 2023420 pages

Windows 11 for Enterprise Administrators

Microsoft’s launch of Windows 11 is a step toward satisfying the enterprise administrator’s needs for better management and enhanced user experience customization. This book provides the enterprise administrator with the knowledge needed to fully utilize the advanced feature set of Windows 11 Enterprise.

BookOct 2023286 pages

The Linux DevOps Handbook

This book is for software and IT professionals seeking knowledge on Linux systems and DevOps practices. This book will provide you with guidance and tools to learn and gain proficiency in managing Linux-based infrastructures and knowledge of DevOps.

BookNov 2023428 pages2