Chapter 6. Background Processing in Kubernetes

Kubernetes includes support for one-off (also known as batch) computation work, as well as for common asynchronous background-work use cases. In this chapter, we look at the Kubernetes concept of a job and its neighbor, the CronJob. We also look at how Kubernetes handles and supports persistence, and some of the options available within Kubernetes. We then look at how Kubernetes can support asynchronous background tasks, and the ways those can be represented, operated, and tracked by Kubernetes. Finally, we go over how to set up worker code operating from a message queue.

Topics covered in this chapter include:

  • Job
  • CronJob
  • A worker queue example with Python and Celery
  • Persistence with Kubernetes
  • Stateful Sets
  • Custom Resource Definitions (CRDs)

Job


Most of what we have covered so far has focused on continuous, long-running processes. Kubernetes also supports shorter, discrete runs of software. A job in Kubernetes is focused on a discrete run that is expected to end within a reasonably well-known timeframe and to report success or failure. Jobs build upon the same constructs as long-running software: they use the pod specification at their heart, and add the concept of tracking the number of successful completions.

The simplest use case is to run a single pod to completion, letting Kubernetes handle any failures due to a node failing or rebooting. The two optional settings you can use with jobs are parallelism and completions. Without specifying parallelism, the default is 1, and only one pod will be run at a time. You can specify both values as integers to run several pods in parallel and achieve multiple completions, and you can leave completions unset if the job is working from a work queue of some...
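To make this concrete, the following is a minimal sketch of a Job manifest; the name and the busybox command are placeholders standing in for your own workload. It asks for five successful completions, running at most two pods at a time:

    apiVersion: batch/v1
    kind: Job
    metadata:
      name: echo-batch
    spec:
      # run at most two pods at a time until five completions are recorded
      parallelism: 2
      completions: 5
      template:
        spec:
          containers:
          - name: worker
            image: busybox
            command: ["sh", "-c", "echo processing one work item && sleep 5"]
          restartPolicy: Never

Kubernetes creates pods from the template until the requested number of completions has succeeded, replacing any pods that fail along the way.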

CronJob


CronJobs are an extension that builds on jobs, allowing you to specify a recurring schedule on which they run. The name comes from cron, the common Linux utility for scheduling recurring scripts. CronJobs were alpha as of Kubernetes version 1.7, moved to beta in version 1.8, and remain in beta as of version 1.9. Remember that Kubernetes specifications may change, but beta resources tend to be fairly solid and have expected utility, so while the v1 release of CronJobs may differ, you can likely expect it to be pretty close to what's available as of this writing.

The specification is closely related to a job's, the primary differences being that the kind is CronJob and that there is a required schedule field, which takes a string representing the timing for running the job.

The format for this string is five space-separated fields, and wildcards can be used. The fields represent:

  • Minute (0–59)
  • Hour (0–23)
  • Day of Month (1–31)
  • Month (1–12)
  • Day of Week (0–6)

A * or ? character can be used in any of these fields...
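As an illustration, here is a minimal sketch of a CronJob that runs at the top of every hour. It uses the batch/v1beta1 API version that is current (in beta) as of this writing, and the busybox command is again just a placeholder:

    apiVersion: batch/v1beta1
    kind: CronJob
    metadata:
      name: hourly-task
    spec:
      # minute 0 of every hour, on every day, in every month
      schedule: "0 * * * *"
      jobTemplate:
        spec:
          template:
            spec:
              containers:
              - name: task
                image: busybox
                command: ["sh", "-c", "date; echo running the scheduled task"]
              restartPolicy: OnFailure

Each time the schedule fires, the controller creates a job from jobTemplate, and that job runs exactly like the jobs described earlier.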

A worker queue example with Python and Celery


While a CronJob is well positioned to run repeated tasks on a specific schedule, another common need is to process a series of work items more or less constantly. A job is well suited to running a single task until it completes, but if the volume of items you need to process is large enough, it may be far more effective to maintain a constant set of processes working on those items.

A common pattern to accommodate this kind of work uses a message queue.

With a message queue, you can have an API frontend that creates the work to be run asynchronously and places it on a queue, and then have a number of worker processes pull from the queue to do the relevant work. Amazon has a web-based service supporting exactly this pattern of processing, called Simple Queue Service (SQS). A huge benefit of this pattern is that it decouples the workers from the requests, so you can scale each of those pieces independently, as required.
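To give a feel for the worker side of this pattern, here is a minimal sketch of a deployment of Celery workers. The image name is hypothetical, standing in for an image that contains your Celery application and a tasks module, and the broker URL assumes a RabbitMQ service named rabbitmq in the same namespace:

    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: celery-worker
    spec:
      replicas: 3
      selector:
        matchLabels:
          app: celery-worker
      template:
        metadata:
          labels:
            app: celery-worker
        spec:
          containers:
          - name: worker
            # hypothetical image containing the Celery app and its tasks module
            image: example/celery-worker:latest
            command: ["celery", "-A", "tasks",
                      "-b", "amqp://guest:guest@rabbitmq:5672//",
                      "worker", "--loglevel=info"]

Because the workers talk only to the queue, you can scale this deployment up or down independently of the frontend that enqueues the work.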

You can do exactly the same...

Persistence with Kubernetes


So far, all our examples, and even code, have been essentially stateless. In the last chapter, we introduced a container using Redis, but didn't specify anything special for it. By default, Kubernetes will assume any resources associated with a pod are ephemeral, and if the node fails, or a deployment is deleted, all the associated resources can and will be deleted with it.

That said, almost all the work we do requires storing and maintaining state somewhere—a database, an object store, or even a persistent, in-memory queue. Kubernetes includes support for persistence, and as of this writing, it's still changing and evolving fairly rapidly.

Volumes

The earliest support in Kubernetes was for volumes, which can be defined by a cluster administrator. We have already seen one variation of this construct back in Chapter 4, Declarative Infrastructure, where configuration was exposed into a container using the Downward API.
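Concretely, a volume is declared at the pod level and then mounted into one or more containers at a path. As a minimal sketch, using the simple emptyDir type purely for illustration, this pod gets scratch space that lasts only as long as the pod itself:

    apiVersion: v1
    kind: Pod
    metadata:
      name: volume-example
    spec:
      containers:
      - name: app
        image: busybox
        command: ["sh", "-c", "echo hello > /scratch/greeting && sleep 3600"]
        volumeMounts:
        # the volume defined below is mounted into the container at this path
        - name: scratch
          mountPath: /scratch
      volumes:
      # emptyDir provides scratch space that is removed when the pod goes away
      - name: scratch
        emptyDir: {}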

Another kind of volume that can be easily...

Stateful Sets


Following dynamic provisioning, as you think about persistence systems (whether they are classic databases, key-value stores, memory caches, or document-based datastores), it is common to want some manner of redundancy and failover. ReplicaSets and deployments go a fair way toward supporting that capability, especially with persistent volumes, but these systems would benefit greatly from fuller integration with Kubernetes, so that Kubernetes can handle their life cycle and coordination. A starting point for this effort is Stateful Sets, which act similarly to a deployment and ReplicaSet in that they manage a group of pods.

Stateful Sets differ from those other systems in that they also give each pod a stable, unique identity and support specific ordered scaling, both up and down. Stateful Sets are relatively new in Kubernetes, first appearing in version 1.5 and moving to stable in version 1.9...
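As a flavor of what this looks like in practice (not the book's Node.js example, just a generic sketch using Redis for illustration), a Stateful Set ties each pod to a stable name and, through volumeClaimTemplates, to its own persistent volume:

    apiVersion: apps/v1
    kind: StatefulSet
    metadata:
      name: redis
    spec:
      serviceName: redis
      replicas: 3
      selector:
        matchLabels:
          app: redis
      template:
        metadata:
          labels:
            app: redis
        spec:
          containers:
          - name: redis
            image: redis:4
            ports:
            - containerPort: 6379
            volumeMounts:
            - name: data
              mountPath: /data
      # each replica gets its own PersistentVolumeClaim: data-redis-0, data-redis-1, ...
      volumeClaimTemplates:
      - metadata:
          name: data
        spec:
          accessModes: ["ReadWriteOnce"]
          resources:
            requests:
              storage: 1Gi

The pods are created in order as redis-0, redis-1, and redis-2, and scaled down in reverse order, with each pod keeping its claim and identity across restarts.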

Custom Resource Definition


Stateful Sets don't automatically suit all the persistent stores that are available, and some of those stores have even more complex logic requirements for managing the application's life cycle. As Kubernetes looked at how to extend its controllers to support more complex logic, the project started with the idea of Operators, external code that runs alongside Kubernetes, and as of Kubernetes 1.8 has evolved to make this more explicit with CustomResourceDefinitions. A custom resource extends the Kubernetes API and allows custom API objects to be created and matched with a custom controller, which you can also load into Kubernetes to handle the life cycle of those objects.
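Just to give a flavor of what a definition looks like, here is a minimal sketch; the group and names are hypothetical, and the apiextensions.k8s.io/v1beta1 API version reflects what is available as of this writing:

    apiVersion: apiextensions.k8s.io/v1beta1
    kind: CustomResourceDefinition
    metadata:
      # the name must be <plural>.<group>
      name: backups.example.com
    spec:
      group: example.com
      version: v1
      scope: Namespaced
      names:
        plural: backups
        singular: backup
        kind: Backup

Once this is created, the API server accepts objects of kind Backup, and a custom controller that you deploy alongside it watches for those objects and manages their life cycle.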

Note

Custom Resource Definitions go beyond the scope of what we will cover in this book, although you should be aware that they exist. You can get more details about Custom Resource Definitions and how to extend Kubernetes at the project's documentation site: https:...

Summary


In this chapter, we reviewed jobs and CronJobs, which Kubernetes provides to support batch and scheduled batch processing, respectively. We also walked through a Python example of how to set up a Celery worker queue with RabbitMQ and configure the two deployments to work together. We then looked at how Kubernetes provides persistence with volumes, PersistentVolumes, and PersistentVolumeClaims, which can automatically provision volumes for deployments as needed. Kubernetes also supports Stateful Sets, a variation of a deployment for workloads that require stable identity and persistent volumes, and we looked at a simple Node.js example converting our previous deployment example into a Stateful Set. We finished the chapter with a look at Custom Resource Definitions, which are used to extend Kubernetes.

In the next chapter, we start to look at how to leverage Kubernetes to get information about all these structures. We review how to capture and view metrics, leveraging Kubernetes and additional...
