You're reading from Mastering OpenCV 4 with Python

Product typeBook

Published inMar 2019

Reading LevelExpert

PublisherPackt

ISBN-139781789344912

Edition1st Edition

Languages

Python

Tools

OpenCV

Concepts

Computer Vision

Author (1)

Alberto Fernández Villán

Image Processing Techniques

Image processing techniques are the core of your computer vision projects. They can be seen as useful key tools, which you can use to complete various tasks. In other words, image processing techniques are like building blocks that should be kept in mind when processing your images. Therefore, a basic understanding of image processing is required if you are to work with computer vision projects.

In this chapter, you will learn most of the common image processing techniques you need. These will be complemented by the other image processing techniques covered in the next three chapters of this book (histograms, thresholding techniques, contour detection, and filtering).

In this chapter, the following topics will be covered:

Splitting and merging channels
Geometric transformations of images—translation, rotation, scaling, affine transformation...

Technical requirements

The technical requirements for this chapter are listed as follows:

Python and OpenCV
A Python-specific IDE
The NumPy and Matplotlib packages
A Git client

For further details on how to install these requirements, see Chapter 1, Setting Up OpenCV. The GitHub repository, Mastering OpenCV 4 with Python, containing all the supporting project files necessary to work through this book from the first chapter to the last one, can be accessed at: https://github.com/PacktPublishing/Mastering-OpenCV-4-with-Python.

Splitting and merging channels in OpenCV

Sometimes, you have to work with specific channels on multichannel images. To do this, you have to split the multichannel image into several single-channel images. Additionally, once the processing has been done, you may want to create one multichannel image from different single-channel images. In order to both split and merge channels, you can use the cv2.split() and cv2.merge() functions, respectively. The cv2.split() function splits the source multichannel image into several single-channel images. The cv2.merge() function merges several single-channel images into a multichannel image.

In the next example, splitting_and_merging.py, you will learn how to work with these two aforementioned functions. Using the cv2.split() function, if you want to get the three channels from a loaded BGR image, then you should use the following code:

...

Geometric transformations of images

In this section the first, an introduction to the main geometric transformations of images will be covered. We will look at some examples the of scaling, translation, rotation, affine transformation, perspective transform, and cropping of images. The two key functions to perform these geometric transformations are cv2.warpAffine() and cv2.warpPerspective(). The cv2.warpAffine() function transforms the source image by using the following 2 x 3 M transformation matrix:

The cv2.warpPerspective() function transforms the source image using the following 3 x 3 transformation matrix:

In the next subsections, we will learn about the most common geometric transformation techniques, which we will learn more about when we look at the geometric_image_transformations.py script.

...

Image filtering

In this section, we are going to tackle how to blur and sharpen images, applying both several filters and custom-made kernels. Additionally, we will look at some common kernels that we can use to perform other image-processing functionalities.

Applying arbitrary kernels

OpenCV provides the cv2.filter2D() function in order to apply an arbitrary kernel to an image, convolving the image with the provided kernel. In order to see how this function works, we should first build the kernel that we will use later. In this case, a 5 x 5 kernel will be used, as shown in the following code:

kernel_averaging_5_5 = np.array([[0.04, 0.04, 0.04, 0.04, 0.04], [0.04, 0.04, 0.04, 0.04, 0.04], [0.04, 0.04, 0.04, 0.04, 0.04],...

Arithmetic with images

In this section, we will learn about some common arithmetic operations that can be performed on images, such as bitwise operations, addition, and subtraction, among others. In connection with these operations, one key point to take into account is the concept of saturation arithmetic, which is explained in the following subsection.

Saturation arithmetic

Saturation arithmetic is a type of arithmetic operation where the operations are limited to a fixed range by restricting the maximum and minimum values that the operation can take. For example, certain operations on images (for example, color space conversions, interpolation techniques, and so on) can produce values out of the available range. Saturation...

Morphological transformations

Morphological transformations are operations that are normally performed on binary images and based on the image shape. The exact operation is determined by a kernel-structuring element, which decides the nature of the operation. Dilation and erosion are the two basic operators in the area of morphological transformations. Additionally, opening and closing are two important operations, which are derived from the two aforementioned operations (dilation and erosion). Finally, there are three other operations that based on the difference between some of these previous operations.

All of these morphological transformations are described in the following subsections, and the morphological_operations.py script shows the output when applying these transformations to some test images. The key points will also be commented on.

...

Color spaces

In this section, the basics of popular color spaces will be covered. These color spaces are—RGB, CIE L*a*b*, HSL and HSV, and YCbCr.

OpenCV provides more than 150 color-space conversion methods to perform the user's required conversions. In the following example, the conversions are performed from an image loaded in the RGB (BGR in OpenCV) to the other color spaces (for example, HSV, HLS, or YCbCr).

Showing color spaces

The RGB color space is an additive color space, where a specific color is represented by red, green, and blue values. Human vision works in a similar way, so this color space is an appropriate way to display computer graphics.

The CIELAB color space (also known as CIE L*a*b* or simply...

Color maps

In many computer vision applications, the output of your algorithm is a grayscale image. However, human eyes are not good at observing changes in grayscale images. They are more sensitive when appreciating changes in color images, therefore a common approach is to transform (recolor) the grayscale images into a pseudocolor equivalent image.

Color maps in OpenCV

In order to perform this transformation, OpenCV has several color maps to enhance visualization. The cv2.applyColorMap() function applies a color map on the given image. The color_map_example.py script loads a grayscale image and applies the cv2.COLORMAP_HSV color map, as shown in the following code:

img_COLORMAP_HSV = cv2.applyColorMap(gray_img, cv2.COLORMAP_HSV...

Summary

In this chapter, we reviewed most of the common image processing techniques you need in your computer vision projects. In the next three chapters (Chapter 6, Constructing and Building Histograms, Chapter 7, Thresholding Techniques, and Chapter 8, Contour Detection, Filtering, and Drawing), the most common image processing techniques will be reviewed.

In Chapter 6, Constructing and Building Histograms, you will learn how to both create and understand histograms, which are a powerful technique that is used to better understand image content.

Questions

Which function splits a multichannel into several single-channel images?
Which function merges several single-channel images into a multichannel image?
Translate an image 150 pixels in the x direction and 300 pixels in the y direction.
Rotate an image named img by 30 degrees with respect to the center of the image with a scale factor of 1.
Build a 5 x 5 averaging kernel and apply it to an image using cv2.filter2D().
Add 40 to all the pixels in a grayscale image.
Apply the COLORMAP_JET color map to a grayscale image.

Alberto Fernndez Villn is a software engineer with more than 12 years of experience in developing innovative solutions. In the last couple of years, he has been working in various projects related to monitoring systems for industrial plants, applying both Internet of Things (IoT) and big data technologies. He has a Ph.D. in computer vision (2017), a deep learning certification (2018), and several publications in connection with computer vision and machine learning in journals such as Machine Vision and Applications, IEEE Transactions on Industrial Informatics, Sensors, IEEE Transactions on Industry Applications, IEEE Latin America Transactions, and more. As of 2013, he is a registered and active user (albertofernandez) on the Q&A OpenCV forum.
Read more about Alberto Fernández Villán

Other recommended products

Related to this chapter

The Computer Vision Workshop

With The Computer Vision Workshop, you’ll explore the basic and advanced techniques in video and image processing using OpenCV and Python. It is filled with real-world exercises and activities that will make the learning process easy and enjoyable.

BookJul 2020568 pages

OpenCV 3 Computer Vision with Python Cookbook

OpenCV 3 is a native cross-platform library for computer vision, machine learning, and image processing. OpenCV's convenient high-level APIs hide very powerful internals designed for computational efficiency that can take advantage of multicore and GPU processing. This book will help you tackle increasingly challenging computer vision problems by providing a number of recipes that you can use to improve your applications.

BookMar 2018306 pages

Raspberry Pi Computer Vision Programming

You will learn the basics of hardware and software required for image processing and computer vision with Raspberry Pi and Python 3. You will have a look at all the major image processing, manipulation, and computer vision techniques and algorithms in detail using engaging examples. You will build a lot of real-life computer vision applications.

BookJun 2020306 pages5

Hands-On Algorithms for Computer Vision

The field of Computer Vision has seen advancements in terms of processing power and performance. Many algorithms are introduced to perform Computer Vision tasks efficiently. This book is a starting point for anyone interested in this field and wants to dig deeper into the most practical algorithms used by professional Computer Vision developers.

BookJul 2018290 pages

Computer Vision with Python 3

The field of computer vision involves designing and implementing algorithms to understand images and extract meaningful information from them. This book enables you to build real-world applications using Python and open source image processing libraries.

BookAug 2017206 pages

Hands-On GPU-Accelerated Computer Vision with OpenCV and CUDA

This book is a guide to explore how accelerating of computer vision applications using GPUs will help you develop algorithms that work on complex image data in real time. It will solve the problems you face while deploying these algorithms on embedded platforms with the help of development boards from NVIDIA such as the Jetson TX1, Jetson TX2, and Jetson TK1.

BookSep 2018380 pages

OpenCV 3.x with Python By Example

Computer vision is found everywhere in modern technology. OpenCV for Python enables us to run computer vision algorithms in real time. With the advent of powerful machines, we have more processing power to work with. Using this technology, we can seamlessly integrate our computer vision applications into the cloud. Focusing on OpenCV 3.x and Python 3.6, this book will walk you through all the building blocks needed to build amazing computer vision applications with ease.

BookJan 2018268 pages

Mastering OpenCV 4

Mastering OpenCV, now in its third edition, targets computer vision engineers taking their first steps toward mastering OpenCV. Keeping the mathematical formulations to a solid but bare minimum, the book delivers complete projects from ideation to running code, targeting current hot topics in computer vision such as face recognition, landmark detection and pose estimation, and number recognition with deep convolutional networks.

BookDec 2018280 pages

Learning OpenCV 4 Computer Vision with Python 3

Now in its third edition, this is the original book on OpenCV’s Python bindings. Readers will learn a great range of techniques and algorithms, from the classics to the state-of-the-art, and from geometry to machine learning. All of this is in aid of solving practical computer vision problems in well-built applications.

BookFeb 2020372 pages

Hands-On Computer Vision with Julia

This book is a thorough guide for developers who want to get started with building computer vision applications using Julia. Julia is well suited to image processing because of its ease of use and the fact that it lets you write easy-to-compile and efficient machine code.

BookJun 2018202 pages

Learn OpenCV 4 By Building Projects

OpenCV is mainly used in Computer Vision and image processing and is considered to be one of the best open source libraries that helps developers focus on constructing complete projects on image processing, motion detection, and image segmentation. This book will be your guide to understanding the basic OpenCV concepts and algorithms.

BookNov 2018310 pages

OpenCV 4 for Secret Agents

OpenCV 4 for Secret Agents is an updated edition of the book that introduced thousands of developers to cat face detection, real-time Eulerian video magnification, and other scintillating topics in computer vision. Now, Python 3 and Android Studio are supported. With an applied approach and a love of storytelling, the author presents projects that will appeal to all you tinkers, tailors, mad scientists, and spies.

BookApr 2019336 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages

You're reading from Mastering OpenCV 4 with Python

Unlock this book and the full library FREE for 7 days

Author (1)

The Computer Vision Workshop

With The Computer Vision Workshop, you’ll explore the basic and advanced techniques in video and image processing using OpenCV and Python. It is filled with real-world exercises and activities that will make the learning process easy and enjoyable.

OpenCV 3 Computer Vision with Python Cookbook

Raspberry Pi Computer Vision Programming

Hands-On Algorithms for Computer Vision

Computer Vision with Python 3

The field of computer vision involves designing and implementing algorithms to understand images and extract meaningful information from them. This book enables you to build real-world applications using Python and open source image processing libraries.

Hands-On GPU-Accelerated Computer Vision with OpenCV and CUDA

OpenCV 3.x with Python By Example

Mastering OpenCV 4

Learning OpenCV 4 Computer Vision with Python 3

Hands-On Computer Vision with Julia

This book is a thorough guide for developers who want to get started with building computer vision applications using Julia. Julia is well suited to image processing because of its ease of use and the fact that it lets you write easy-to-compile and efficient machine code.

Learn OpenCV 4 By Building Projects

OpenCV 4 for Secret Agents

Et al.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Mastering Tableau 2023

Building AI Applications with ChatGPT APIs

Building AI Applications with ChatGPT APIs

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

Modern Data Architecture on AWS

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

TinyML Cookbook