You're reading from Raspberry Pi Computer Vision Programming. - Second Edition

Product typeBook

Published inJun 2020

Reading LevelBeginner

PublisherPackt

ISBN-139781800207219

Edition2nd Edition

Languages

Python

Tools

OpenCV

Concepts

Computer Vision

Author (1)

Ashwin Pajankar

Chapter 5: Basics of Image Processing

In the previous chapter, we learned about and demonstrated various ways to capture images and videos for image processing and computer vision applications. We learned how to use Command Prompt and Python 3 programming extensively to read images and to interface with the USB webcam and the Raspberry Pi camera module.

In this chapter, we will look at how to perform basic arithmetic and logical operations on images with NumPy, OpenCV, and matplotlib. We will also learn about different color channels and image properties in detail.

The following is a list of the topics that will be covered in this chapter:

Retrieving image properties
Basic operations on images
Arithmetic operations on images
Blending and transitioning images
Multiplying images by constants and one another
Creating a negative of an image
Bitwise logical operations on images

This chapter has a lot of hands-on exercises that use Python 3...

Technical requirements

The code files of this chapter can be found on GitHub at https://github.com/PacktPublishing/raspberry-pi-computer-vision-programming/tree/master/Chapter05/programs.

Check out the following video to see the Code in Action at https://bit.ly/2V8vzev.

Retrieving image properties

We can retrieve and use many properties, such as the data type, the dimensions, the shape, and the size of bytes of an image with NumPy. Open the Python 3 interpreter by running the python3 command in the command prompt. Then, run the following statements one by one:

>>> import cv2
>>> img = cv2.imread('/home/pi/book/dataset/4.1.01.tiff', 0)
>>> print(type(img))

The following is the output of these statements:

<class 'numpy.ndarray'>

The preceding output confirms that the OpenCV imread() function read an image and stored it in NumPy's ndarray format. The following statement prints dimensions of the image it read:

>>> print(img.ndim)
2

The image is read in grayscale mode, which is why it is a two-dimensional image. It just has a single channel composed of intensities of grayscale. Now, let's see its shape:

>>> print(img.shape)
(256, 256)

The preceding...

Basic operations on images

Let's perform a few basic operations, such as splitting and combining the channels of a color image and adding a border to an image. We will continue this demonstration in interactive mode. Let's import OpenCV and read a color image, as follows:

>>> import cv2
>>> img = cv2.imread('/home/pi/book/dataset/4.1.01.tiff', 1)

For any image, the origin—the (0, 0) pixel—is the pixel at the upper-left corner. We can retrieve the intensity values for all the channels by running the following statement:

>>> print(img[10, 10])
[34 38 44]

These are the intensity values of the blue, green, and red channels, respectively, for pixel (10, 10). If you only want to access an individual channel for a pixel, then run the following statement:

>>> print(img[10, 10, 0])
34

The preceding output, 34, is the intensity of the blue channel. Similarly, we can access the green and red channels with...

Arithmetic operations on images

We know that images are nothing but NumPy ndarrays and we can perform arithmetic operations on images just as we can perform them on ndarrays. If we know how to apply numerical or arithmetic operations to matrices, then we should not have any trouble doing the same when the operands for those operations are images. Images must be of the same size and must have the same number of channels for us to perform arithmetic operations on them, and these operations are performed on individual pixels. There are many arithmetic operations, such as addition and subtraction. The first is the addition operation. We can add two images by using either the NumPy Addition or the add() function in OpenCV, as follows:

import cv2
img1 = cv2.imread('/home/pi/book/dataset/4.2.03.tiff', 1)
img2 = cv2.imread('/home/pi/book/dataset/4.2.05.tiff', 1)
cv2.imshow('NumPy Addition', img1 + img2 )
cv2.imshow('OpenCV Addition', cv2.add(img1...

Blending and transitioning images

The cv2.addWeighted() function computes the weighted sum of the two images that we pass in as arguments. This causes them to blend. The following is some code that demonstrates this concept of blending:

import cv2
img1 = cv2.imread('/home/pi/book/dataset/4.2.03.tiff', 1)
img2 = cv2.imread('/home/pi/book/dataset/4.2.05.tiff', 1)
cv2.imshow('Blended Image',
           cv2.addWeighted(img1, 0.5, img2, 0.5, 0))
cv2.waitKey(0)
cv2.destroyAllWindows()

In the preceding code, we are passing the following five arguments to the OpenCV cv2.addWeighted() function:

img1: The first image
alpha: The coefficient for the first image (0.5 in the preceding example)
img2: The second image
beta: The coefficient for the second image (0.5 in the preceding example)
gamma: The scalar value (0 in the preceding example)

OpenCV uses the following formula to compute the output image:

output image = (alpha *...

Multiplying images by a constant and one another

Just like normal matrices or NumPy ndarrays, images can be multiplied by a constant and with one another. We can multiply an image by a constant, as follows:

import cv2
img1 = cv2.imread('/home/pi/book/dataset/4.2.03.tiff', 1)
img2 = cv2.imread('/home/pi/book/dataset/4.2.05.tiff', 1)
cv2.imshow('Image1', img1 * 2)
cv2.waitKey(0)
cv2.destroyAllWindows()

In the preceding code, every element of the ndarray representing the image is multiplied by 2. Run the preceding program and see the output. We can also multiply images with one another, as follows:

cv2.imshow('Image1', img1 * 2)

The result is likely to look like noise.

Creating a negative of an image

In terms of pure mathematics, when we invert the colors of an image, it creates a negative of the image. This inversion operation can be computed by subtracting the color of a pixel from 255. If it is a color image, we invert the color of all the planes. For a grayscale image, we can directly compute the inversion by subtracting it from 255, as follows:

import cv2
img = cv2.imread('/home/pi/book/dataset/4.2.07.tiff', 0)
negative = abs(255 - img)
cv2.imshow('Grayscale', img)
cv2.imshow('Negative', negative)
cv2.waitKey(0)
cv2.destroyAllWindows()

The following is the output of this:

Figure 5.6 – A negative of an image

Try to find the negative of a color image, we just need to read the image in color mode in the preceding program.

Note:

The negative of a negative will be the original grayscale image. Try this on your own by computing the negative of the negative again for our...

Bitwise logical operations on images

The OpenCV library has many functions for computing bitwise logical operations on images. We can compute bitwise logical AND, OR, XOR (exclusive OR), and NOT (inversion) operations. The best way to demonstrate how these functions work is to use them with binary (black and white) images:

import cv2
import numpy as np
import matplotlib.pyplot as plt
a = [0, 255, 0]
img1 = np.array([a, a, a], dtype=np.uint8)
img2 = np.transpose(img1)
not_out = cv2.bitwise_not(img1 )
and_out = cv2.bitwise_and(img1, img2)
or_out = cv2.bitwise_or(img1, img2)
xor_out = cv2.bitwise_xor(img1, img2)
titles = ['Image 1', 'Image 2', 'Image 1 NOT', 'AND', 'OR', 'XOR']
images = [img1, img2, not_out, and_out, or_out, xor_out]
for i in range(6):
        plt.subplot(2, 3, i+1)
        plt.imshow(images[i], cmap='gray')
        plt.title(titles[i])
        plt.axis('off')
plt.show()

We created our...

Summary

In this chapter, we started by looking at image processing with OpenCV and NumPy. We learned about some important concepts, such as image channels, arithmetic and logical operations, and the negative of an image. Along the way, we also learned to use a bit more functionality in Python 3 and the NumPy library. The bitwise logical operations that we learned today will be very useful when writing programs for the functionality of object tracking by color in the next chapter.

In the next chapter, we will study colorspaces, transformations, and thresholding images.

The rest of the chapter is locked

You have been reading a chapter from

Raspberry Pi Computer Vision Programming. - Second Edition

Published in: Jun 2020Publisher: PacktISBN-13: 9781800207219

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Author (1)

Ashwin Pajankar

Ashwin Pajankar is an author, a YouTuber, and an instructor. He graduated from the International Institute of Information Technology, Hyderabad, with an MTech in Computer Science and Engineering. He has been writing programs for over two and a half decades. He is proficient in Linux, Unix shell scripting, C, C++, Java, JavaScript, Python, PowerShell, Golang, HTML, and assembly language. He has worked on single-board computers such as Raspberry Pi and Banana Pro. He is also proficient with microcontroller boards such as Arduino and the BBC Micro:bit. He is currently self-employed and teaches on Udemy and YouTube. He also organizes programming boot camps for working professionals and software companies.
Read more about Ashwin Pajankar

Other recommended products

Related to this chapter

Hands-On Internet of Things with Blynk

Blynk is gaining a lot of popularity among the masses as it is simple and portable to use, and it comes under mobile applications and devices. The book will introduce you to Blynk and will demonstrate how to setup the environment for building IoT applications. You will then deep dive into concepts like building a notification widget, display widget, and controller widgets. Besides this, you will learn how to build a Blynk server and then create a Blynk application on it. You will have hands-on experience in building IoT applications using Blynk.

BookMay 2018271 pages

Hands-On GPU-Accelerated Computer Vision with OpenCV and CUDA

This book is a guide to explore how accelerating of computer vision applications using GPUs will help you develop algorithms that work on complex image data in real time. It will solve the problems you face while deploying these algorithms on embedded platforms with the help of development boards from NVIDIA such as the Jetson TX1, Jetson TX2, and Jetson TK1.

BookSep 2018380 pages

Hands-On Robotics Programming with C++

C/C++ is one the legacy programming language for Robotics Programming. This book will help you understand and build complexly structured robots and implement various C/C++ programming libraries in it.

BookMar 2019312 pages

OpenCV 3.x with Python By Example

Computer vision is found everywhere in modern technology. OpenCV for Python enables us to run computer vision algorithms in real time. With the advent of powerful machines, we have more processing power to work with. Using this technology, we can seamlessly integrate our computer vision applications into the cloud. Focusing on OpenCV 3.x and Python 3.6, this book will walk you through all the building blocks needed to build amazing computer vision applications with ease.

BookJan 2018268 pages

Mastering OpenCV 4 with Python

Mastering OpenCV 4 with Python is a comprehensive guide to help you to get acquainted with various computer vision algorithms running in real-time. This book will help you to build complete projects on image processing, motion detection, and image segmentation where you can gain advanced computer vision techniques.

BookMar 2019532 pages

OpenCV 3 Computer Vision with Python Cookbook

OpenCV 3 is a native cross-platform library for computer vision, machine learning, and image processing. OpenCV's convenient high-level APIs hide very powerful internals designed for computational efficiency that can take advantage of multicore and GPU processing. This book will help you tackle increasingly challenging computer vision problems by providing a number of recipes that you can use to improve your applications.

BookMar 2018306 pages

Raspberry Pi 3 Projects for Java Programmers

This book will try to create starting points for Java developers who would like to extend their knowledge on how to interact with hardware on the Raspberry Pi by providing small real world usable projects. After reading this book the reader will be able to build their own real world usable projects not limited to Home Automation, IoT and/or Robotics utilizing logic, user- and web interfaces.

BookMay 2017286 pages

The Computer Vision Workshop

With The Computer Vision Workshop, you’ll explore the basic and advanced techniques in video and image processing using OpenCV and Python. It is filled with real-world exercises and activities that will make the learning process easy and enjoyable.

BookJul 2020568 pages

Hands-On Algorithms for Computer Vision

The field of Computer Vision has seen advancements in terms of processing power and performance. Many algorithms are introduced to perform Computer Vision tasks efficiently. This book is a starting point for anyone interested in this field and wants to dig deeper into the most practical algorithms used by professional Computer Vision developers.

BookJul 2018290 pages

Raspberry Pi Zero W Wireless Projects

Wireless communications have spread over the whole world, as the new era of Internet of Things come closer and closer. Low-energy hardware such as the Raspberry Pi Zero Wireless is perfect for many projects. This book introduces you to Raspberry Pi Zero W, the new family member with a wireless extension. Throughout the book, readers will learn how to build inexpensive, fast, and awesome projects to change their everyday routines.

BookAug 2017240 pages

Internet of Things Programming Projects

Taking a project-based approach this book will help you leverage sensors, actuators, Python programming and Raspberry Pi 3 to build connected things. Each chapter is an independent project where you will learn from connecting devices to building complex IoT projects. You will be well versed in every possible way to make your projects stand out.

BookOct 2018436 pages

Computer Vision with Python 3

The field of computer vision involves designing and implementing algorithms to understand images and extract meaningful information from them. This book enables you to build real-world applications using Python and open source image processing libraries.

BookAug 2017206 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages