You're reading from OpenCV 4 Computer Vision Application Programming Cookbook - Fourth Edition

Product typeBook

Published inMay 2019

Reading LevelIntermediate

PublisherPackt

ISBN-139781789340723

Edition4th Edition

Languages

Java

Tools

OpenCV

Concepts

Computer Vision

Authors (2):

David Millán Escrivá

Robert Laganiere

View More author details

OpenCV Advanced Features

Deep learning is a subfield of machine learning, based on traditional neural networks and convolutional neural networks. It's state-of-the-art pattern recognition. Deep learning is reaching the best accuracy in fields such as speech recognition, text recognition, and image classification. Actually, OpenCV adds the deep learning module as a basic module in its core algorithms and is making big efforts to increase its performance using CPUs and GPUs.

In this chapter of advanced features, we are going to cover how to enable Halide, a new language that OpenCV is using to improve the performance of deep learning inferences.

Finally, we are going to explore the new possibilities of using OpenCV in a web browser as a JavaScript library, which allows us to use computer vision in a web client without installing anything on the client computer.

In this chapter...

Face detection using deep learning

In this recipe, we are going to learn how to use a trained deep learning model for a face detection algorithm in OpenCV. To do this, we are going to download pre-trained face detection models and use OpenCV methods to import the model and also see how to convert an input image or frame into the required deep learning structure.

How to do it...

Using deep learning in OpenCV is very easy, and the only files that we require are the pre-trained models and know the basic configuration of it. We can download tested, pre-trained OpenCV models from https://github.com/opencv/open_model_zoo.

To create the face detector algorithm, follow these steps:

Download and save the model of the face detector...

Object detection with YOLOv3

In this recipe, we are going to use the YOLOv3 DNN algorithm, which performs object detection. Object detection in an image is a common task in computer vision; with deep learning techniques, we can achieve highly accurate detections. YOLOv3 reaches a 60.6 mAP in the COCO dataset (80 classes and more than 300,000 images), which has a very good precision with 20 fps or 33 mAP with 220 fps.

How to do it...

In this recipe, we are going to use the same functions and classes to load the model, convert the images, and predict, but we are going to introduce one more important function – non-maximum suppression – and a few util functions to draw the predictions with their labels:

Create...

Enabling Halide to improve efficiency

Halide is a new programming language written for high-performance image processing on new computers. Halide currently targets the following:

CPU architectures: X86, ARM, MIPS, Hexagon, and PowerPC
Operating systems: Linux, Windows, macOS, Android, iOS, and Qualcomm QuRT
GPU Compute APIs: CUDA, OpenCL, OpenGL, OpenGL Compute Shaders, Apple Metal, and Microsoft Direct X 12

In this recipe, we are going to show how to compile OpenCV with Halide and how to enable it in your DNN algorithms.

How to do it...

To enable Halide in OpenCV, follow these steps. It will take a while to compile it all:

Download and unpack the LLVM source code from http://releases.llvm.org/4.0.0/llvm-4.0.0.src.tar.xz...

OpenCV.js introduction

This recipe introduces a new way to develop computer vision algorithms with OpenCV using JavaScript for the World Wide Web (WWW). Before OpenCV.js, developers that needed some computer vision tasks on the web had to develop them in C++ in servers or Java applets clients, which were not optimized, but with OpenCV.js, the new paradigm of client applications using web browsers has grown with unlimited of possibilities.

In this recipe, we are going to create a very simple case where we load an image into an img HTML tag, change the color image, and present it to a canvas element.

How to do it...

Before starting to use OpenCV.js, we need a web page as a user interface. To get a user interface, we are going...

The rest of the chapter is locked

You have been reading a chapter from

OpenCV 4 Computer Vision Application Programming Cookbook - Fourth Edition

Published in: May 2019Publisher: PacktISBN-13: 9781789340723

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (2)

David Millán Escrivá

David Millán Escrivá was 8 years old when he wrote his first program on an 8086 PC in Basic, which enabled the 2D plotting of basic equations. In 2005, he finished his studies in IT with honors, through the Universitat Politécnica de Valencia, in human-computer interaction supported by computer vision with OpenCV (v0.96). He has worked with Blender, an open source, 3D software project, and on its first commercial movie, Plumiferos, as a computer graphics software developer. David has more than 10 years' experience in IT, with experience in computer vision, computer graphics, pattern recognition, and machine learning, working on different projects, and at different start-ups, and companies. He currently works as a researcher in computer vision.
Read more about David Millán Escrivá

Robert Laganiere

Robert Laganiere is a professor at the School of Electrical Engineering and Computer Science of the University of Ottawa, Canada. He is also a faculty member of the VIVA research lab and is the co-author of several scientific publications and patents in content based video analysis, visual surveillance, driver-assistance, object detection, and tracking. Robert authored the OpenCV2 Computer Vision Application Programming Cookbook in 2011 and co-authored Object Oriented Software Development published by McGraw Hill in 2001. He co-founded Visual Cortek in 2006, an Ottawa-based video analytics start-up that was later acquired by iwatchlife.com in 2009. He is also a consultant in computer vision and has assumed the role of Chief Scientist in a number of start-up companies such as Cognivue Corp, iWatchlife, and Tempo Analytics. Robert has a Bachelor of Electrical Engineering degree from Ecole Polytechnique in Montreal (1987) and MSc and PhD degrees from INRS-Telecommunications, Montreal (1996). You can visit the author's website at laganiere.name.
Read more about Robert Laganiere

Other recommended products

Related to this chapter

Hands-On Algorithms for Computer Vision

The field of Computer Vision has seen advancements in terms of processing power and performance. Many algorithms are introduced to perform Computer Vision tasks efficiently. This book is a starting point for anyone interested in this field and wants to dig deeper into the most practical algorithms used by professional Computer Vision developers.

BookJul 2018290 pages

OpenCV 3 Computer Vision with Python Cookbook

OpenCV 3 is a native cross-platform library for computer vision, machine learning, and image processing. OpenCV's convenient high-level APIs hide very powerful internals designed for computational efficiency that can take advantage of multicore and GPU processing. This book will help you tackle increasingly challenging computer vision problems by providing a number of recipes that you can use to improve your applications.

BookMar 2018306 pages

Learn OpenCV 4 By Building Projects

OpenCV is mainly used in Computer Vision and image processing and is considered to be one of the best open source libraries that helps developers focus on constructing complete projects on image processing, motion detection, and image segmentation. This book will be your guide to understanding the basic OpenCV concepts and algorithms.

BookNov 2018310 pages

OpenCV 3.x with Python By Example

Computer vision is found everywhere in modern technology. OpenCV for Python enables us to run computer vision algorithms in real time. With the advent of powerful machines, we have more processing power to work with. Using this technology, we can seamlessly integrate our computer vision applications into the cloud. Focusing on OpenCV 3.x and Python 3.6, this book will walk you through all the building blocks needed to build amazing computer vision applications with ease.

BookJan 2018268 pages

Computer Vision with Python 3

The field of computer vision involves designing and implementing algorithms to understand images and extract meaningful information from them. This book enables you to build real-world applications using Python and open source image processing libraries.

BookAug 2017206 pages

Mastering OpenCV 3

As we become more capable of handling data in every kind, we are becoming more reliant on visual input and what we can do with those self-driving cars, face recognition, and even augmented reality applications and games. This is all powered by Computer Vision. This book will put you straight to work in creating powerful and unique computer vision applications.

BookApr 2017250 pages

Hands-On Image Processing with Python

This book covers how to use the image processing libraries in Python. It will enable you to write code snippets to implement complex image processing algorithms such as image enhancement, filtering, segmentation, object detection, and more. You will also be able to use machine learning and deep learning models and learn to implement them with ease.

BookNov 2018492 pages

Hands-On GPU-Accelerated Computer Vision with OpenCV and CUDA

This book is a guide to explore how accelerating of computer vision applications using GPUs will help you develop algorithms that work on complex image data in real time. It will solve the problems you face while deploying these algorithms on embedded platforms with the help of development boards from NVIDIA such as the Jetson TX1, Jetson TX2, and Jetson TK1.

BookSep 2018380 pages

Mastering OpenCV 4

Mastering OpenCV, now in its third edition, targets computer vision engineers taking their first steps toward mastering OpenCV. Keeping the mathematical formulations to a solid but bare minimum, the book delivers complete projects from ideation to running code, targeting current hot topics in computer vision such as face recognition, landmark detection and pose estimation, and number recognition with deep convolutional networks.

BookDec 2018280 pages

Hands-On Computer Vision with Julia

This book is a thorough guide for developers who want to get started with building computer vision applications using Julia. Julia is well suited to image processing because of its ease of use and the fact that it lets you write easy-to-compile and efficient machine code.

BookJun 2018202 pages

Raspberry Pi Computer Vision Programming

You will learn the basics of hardware and software required for image processing and computer vision with Raspberry Pi and Python 3. You will have a look at all the major image processing, manipulation, and computer vision techniques and algorithms in detail using engaging examples. You will build a lot of real-life computer vision applications.

BookJun 2020306 pages5

Learning OpenCV 4 Computer Vision with Python 3

Now in its third edition, this is the original book on OpenCV’s Python bindings. Readers will learn a great range of techniques and algorithms, from the classics to the state-of-the-art, and from geometry to machine learning. All of this is in aid of solving practical computer vision problems in well-built applications.

BookFeb 2020372 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages