You're reading from Hands-On Computer Vision with Detectron2

Product type Book

Published in Apr 2023

Publisher Packt

ISBN-13 9781800561625

Pages 318 pages

Edition 1st Edition

Languages

Python

Concepts

Computer Vision

Author (1):

Van Vung Pham

Table of Contents (20) Chapters

Preface

1. Part 1: Introduction to Detectron2

2. Chapter 1: An Introduction to Detectron2 and Computer Vision Tasks

3. Chapter 2: Developing Computer Vision Applications Using Existing Detectron2 Models

4. Part 2: Developing Custom Object Detection Models

5. Chapter 3: Data Preparation for Object Detection Applications

6. Chapter 4: The Architecture of the Object Detection Model in Detectron2

7. Chapter 5: Training Custom Object Detection Models

8. Chapter 6: Inspecting Training Results and Fine-Tuning Detectron2’s Solvers

9. Chapter 7: Fine-Tuning Object Detection Models

10. Chapter 8: Image Data Augmentation Techniques

11. Chapter 9: Applying Train-Time and Test-Time Image Augmentations

12. Part 3: Developing a Custom Detectron2 Model for Instance Segmentation Tasks

13. Chapter 10: Training Instance Segmentation Models

14. Chapter 11: Fine-Tuning Instance Segmentation Models

15. Part 4: Deploying Detectron2 Models into Production

16. Chapter 12: Deploying Detectron2 Models into Server Environments

17. Chapter 13: Deploying Detectron2 Models into Browsers and Mobile Environments

18. Index

Why subscribe?

19. Other Books You May Enjoy

Preface

Computer vision takes part and has become a critical success factor in many modern businesses such as automobile, robotics, manufacturing, and biomedical image processing – and its market is growing rapidly. This book will help you explore Detectron2. It is the next-generation library that provides cutting-edge computer vision algorithms. Many research and practical projects at Facebook (now Meta) use it as a library to support computer vision tasks. Its models can be exported to TorchScript and Open Neural Network Exchange (ONNX) format for deployments into server production environments (such as C++ runtime), browsers, and mobile devices.

By utilizing code and visualizations, this book will guide you on using existing models in Detectron2 for computer vision tasks (object detection, instance segmentation, key-point detection, semantic detection, and panoptic segmentation). It also covers theories and visualizations of Detectron2’s architectures and how each module in Detectron2 works. This book walks you through two complete hands-on, real-life projects (preparing data, training models, fine-tuning models, and deployments) for object detection and instance segmentation of brain tumors using Detectron2.

The data preparation section discusses common sources of datasets for computer vision applications and tools to collect and label data. It also describes common image data annotation formats and codes to convert from different formats to the one Detectron2 supports. The training model section guides the steps to prepare the configuration file, load pre-trained weights for transfer learning (if necessary), and modify the default trainer to meet custom business requirements.

The fine-tuning model section includes inspecting training results using TensorBoard and optimizing Detectron2 solvers. It also provides a primer to common and cutting-edge image augmentation techniques and how to use existing Detectron2 image augmentation techniques or to build and apply custom image augmentation techniques at training and testing time. There are also techniques to fine-tune object detection models, such as computing appropriate configurations for generating anchors (sizes and ratios of the anchors) or means or standard deviations of the pixel values from custom datasets. For instance segmentation task, this book also discusses the use of PointRend to improve the quality of the boundaries of the detected instances.

This book also covers steps for deploying Detectron2 models into production and developing Detectron2 applications for mobile devices. Specifically, it provides the model formats and platforms that Detectron2 supports, such as TorchScript and ONNX formats. It provides the code to convert Detectron2 into these formats models using tracing and scripting approaches. Additionally, code snippets illustrate how to deploy Detectron2 models into C++ and browser environments. Finally, this book also discusses D2Go, a platform to train, fine-tune, and quantize computer visions so they can be deployable to mobile and edge devices with low-computation resource awareness.

Through this book, you will find that Detectron2 is a valuable framework for anyone looking to build robust computer vision applications.