All Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Newsletters

Free Learning

You're reading from Modern Computer Vision with PyTorch

Product type Book

Published in Nov 2020

Publisher Packt

ISBN-13 9781839213472

Pages 824 pages

Edition 1st Edition

Languages

Python

Concepts

Computer Vision

Authors (2):

V Kishore Ayyadevara

Yeshwanth Reddy

View More author details

Table of Contents (25) Chapters

Preface

Section 1 - Fundamentals of Deep Learning for Computer Vision

Artificial Neural Network Fundamentals

PyTorch Fundamentals

Building a Deep Neural Network with PyTorch

Section 2 - Object Classification and Detection

Introducing Convolutional Neural Networks

Transfer Learning for Image Classification

Practical Aspects of Image Classification

Basics of Object Detection

Advanced Object Detection

Image Segmentation

Applications of Object Detection and Segmentation

Section 3 - Image Manipulation

Autoencoders and Image Manipulation

Image Generation Using GANs

Advanced GANs to Manipulate Images

Section 4 - Combining Computer Vision with Other Techniques

Training with Minimal Data Points

Combining Computer Vision and NLP Techniques

Combining Computer Vision and Reinforcement Learning

Moving a Model to Production

Using OpenCV Utilities for Image Analysis

Other Books You May Enjoy

Leave a review - let other readers know what you think

Appendix

Chapter 7 - Basics of Object Detection

How does the region proposal technique generate proposals?
It identifies regions that are similar in color, texture, size, and shape.
How is IoU calculated if there are multiple objects in an image?
IoU is calculated for each object with the ground truth, using Intersection Over Union metric
Why does R-CNN take a long time to generate predictions?
Because we create as many forward propagations as there are proposals
Why is Fast R-CNN faster when compared to R-CNN?
For all proposals, extracting the feature map from the VGG backbone is common. This reduces almost 90% of the computations as compared to Fast RCNN

How does RoI Pooling work?
All the selectivesearch crops are passed through adaptive pooling kernel so that the final output is of the same size
What is the impact of not having multiple layers, post obtaining feature map, when predicting the bounding box corrections?
You might not notice that the model did not learn to predict the bounding...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime}

Authors (2)

V Kishore Ayyadevara

V Kishore Ayyadevara leads a team focused on using AI to solve problems in the healthcare space. He has 10 years' experience in data science, solving problems to improve customer experience in leading technology companies. In his current role, he is responsible for developing a variety of cutting edge analytical solutions that have an impact at scale while building strong technical teams. Prior to this, Kishore authored three books — Pro Machine Learning Algorithms, Hands-on Machine Learning with Google Cloud Platform, and SciPy Recipes. Kishore is an active learner with keen interest in identifying problems that can be solved using data, simplifying the complexity and in transferring techniques across domains to achieve quantifiable results.

See other products by V Kishore Ayyadevara

Yeshwanth Reddy

Yeshwanth is a highly accomplished data scientist manager with 9+ years of experience in deep learning and document analysis. He has made significant contributions to the field, including building software for end-to-end document digitization, resulting in substantial cost savings. Yeshwanth's expertise extends to developing modules in OCR, word detection, and synthetic document generation. His groundbreaking work has been recognized through multiple patents. He also created a few Python libraries. With a passion for disrupting unsupervised and self-supervised learning, Yeshwanth is dedicated to reducing reliance on manual annotation and driving innovative solutions in the field of data science.

See other products by Yeshwanth Reddy