Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Modern Computer Vision with PyTorch

You're reading from  Modern Computer Vision with PyTorch

Product type Book
Published in Nov 2020
Publisher Packt
ISBN-13 9781839213472
Pages 824 pages
Edition 1st Edition
Languages
Authors (2):
V Kishore Ayyadevara V Kishore Ayyadevara
Profile icon V Kishore Ayyadevara
Yeshwanth Reddy Yeshwanth Reddy
Profile icon Yeshwanth Reddy
View More author details

Table of Contents (25) Chapters

Preface Section 1 - Fundamentals of Deep Learning for Computer Vision
Artificial Neural Network Fundamentals PyTorch Fundamentals Building a Deep Neural Network with PyTorch Section 2 - Object Classification and Detection
Introducing Convolutional Neural Networks Transfer Learning for Image Classification Practical Aspects of Image Classification Basics of Object Detection Advanced Object Detection Image Segmentation Applications of Object Detection and Segmentation Section 3 - Image Manipulation
Autoencoders and Image Manipulation Image Generation Using GANs Advanced GANs to Manipulate Images Section 4 - Combining Computer Vision with Other Techniques
Training with Minimal Data Points Combining Computer Vision and NLP Techniques Combining Computer Vision and Reinforcement Learning Moving a Model to Production Using OpenCV Utilities for Image Analysis Other Books You May Enjoy Appendix

Exploring the U-Net architecture

Imagine a scenario where you've been given an image and been asked to predict which pixel corresponds to what object. So far, when we have been predicting the class of an object and the bounding box corresponding to the object, we passed the image through a network, which then passes the image through a backbone architecture (such as VGG or ResNet), flattens the output at a certain layer, and connects additional dense layers before making predictions for the class and bounding box offsets. However, in the case of image segmentation, where the output shape is the same as that of the input image's shape, flattening the convolutions' outputs and then reconstructing the image might result in a loss of information. Furthermore, the contours and shapes present in the original image will not vary in the output image in the case of image segmentation, so the networks we have dealt with so far (which flatten the last layer and connect additional...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}