You're reading from Hands-On Vision and Behavior for Self-Driving Cars

Product type: Book
Published in: Oct 2020
Publisher: Packt
ISBN-13: 9781800203587
Edition: 1st Edition
Authors (2):
Luca Venturi

Luca Venturi has extensive experience as a programmer with world-class companies, including Ferrari and Opera Software. He has also worked for some start-ups, including Activetainment (maker of the world's first smart bike), Futurehome (a provider of smart home solutions), and CompanyBook (whose offerings apply artificial intelligence to sales). He worked on the Data Platform team at Tapad (Telenor Group), making petabytes of data accessible to the rest of the company, and is now the lead engineer of Piano Software's analytical database.

Krishtof Korda

Krishtof Korda grew up in a mountainside home over which the US Navy's Blue Angels flew during the Reno Air Races each year. A graduate from the University of Southern California and the USMC Officer Candidate School, he set the Marine Corps obstacle course record of 51 seconds. He took his love of aviation to the USAF, flying aboard the C-5M Super Galaxy as a flight test engineer for 5 years, and engineered installations of airborne experiments for the USAF Test Pilot School for 4 years. Later, he transitioned to designing sensor integrations for autonomous cars at Lyft Level 5. Now he works as an applications engineer for Ouster, integrating LIDAR sensors in the fields of robotics, AVs, drones, and mining, and loves racing Enduro mountain bikes.


Chapter 9: Semantic Segmentation

This is probably the most advanced chapter concerning deep learning, as we will go as far as classifying an image at a pixel level with a technique called semantic segmentation. We will use plenty of what we have learned so far, including data augmentation with generators.

We will study a very flexible and efficient neural network architecture called DenseNet in great detail, as well as its extension for semantic segmentation, FC-DenseNet, and then we will write it from scratch and train it with a dataset built with Carla.

I hope you will find this chapter inspiring and challenging. And be prepared for a long training session because our task can be quite demanding!

In this chapter, we will cover the following topics:

  • Introducing semantic segmentation
  • Understanding DenseNet for classification
  • Segmenting images with CNN
  • Adapting DenseNet for semantic segmentation
  • Coding the blocks of FC-DenseNet
  • Improving bad...

Technical requirements

To be able to use the code explained in this chapter, you will need to have the following tools and modules installed:

  • The Carla simulator
  • Python 3.7
  • The NumPy module
  • The TensorFlow module
  • The Keras module
  • The OpenCV-Python module
  • A GPU (recommended)
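The book does not pin exact versions; assuming a standard Python 3.7 environment, the Python modules above can typically be installed with pip (Keras ships inside TensorFlow 2.x, and the Carla simulator is installed separately from its own releases page):

```shell
# Install the Python dependencies for this chapter.
# Keras is bundled with TensorFlow 2.x, so no separate install is needed.
pip install numpy tensorflow opencv-python
```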

The code for this chapter can be found at https://github.com/PacktPublishing/Hands-On-Computer-Vision-for-Self-Driving-Cars.

The Code in Action videos for this chapter can be found here:

https://bit.ly/3jquo3v

Introducing semantic segmentation

In the previous chapters, we implemented several classifiers, where we provided an image as input and the network said what it was. This can be excellent in many situations, but to be very useful, it usually needs to be combined with a method that can identify the region of interest. We did this in Chapter 7, Detecting Pedestrians and Traffic Lights, where we used SSD to identify a region of interest with a traffic light and then our neural network was able to tell the color. But even this would not be very useful to us, because the regions of interest produced by SSD are rectangles, and therefore a network telling us that there is a road basically as big as the image would not provide much information: is the road straight? Is there a turn? We cannot know. We need more precision.

If object detectors such as SSD brought classification to the next level, now we need to reach the level after that, and maybe more. In fact, we want to classify every...

Understanding DenseNet for classification

DenseNet is a fascinating architecture of neural networks that is designed to be flexible, memory efficient, effective, and also relatively simple. There are really a lot of things to like about DenseNet.

The DenseNet architecture is designed to build very deep networks, solving the problem of the vanishing gradient with techniques derived from ResNet. Our implementation will reach 50 layers, but you can easily build a deeper network. In fact, Keras has three types of DenseNet trained on ImageNet, with 121, 169, and 201 layers, respectively. DenseNet also solves the problem of dead neurons, which occurs when some neurons are essentially inactive. The next section will give a high-level overview of DenseNet.
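As a quick way to see these depths in practice, Keras exposes the three pretrained variants under tf.keras.applications (weights=None below skips the ImageNet download; pass weights='imagenet' to get the pretrained filters):

```python
import tensorflow as tf

# Build the 121-layer variant; DenseNet169 and DenseNet201 work the same way.
# weights=None builds an untrained network; weights="imagenet" downloads the
# pretrained ImageNet filters instead.
model = tf.keras.applications.DenseNet121(weights=None)

# The classifier head outputs one score per ImageNet class.
print(model.output_shape)
```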

DenseNet from a bird's-eye view

For the moment, we will focus on DenseNet as a classifier, which is not what we are going to implement, but it is useful as a concept to start to understand it. The high-level architecture of DenseNet...

Segmenting images with CNN

A typical semantic segmentation task receives as input an RGB image and needs to output an image with the raw segmentation, but this solution could be problematic. We already know that classifiers generate their results using one-hot encoded labels, and we can do the same for semantic segmentation: instead of generating a single image with the raw segmentation, the network can create a series of one-hot encoded images. In our case, as we need 13 classes, the network will output 13 RGB images, one per label, with the following features:

  • One image describes only one label.
  • The pixels belonging to the label have a value of 1 in the red channel, while all the other pixels are marked as 0.

Each given pixel can be 1 only in one image; it will be 0 in all the remaining images. This is a difficult task, but it does not necessarily require particular architectures: a series of convolutional layers with same padding can do it; however, their cost...
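The per-label encoding described above is simply a one-hot expansion of a class-index map. A minimal NumPy sketch (the 4x4 label map below is made up for illustration; here each class gets one channel of a single tensor, which is equivalent to one image per label):

```python
import numpy as np

NUM_CLASSES = 13  # one channel per label, as in our Carla dataset

# A toy 4x4 segmentation ground truth: each pixel holds a class index.
label_map = np.array([[0, 0, 1, 1],
                      [0, 2, 2, 1],
                      [3, 3, 2, 1],
                      [3, 3, 3, 12]])

# One-hot encode by indexing the identity matrix: shape becomes
# (H, W, NUM_CLASSES), and for every pixel exactly one channel is 1
# while the remaining 12 are 0.
one_hot = np.eye(NUM_CLASSES, dtype=np.uint8)[label_map]

print(one_hot.shape)
```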

Adapting DenseNet for semantic segmentation

DenseNet is very suitable for semantic segmentation because of its efficiency, accuracy, and abundance of skip layers. In fact, using DenseNet for semantic segmentation proves to be effective even when the dataset is limited and when a label is underrepresented.

To use DenseNet for semantic segmentation, we need to be able to build the right side of the U network, which means that we need the following:

  • A way to increase the resolution; if we call the transition layers of DenseNet transition-down layers, then we need transition-up layers.
  • A way to build the skip connections that join the left and right sides of the U network.

Our reference network is FC-DenseNet, also known as the One Hundred Layers Tiramisu, though we are not trying to reach 100 layers.

In practice, we want to achieve an architecture similar to the following:

Figure 9.8 – Example of FC-DenseNet architecture

The horizontal red arrows...
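The right-hand side of the U can be sketched as follows (the names and filter counts here are illustrative, not the book's exact code): a transition-up layer doubles the spatial resolution with a transposed convolution, and the skip connection concatenates the matching feature map from the down path:

```python
import tensorflow as tf
from tensorflow.keras import layers

def transition_up(x, skip, filters):
    """Double the spatial resolution with a transposed convolution,
    then concatenate the skip connection from the down path."""
    x = layers.Conv2DTranspose(filters, kernel_size=3, strides=2,
                               padding='same')(x)
    return layers.Concatenate()([x, skip])

# Toy tensors: a 10x10 feature map from the bottom of the U, and the
# 20x20 skip tensor saved at the matching level of the down path.
bottom = tf.keras.Input(shape=(10, 10, 48))
skip = tf.keras.Input(shape=(20, 20, 32))

out = transition_up(bottom, skip, filters=48)
print(out.shape)  # resolution doubled, channels concatenated
```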

Coding the blocks of FC-DenseNet

DenseNet is very flexible, so you can easily configure it in many ways. However, depending on the hardware of your computer, you might hit the limits of your GPU. The following are the values that I used on my computer, but feel free to change them to achieve better accuracy or to reduce the memory consumption or the time required to train the network:

  • Input and output resolution: 160 x 160
  • Growth rate (the number of channels added by each convolutional layer in a dense block): 12
  • Number of dense blocks: 11 (5 down, 1 transitioning between down and up, and 5 up)
  • Number of convolutional blocks in each dense block: 4
  • Batch size: 4
  • Bottleneck layer in the dense blocks: No
  • Compression factor: 0.6
  • Dropout: Yes, 0.2

We will define some functions that you can use to build FC-DenseNet and, as usual, you are invited to check out the full code on GitHub.

The first function just defines a convolution with batch normalization:

def dn_conv...
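The book's full dn_conv is in the GitHub repository; as a hedged sketch of what such a block typically looks like (the signature and the pre-activation BN -> ReLU -> Conv order follow the DenseNet paper, and the names here are illustrative):

```python
import tensorflow as tf
from tensorflow.keras import layers

def dn_conv(x, filters, kernel_size=3, dropout=0.2):
    # Pre-activation order used by DenseNet: BN -> ReLU -> Conv.
    x = layers.BatchNormalization()(x)
    x = layers.ReLU()(x)
    x = layers.Conv2D(filters, kernel_size, padding='same',
                      use_bias=False)(x)
    if dropout:
        x = layers.Dropout(dropout)(x)
    return x

# With a growth rate of 12, each call adds 12 channels at full resolution.
inp = tf.keras.Input(shape=(160, 160, 3))
out = dn_conv(inp, filters=12)
print(out.shape)
```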

Summary

Congratulations! You completed the final chapter on deep learning.

We started this chapter by discussing what semantic segmentation means, then we talked extensively about DenseNet and why it is such a great architecture. We quickly talked about using a stack of convolutional layers to implement semantic segmentation, but we focused on a more efficient way, which is using DenseNet after adapting it to this task. In particular, we developed an architecture similar to FC-DenseNet. We collected a dataset with the ground truth for semantic segmentation, using Carla, and then we trained our neural network on it and saw how it performed when detecting roads and other objects, such as pedestrians and sidewalks. We even discussed a trick to improve the output of a bad semantic segmentation.

This chapter was quite advanced, and it required a good understanding of all the previous chapters about deep learning. It has been quite a ride, and I think it is fair to say that this...

Questions

After reading this chapter, you will be able to answer the following questions:

  1. What is a distinguishing characteristic of DenseNet?
  2. What is the name of the architecture family that inspired the authors of DenseNet?
  3. What is FC-DenseNet?
  4. Why do we say that FC-DenseNet is U-shaped?
  5. Do you need a fancy architecture like DenseNet to perform semantic segmentation?
  6. If you have a neural network that performs poorly at semantic segmentation, is there a quick fix that you can use sometimes, if you have no other options?
  7. What are skip connections used for in FC-DenseNet and other U-shaped architectures?

Further reading
