You're reading from Generative Adversarial Networks Cookbook

Product type: Book

Published in: Dec 2018

Publisher: Packt

ISBN-13: 9781789139907

Edition: 1st Edition

Concepts

Artificial Intelligence

Author (1)

Josh Kalin

Chapter 7. Using Simulated Images To Create Photo-Realistic Eyeballs with SimGAN

In this chapter, we'll cover the following recipes:

How the SimGAN architecture works
Pseudocode – how does it work?
How to work with training data
Code implementation – loss functions
Code implementation – generator
Code implementation – discriminator
Code implementation – GAN
Training the SimGAN network

Introduction

This chapter will focus on the SimGAN paper and how to take simulated data and make it look more realistic. The generator network used in the SimGAN architecture is able to improve the fidelity of simulated data.

How the SimGAN architecture works

Apple released a paper titled Learning from Simulated and Unsupervised Images through Adversarial Training (https://arxiv.org/pdf/1612.07828.pdf), in which the authors coined the architecture name SimGAN. As set out in the paper, SimGAN lets users refine simulated data to make it look more realistic. In this section, we'll discuss how the SimGAN architecture works.

Getting ready

The only thing you'll need in this section is the paper mentioned previously, Learning from Simulated and Unsupervised Images through Adversarial Training, which can be downloaded and read at https://arxiv.org/pdf/1612.07828.pdf.

How to do it...

In the SimGAN paper, the authors set out to create a refiner network that can improve the realism of synthetic images in an unsupervised manner. In the past, it has been quite hard to find matched simulation and real data for training such networks, but SimGAN has changed the existing landscape thanks to its focus on a simulated...

Pseudocode – how does it work?

With every technique, we need to understand the baseline algorithm before we can lay down any code. So, in this section, we'll discuss how the training algorithm works.

Getting ready

In this section, we'll be referring to the SimGAN paper once again.

How to do it...

In the SimGAN paper, the authors provided a convenient graphic for users to base their development on. We already know that we need to develop models for each of the networks, but how do we train a network in the first place? The following diagram offers an explanation:

Figure: Algorithm (training procedure from the SimGAN paper)

Let's convert the preceding diagram into the following, tangible steps:

Read both synthetic images and real images into variables.
Then, for every epoch, do the following:
- Train the refiner network on a random mini-batch K_G times
- Train the discriminator network on a random mini-batch K_D times
Stop when the number of epochs is reached, or when the loss has not changed significantly for n epochs.
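The steps above can be sketched as a plain Python loop. This is only an illustrative skeleton, not the chapter's training code: `refiner_step` and `discriminator_step` are hypothetical callables that each perform one update and return a loss, and the stopping rule (refiner loss changing by less than `tol` for `patience` consecutive epochs) is an assumption about what "not changed significantly" means.

```python
import random


def train_simgan(synthetic, real, refiner_step, discriminator_step,
                 epochs=10, k_g=2, k_d=1, batch_size=4,
                 patience=3, tol=1e-4, seed=0):
    """Alternate K_G refiner updates and K_D discriminator updates per epoch.

    refiner_step(synthetic_batch) and
    discriminator_step(synthetic_batch, real_batch) are hypothetical
    callables returning a float loss. Assumes k_g >= 1 and k_d >= 1.
    """
    rng = random.Random(seed)
    n_syn = min(batch_size, len(synthetic))
    n_real = min(batch_size, len(real))
    history = []
    stale = 0  # consecutive epochs with an (almost) unchanged refiner loss

    for _ in range(epochs):
        # Train the refiner on random mini-batches K_G times
        for _ in range(k_g):
            g_loss = refiner_step(rng.sample(synthetic, n_syn))

        # Train the discriminator on random mini-batches K_D times
        for _ in range(k_d):
            d_loss = discriminator_step(rng.sample(synthetic, n_syn),
                                        rng.sample(real, n_real))

        history.append((g_loss, d_loss))

        # Assumed stopping rule: refiner loss flat for `patience` epochs
        if len(history) > 1 and abs(history[-1][0] - history[-2][0]) < tol:
            stale += 1
        else:
            stale = 0
        if stale >= patience:
            break

    return history
```

In a real run, the two callables would wrap the model update calls for the refiner and discriminator networks built later in the chapter.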

How to work with training data

As with every architecture we train throughout this book, understanding the structure of the data and the development environment is important to overall success. So, in this section, we'll set up the development environment and download the data inside the Docker container.

Getting ready

You'll need to create a folder at the $HOME directory level of your Linux machine with the following directory structure (which can be checked using the tree command):

├── docker
│   ├── build.sh
│   ├── clean.sh
│   ├── Dockerfile
│   └── kaggle.json
├── out
├── README.md
├── run.sh
└── src
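If you prefer to script the layout rather than create it by hand, a minimal sketch with pathlib might look like the following. The `scaffold` helper and the project folder name are hypothetical. The sketch deliberately does not create kaggle.json, since that file holds your personal API credentials and has to come from your Kaggle account.

```python
from pathlib import Path


def scaffold(root):
    """Create the empty folders and placeholder files shown in the tree."""
    root = Path(root)
    for folder in ("docker", "out", "src"):
        (root / folder).mkdir(parents=True, exist_ok=True)
    for name in ("docker/build.sh", "docker/clean.sh",
                 "docker/Dockerfile", "README.md", "run.sh"):
        (root / name).touch()  # empty placeholder, filled in later
    return root


if __name__ == "__main__":
    # "simgan" is an assumed folder name; use whatever name you like
    scaffold(Path.home() / "simgan")
```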

How to do it...

In this chapter, we're going to introduce the Kaggle API so we can grab the necessary data for the SimGAN training architecture. Using the Kaggle API will require you to set up a Kaggle account and get API token access.

Kaggle and its API

Kaggle.com is a popular online site that holds machine learning (ML) competitions and discussions. Kaggle also supplies an API for accessing...
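As a small sanity check on the API token, the sketch below reads a kaggle.json file and confirms it has the two fields a Kaggle token file contains, username and key. The `load_kaggle_token` helper and its default path (taken from the directory tree above) are assumptions for illustration, not part of the Kaggle API.

```python
import json
from pathlib import Path


def load_kaggle_token(path="docker/kaggle.json"):
    """Read the token file and check that both expected fields are present."""
    token = json.loads(Path(path).read_text())
    missing = {"username", "key"} - token.keys()
    if missing:
        raise ValueError(f"kaggle.json is missing: {sorted(missing)}")
    return token
```

Running this before building the Docker image catches an empty or mis-copied token early, before any download is attempted.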

Code implementation – loss functions

In this section, we're going to develop custom loss functions that will be used for the discriminator, generator, and adversarial models. We'll cover two loss functions in detail.

Getting ready

It's time for a directory check! Make sure you've created and placed the relevant data in each of the following folders and files. In this step, we're adding the loss.py file:

├── data
├── docker
│   ├── build.sh
│   ├── clean.sh
│   ├── Dockerfile
│   └── kaggle.json
├── out
├── README.md
├── run.sh
└── src
    └── loss.py

How to do it...

This is a fairly simple section made up of three primary steps: creating the loss.py file, then placing two loss functions in it for us to use later in development.

Perform the following steps to create the loss.py file:

Add the python3 interpreter line to the top of the file and import tensorflow, as follows:

#!/usr/bin/env python3
import tensorflow as tf

Implement the self-regularization loss...
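To make the two losses concrete, here is a minimal NumPy sketch of them as the SimGAN paper describes them: an L1 self-regularization term that keeps the refined image close to the synthetic input, and a local adversarial term that averages a two-class cross-entropy over image patches. This is not the chapter's TensorFlow code; the function names, the `delta` weight, and the array shapes are assumptions made for illustration.

```python
import numpy as np


def self_regularization_loss(synthetic, refined, delta=1e-4):
    """Scaled L1 distance between the refined and the synthetic images."""
    return float(delta * np.sum(np.abs(refined - synthetic)))


def local_adversarial_loss(labels, logits):
    """Mean softmax cross-entropy over all local patches.

    labels and logits are arrays whose last axis has size 2
    (one-hot class labels and raw discriminator scores).
    """
    labels = labels.reshape(-1, 2)
    logits = logits.reshape(-1, 2)
    # Numerically stable log-softmax over the two classes
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-(labels * log_probs).sum(axis=1).mean())
```

With all-zero logits the discriminator is undecided on every patch, so the adversarial term equals ln 2; a well-trained discriminator pushes it toward zero.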

Code implementation – generator

In this case, the generator is also known as the refiner network. This generator, therefore, is the network that will take and refine the simulated data.

Getting ready

Check that you have the following files in the correct place:

├── data
├── docker
│   ├── build.sh
│   ├── clean.sh
│   ├── Dockerfile
│   └── kaggle.json
├── out
├── README.md
├── run.sh
└── src
    ├── generator.py
    └── loss.py

How to do it...

In this section, we'll look at boilerplate items, model development, and helper functions that help us to build the full generator.

Boilerplate items

There are two key steps in the boilerplate, and they are as follows:

Add all of the following import statements needed to create the generator (refiner) network:

#!/usr/bin/env python3
import sys
import numpy as np
from keras.layers import Dense, Reshape, Input, BatchNormalization, Concatenate, Activation
from keras.layers.convolutional import UpSampling2D, Convolution2D...

Code implementation – discriminator

The discriminator in SimGAN is a fairly simple Convolutional Neural Network (CNN) with a small twist at the end: it outputs the likelihoods of the input being simulated and being real. In this section, we'll also make use of a function from the loss.py file we built earlier.

Getting ready

We've built a set of loss functions and the generator class, so now it's time to build the discriminator class. You should see the following structure in your directory:

├── data
├── docker
│   ├── build.sh
│   ├── clean.sh
│   ├── Dockerfile
│   └── kaggle.json
├── imgs
│   ├── create_token.png
│   ├── kaggle_signup.png
│   ├── MyAccount.png
│   ├── refiner_network_training.png
│   └── simGAN_network.png
├── out
│   └── Generator_Model.png
├── README.md
├── run.sh
└── src
    ├── discriminator.py
    ├── generator.py
    └── loss.py

How to do it...

The discriminator is very similar to other discriminators we've built in previous chapters. In this case, we're essentially building a CNN with a slightly different...

Code implementation – GAN

The Generative Adversarial Model, or GAN, is at the heart of the adversarial training architecture. Here, the model is unusual only in that we use custom loss functions in our compile step. Let's take a look at how it's implemented.

Getting ready

This section will fill out the core of the base classes and functionality we need for training the SimGAN architecture. The following files and structure should be included in your current directory:

├── data
├── docker
│   ├── build.sh
│   ├── clean.sh
│   ├── Dockerfile
│   └── kaggle.json
├── out
├── README.md
├── run.sh
└── src
    ├── discriminator.py
    ├── gan.py
    ├── generator.py
    └── loss.py

How to do it...

The GAN model is much simpler to build than the generator and discriminator. Essentially, this class will put the generator and discriminator into adversarial training along with the custom loss functions.

Take the following steps:

Use the python3 interpreter and...

Training the SimGAN network

Now that we've built the infrastructure, we can develop the training methodology in the train script. In this section, we'll also create the Python run script and the shell script that will be used for running everything in the Docker environment.

Getting ready

We're almost at the end! So, make sure you have every one of the following directories and files in your $HOME directory:

├── data
├── docker
│   ├── build.sh
│   ├── clean.sh
│   ├── Dockerfile
│   └── kaggle.json
├── out
│   ├── GAN_Model.png
│   └── Generator_Model.png
├── README.md
├── run.sh
└── src
    ├── discriminator.py
    ├── gan.py
    ├── generator.py
    ├── loss.py
    ├── run.py
    └── train.py

How to do it...

The training script will read in data, process the data for input into the networks, and then train the SimGAN model.

Initialization

Take the following steps to initialize the training class and the basic functionality needed to train the models:

Create a train.py file and place the following imports...

Exercise

Create a way to pre-train the refiner and discriminator networks.

Author (1)

Josh Kalin

Josh Kalin is a Physicist and Technologist focused on the intersection of robotics and machine learning. Josh works on advanced sensors, industrial robotics, machine learning, and automated vehicle research projects. Josh holds degrees in Physics, Mechanical Engineering, and Computer Science. In his free time, he enjoys working on cars (has owned 36 vehicles and counting), building computers, and learning new techniques in robotics and machine learning (like writing this book).