Reader small image

You're reading from  Hands-On Generative Adversarial Networks with Keras

Product typeBook
Published inMay 2019
Reading LevelIntermediate
PublisherPackt
ISBN-139781789538205
Edition1st Edition
Languages
Tools
Right arrow
Author (1)
Rafael Valle
Rafael Valle
author image
Rafael Valle

Rafael Valle is a research scientist at NVIDIA focusing on audio applications. He has years of experience developing high performance machine learning models for data/audio analysis, synthesis and machine improvisation with formal specifications. Dr. Valle was the first to generate speech samples from scratch with GANs and to show that simple yet efficient techniques can be used to identify GAN samples. He holds an Interdisciplinary PhD in Machine Listening and Improvisation from UC Berkeley, a Masters degree in Computer Music from the MH-Stuttgart in Germany and a Bachelors degree in Orchestral Conducting from UFRJ in Brazil.
Read more about Rafael Valle

Right arrow

Progressive Growing of GANs

Progressive Growing of Generative Adversarial Networks (GANs) is a training methodology that is introduced in a context where high-resolution image synthesis was dominated by autoregressive models, such as PixelCNN and Variational Autoencoders (VAEs) – just like the models used in the paper Improved Variational Inference with Inverse Autoregressive Flow (https://arxiv.org/abs/1606.04934).

As we described in earlier chapters, although autoregressive models are able to produce high-quality images, when compared to their counterparts they lack an explicit latent representation that can be directly manipulated. Additionally, due to their autoregressive nature, at the time of inference autoregressive models tend to be slower than their counterparts. On the other hand, VAE-based models have quicker inference but are harder to train, and the VAE-based...

lock icon
The rest of the page is locked
Previous ChapterNext Page
You have been reading a chapter from
Hands-On Generative Adversarial Networks with Keras
Published in: May 2019Publisher: PacktISBN-13: 9781789538205

Author (1)

author image
Rafael Valle

Rafael Valle is a research scientist at NVIDIA focusing on audio applications. He has years of experience developing high performance machine learning models for data/audio analysis, synthesis and machine improvisation with formal specifications. Dr. Valle was the first to generate speech samples from scratch with GANs and to show that simple yet efficient techniques can be used to identify GAN samples. He holds an Interdisciplinary PhD in Machine Listening and Improvisation from UC Berkeley, a Masters degree in Computer Music from the MH-Stuttgart in Germany and a Bachelors degree in Orchestral Conducting from UFRJ in Brazil.
Read more about Rafael Valle