Hands-On Generative Adversarial Networks with Keras

More Information
  • Learn how GANs work and the advantages and challenges of working with them
  • Control the output of GANs with the help of conditional GANs, using embedding and space manipulation
  • Apply GANs to computer vision, NLP, and audio processing
  • Understand how to implement progressive growing of GANs
  • Use GANs for image synthesis and speech enhancement
  • Explore the future of GANs in visual and sonic arts
  • Implement pix2pixHD to turn semantic label maps into photorealistic images

Generative Adversarial Networks (GANs) have revolutionized the fields of machine learning and deep learning. This book will be your first step towards understanding GAN architectures and tackling the challenges involved in training them.

This book opens with an introduction to deep learning and generative models, and their applications in artificial intelligence (AI). You will then learn how to build, evaluate, and improve your first GAN with the help of easy-to-follow examples. The next few chapters will guide you through training a GAN model to produce and improve high-resolution images. You will also learn how to implement conditional GANs that give you the ability to control characteristics of GAN outputs. You will build on your knowledge further by exploring a new training methodology for progressive growing of GANs. Moving on, you'll gain insights into state-of-the-art models in image synthesis, speech enhancement, and natural language generation using GANs. In addition to this, you'll be able to identify GAN samples with TequilaGAN.

By the end of this book, you will be well-versed with the latest advancements in the GAN framework using various examples and datasets, and you will have the skills you need to implement GAN architectures for several tasks and domains, including computer vision, natural language processing (NLP), and audio processing.

Foreword by Ting-Chun Wang, Senior Research Scientist, NVIDIA

  • Discover various GAN architectures using Python and Keras library
  • Understand how GAN models function with the help of theoretical and practical examples
  • Apply your learnings to become an active contributor to open source GAN applications
Page Count 272
Course Length 8 hours 9 minutes
ISBN 9781789538205
Date Of Publication 2 May 2019


Rafael Valle

Rafael Valle is a research scientist at NVIDIA focusing on audio applications. He has years of experience developing high performance machine learning models for data/audio analysis, synthesis and machine improvisation with formal specifications.

Dr. Valle was the first to generate speech samples from scratch with GANs and to show that simple yet efficient techniques can be used to identify GAN samples. He holds an Interdisciplinary PhD in Machine Listening and Improvisation from UC Berkeley, a Master’s degree in Computer Music from the MH-Stuttgart in Germany and a Bachelor’s degree in Orchestral Conducting from UFRJ in Brazil.