
You're reading from Applied Deep Learning with Keras

Product type: Book
Published in: Apr 2019
Reading level: Intermediate
ISBN-13: 9781838555078
Edition: 1st Edition
Authors (3):
Ritesh Bhagwat

Ritesh Bhagwat has a master's degree in applied mathematics with a specialization in computer science. He has over 14 years of experience in data-driven technologies and has led and been a part of complex projects ranging from data warehousing and business intelligence to machine learning and artificial intelligence. He has worked with top-tier global consulting firms as well as large multinational financial institutions. Currently, he works as a data scientist. Besides work, he enjoys playing and watching cricket and loves to travel. He is also deeply interested in Bayesian statistics.

Mahla Abdolahnejad

Mahla Abdolahnejad is a Ph.D. candidate in systems and computer engineering with Carleton University, Canada. She also holds a bachelor's degree and a master's degree in biomedical engineering, which first exposed her to the field of artificial intelligence and artificial neural networks, in particular. Her Ph.D. research is focused on deep unsupervised learning for computer vision applications. She is particularly interested in exploring the differences between a human's way of learning from the visual world and a machine's way of learning from the visual world, and how to push machine learning algorithms toward learning and thinking like humans.

Matthew Moocarme

Matthew Moocarme is an accomplished data scientist with more than eight years of experience in creating and utilizing machine learning models. He comes from a background in the physical sciences, in which he holds a Ph.D. in physics from the Graduate Center of CUNY. Currently, he leads a team of data scientists and engineers in the media and advertising space to build and integrate machine learning models for a variety of applications. In his spare time, Matthew enjoys sharing his knowledge with the data science community through published works, conference presentations, and workshops.

Chapter 3. Deep Learning with Keras

Note

Learning Objectives

By the end of this chapter, you will be able to:

  • Define a sequential model in Keras

  • Develop single-layer and multi-layer Keras models

  • Evaluate a trained model

  • Explain overfitting and underfitting

  • Perform early stopping as a technique to reduce overfitting

Note

In this chapter, we will learn how to develop single-layer and multi-layer models. We will learn how to evaluate trained models and determine whether they are overfitting or not.

Introduction


In this chapter, you will learn how to implement your first neural network using Keras. This chapter covers the basics of deep learning and provides the foundation you need to build highly complex neural network architectures. We start by extending the logistic regression model to a simple single-layer neural network and then proceed to more complicated neural networks with multiple hidden layers. In the process, you will learn about the basic concepts underlying neural networks: forward propagation for making predictions, computing the loss, backpropagation for computing the derivatives of the loss with respect to the model parameters, and finally gradient descent for learning optimal parameters for the model. You will also learn about the various choices available for building and training a neural network in terms of activation functions, loss functions, and optimizers.

Furthermore, you will learn how to evaluate your model while understanding issues such as overfitting...

Building Your First Neural Network


In this section, you will first learn about the representations and concepts of deep learning, such as forward propagation, backpropagation, and gradient descent. We will not delve deeply into these concepts, as that isn't required for this book. However, this coverage will help anyone who wants to apply deep learning to a problem.
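
To make the three concepts concrete, the following is a minimal NumPy sketch rather than the book's own implementation. It assumes a network with one tanh hidden layer, a sigmoid output, and binary cross-entropy loss; the function name `train_one_hidden_layer`, the synthetic dataset, and all hyperparameter values are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_one_hidden_layer(X, y, hidden_units=4, lr=0.5, epochs=500, seed=0):
    """Hypothetical helper: full-batch gradient descent on a one-hidden-layer net."""
    rng = np.random.default_rng(seed)
    n, n_features = X.shape
    W1 = rng.normal(scale=0.5, size=(n_features, hidden_units))
    b1 = np.zeros(hidden_units)
    W2 = rng.normal(scale=0.5, size=(hidden_units, 1))
    b2 = np.zeros(1)
    losses = []
    for _ in range(epochs):
        # Forward propagation: inputs -> hidden activations -> predicted probability
        A1 = np.tanh(X @ W1 + b1)
        y_hat = sigmoid(A1 @ W2 + b2)

        # Loss: mean binary cross-entropy (eps avoids log(0))
        eps = 1e-12
        loss = -np.mean(y * np.log(y_hat + eps) + (1 - y) * np.log(1 - y_hat + eps))
        losses.append(loss)

        # Backpropagation: derivatives of the loss with respect to each parameter
        dZ2 = (y_hat - y) / n
        dW2 = A1.T @ dZ2
        db2 = dZ2.sum(axis=0)
        dZ1 = (dZ2 @ W2.T) * (1 - A1 ** 2)   # tanh'(z) = 1 - tanh(z)^2
        dW1 = X.T @ dZ1
        db1 = dZ1.sum(axis=0)

        # Gradient descent: step each parameter against its gradient
        W1 -= lr * dW1
        b1 -= lr * db1
        W2 -= lr * dW2
        b2 -= lr * db2
    return (W1, b1, W2, b2), losses

# Synthetic two-feature dataset, made up for illustration only
data_rng = np.random.default_rng(42)
X = data_rng.normal(size=(200, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(float).reshape(-1, 1)

params, losses = train_one_hidden_layer(X, y)
print(f"initial loss: {losses[0]:.4f}, final loss: {losses[-1]:.4f}")
```

Keras performs the backpropagation and update steps automatically, so in practice only the architecture, the loss, and the optimizer need to be specified.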

We will then move on to implementing neural networks using Keras. We will stick to the simplest case, which is a neural network with a single hidden layer. You will learn how to define a model in Keras, choose the hyperparameters, and then train your model. At the end of this section, you will have the opportunity to practice what you have learned by implementing a neural network in Keras to perform classification on a dataset and observing how neural networks can outperform simpler models such as logistic regression.
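
As a rough illustration of the define, compile, and train workflow described above, here is a minimal sketch. It assumes the Keras API bundled with TensorFlow (`from tensorflow import keras`); the synthetic dataset, the layer size of 8 units, and the other hyperparameter values are placeholders rather than choices made by the book.

```python
import numpy as np
from tensorflow import keras

# Synthetic binary classification data (illustrative only); the label depends
# on the sign of the product of the features, so it is not linearly separable
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2)).astype("float32")
y = (X[:, 0] * X[:, 1] > 0).astype("float32")

# Define the model: one hidden layer followed by a sigmoid output unit
model = keras.Sequential([
    keras.Input(shape=(2,)),
    keras.layers.Dense(8, activation="tanh"),
    keras.layers.Dense(1, activation="sigmoid"),
])

# Choose the optimizer, loss function, and reported metric
model.compile(optimizer="sgd", loss="binary_crossentropy", metrics=["accuracy"])

# Train the model, then predict probabilities for a few examples
history = model.fit(X, y, epochs=20, batch_size=16, verbose=0)
probabilities = model.predict(X[:5], verbose=0)
print(probabilities.ravel())
```

Removing the hidden `Dense` layer leaves a single sigmoid unit, which is the logistic regression model this section uses as its starting point.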

Logistic Regression to a Deep Neural Network

You learned in the previous chapter about the logistic regression...

Model Evaluation


In this section, we will move on to multi-layer or deep neural networks while learning about techniques for assessing the performance of a model. As you may have already realized, there are many hyperparameter choices to be made when building a deep neural network. Some very important challenges of applied deep learning are how to find the right values for the number of hidden layers, the number of units in each hidden layer, the type of activation function to use for each layer, the type of optimizer and loss function for training the network, among others. Model evaluation is required for making these decisions. By performing model evaluation, you can say whether a specific deep architecture or a specific set of hyperparameters is working poorly or well on a particular dataset, and therefore decide whether to change them or not.
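
One way to compare hyperparameter choices is to train each candidate on one part of the data and evaluate it on a held-out part. The sketch below assumes TensorFlow's bundled Keras; the helper `build_model`, the two candidate architectures, the 80/20 split, and the synthetic data are all hypothetical and chosen only for illustration.

```python
import numpy as np
from tensorflow import keras

# Synthetic data (illustrative only), split 80/20 into training and test sets
rng = np.random.default_rng(1)
X = rng.normal(size=(400, 2)).astype("float32")
y = (X[:, 0] * X[:, 1] > 0).astype("float32")
split = int(0.8 * len(X))
X_train, X_test = X[:split], X[split:]
y_train, y_test = y[:split], y[split:]

def build_model(hidden_layers, units):
    """Hypothetical helper: a classifier with a configurable number of hidden layers."""
    model = keras.Sequential([keras.Input(shape=(2,))])
    for _ in range(hidden_layers):
        model.add(keras.layers.Dense(units, activation="relu"))
    model.add(keras.layers.Dense(1, activation="sigmoid"))
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    return model

# Train each candidate and record accuracy on both training and held-out data
results = {}
for hidden_layers, units in [(1, 4), (2, 8)]:
    model = build_model(hidden_layers, units)
    model.fit(X_train, y_train, epochs=10, batch_size=32, verbose=0)
    _, train_acc = model.evaluate(X_train, y_train, verbose=0)
    _, test_acc = model.evaluate(X_test, y_test, verbose=0)
    results[(hidden_layers, units)] = {"train_acc": train_acc, "test_acc": test_acc}

for config, scores in results.items():
    print(config, scores)
```

A training accuracy that is much higher than the test accuracy suggests the configuration is fitting the training examples more closely than it generalizes.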

Furthermore, you will learn about overfitting and underfitting in this section. These are two very important issues that can arise when building...
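
Early stopping, listed in this chapter's learning objectives as a technique to reduce overfitting, can be sketched with the `EarlyStopping` callback in Keras. The example assumes TensorFlow's bundled Keras; the synthetic dataset, the network size, the patience of 5 epochs, and the 20% validation split are illustrative assumptions.

```python
import numpy as np
from tensorflow import keras

# Synthetic data (illustrative only): only the first of ten features is informative
rng = np.random.default_rng(2)
X = rng.normal(size=(300, 10)).astype("float32")
y = (X[:, 0] > 0).astype("float32")

model = keras.Sequential([
    keras.Input(shape=(10,)),
    keras.layers.Dense(32, activation="relu"),
    keras.layers.Dense(32, activation="relu"),
    keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Stop once the validation loss has not improved for 5 consecutive epochs,
# and roll back to the weights from the best epoch seen so far
early_stopping = keras.callbacks.EarlyStopping(
    monitor="val_loss", patience=5, restore_best_weights=True
)

max_epochs = 100
history = model.fit(
    X, y,
    validation_split=0.2,   # hold out 20% of the data to monitor val_loss
    epochs=max_epochs,
    batch_size=32,
    callbacks=[early_stopping],
    verbose=0,
)
epochs_run = len(history.history["val_loss"])
print(f"training ran for {epochs_run} of at most {max_epochs} epochs")
```

The `patience` value trades off stopping too early on a noisy validation curve against continuing to train after the model has started to overfit.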

Summary


In this chapter, you learned about the basics of deep learning, including the common representations and terminology and the essential underlying concepts. You learned how forward propagation in neural networks works and how it is used for predicting outputs. You learned about the loss function as a measure of model performance and how backpropagation is used to compute the derivatives of the loss function with respect to the model parameters. Finally, you learned about gradient descent, which uses the gradients computed by backpropagation to gradually update the model parameters. In addition to basic theory and concepts, you also learned how to implement and train shallow and deep neural networks with Keras and how to use a trained network to predict the output for a given input. You also learned how to evaluate the overall performance of the network over all data examples, and why evaluating a model on training examples can be misleading. You learned about...
