
You're reading from Hands-On Neural Networks with Keras

Product type: Book
Published in: Mar 2019
Reading level: Intermediate
Publisher: Packt
ISBN-13: 9781789536089
Edition: 1st Edition
Author (1)
Niloy Purkait

Niloy Purkait is a technology and strategy consultant by profession. He currently resides in the Netherlands, where he offers his consulting services to local and international companies alike. He specializes in integrated solutions involving artificial intelligence, and takes pride in navigating his clients through dynamic and disruptive business environments. He has a master's degree in Strategic Management from Tilburg University, and a full specialization in data science from Michigan University. He has advanced industry-grade certifications from IBM in subjects such as signal processing, cloud computing, and machine and deep learning. He is also pursuing advanced academic degrees in several related fields, and is a self-proclaimed lifelong learner.

Using the Bellman equation

The Bellman equation, proposed by the American mathematician Richard Bellman, is one of the workhorse equations behind deep Q-learning. It essentially allows us to solve the Markov decision process we formalized earlier. Intuitively, the Bellman equation makes one simple assumption: the maximum future reward for a given action, performed at a given state, is the immediate reward plus the maximum future reward for the next state. To draw a parallel to the marshmallow experiments, the maximum possible reward of two marshmallows is attained by the agent through the act of abstaining at the first time step (with a reward of zero marshmallows) and then collecting (with a reward of two marshmallows) at the second time step.
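To make the recursion concrete, here is a minimal sketch of the marshmallow example in plain Python. It is an illustration under stated assumptions, not the book's implementation: the state numbering, the action names, the `TRANSITIONS` table, and the helper functions are all hypothetical, the reward of one marshmallow for eating immediately is assumed for contrast, and the discount factor `GAMMA` is set to 1 because the description above adds the future reward without discounting it.

```python
# Hypothetical two-step marshmallow task; every name here is illustrative.
# States: 0 = first time step, 1 = second time step, 2 = terminal.
GAMMA = 1.0  # assumed discount factor (no discounting, as in the text above)
TERMINAL = 2

# TRANSITIONS[state][action] = (immediate_reward, next_state)
TRANSITIONS = {
    0: {
        "eat": (1, TERMINAL),  # assumed: one marshmallow now, episode ends
        "wait": (0, 1),        # abstain: zero marshmallows, move to step two
    },
    1: {
        "collect": (2, TERMINAL),  # collect two marshmallows at step two
    },
}


def max_future_reward(state):
    """Best total reward obtainable from `state` onwards."""
    if state == TERMINAL:
        return 0.0
    return max(q_value(state, action) for action in TRANSITIONS[state])


def q_value(state, action):
    """Immediate reward plus the (discounted) maximum future reward
    of the next state, following the Bellman recursion."""
    reward, next_state = TRANSITIONS[state][action]
    return reward + GAMMA * max_future_reward(next_state)


if __name__ == "__main__":
    for action in TRANSITIONS[0]:
        print(action, q_value(0, action))
```

Under these assumptions, waiting at the first time step scores higher than eating, because the recursion credits the abstaining action with the two marshmallows collected one step later.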

In other words, given any state-action pair, the quality (Q) of performing an action (a) at the given...
