Search icon
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
TensorFlow Reinforcement Learning Quick Start Guide

You're reading from  TensorFlow Reinforcement Learning Quick Start Guide

Product type Book
Published in Mar 2019
Publisher Packt
ISBN-13 9781789533583
Pages 184 pages
Edition 1st Edition
Languages
Author (1):
Kaushik Balakrishnan Kaushik Balakrishnan
Profile icon Kaushik Balakrishnan

Table of Contents (11) Chapters

Preface 1. Up and Running with Reinforcement Learning 2. Temporal Difference, SARSA, and Q-Learning 3. Deep Q-Network 4. Double DQN, Dueling Architectures, and Rainbow 5. Deep Deterministic Policy Gradient 6. Asynchronous Methods - A3C and A2C 7. Trust Region Policy Optimization and Proximal Policy Optimization 8. Deep RL Applied to Autonomous Driving 9. Assessment 10. Other Books You May Enjoy

Summary

In this chapter, we were introduced to our first continuous actions RL algorithm, DDPG, which also happens to be the first Actor-Critic algorithm in this book. DDPG is an off-policy algorithm, as it uses a replay buffer. We also covered the use of policy gradients to update the actor, and the use of the L2 norm to update the critic. Thus, we have two different neural networks. The actor learns the policy and the critic learns to evaluate the actor's policy, thereby providing a learning signal to the actor. You saw how to compute the gradient of the state-action value, Q(s,a), with respect to the action, and also the gradient of the policy, both of which are combined to evaluate the policy gradient, which is then used to update the actor. We trained the DDPG on the inverted pendulum problem, and the agent learned it very well.

We have come a long way in this chapter...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}