Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Reinforcement Learning with TensorFlow

You're reading from  Reinforcement Learning with TensorFlow

Product type Book
Published in Apr 2018
Publisher Packt
ISBN-13 9781788835725
Pages 334 pages
Edition 1st Edition
Languages
Author (1):
Sayon Dutta Sayon Dutta
Profile icon Sayon Dutta

Table of Contents (21) Chapters

Title Page
Packt Upsell
Contributors
Preface
1. Deep Learning – Architectures and Frameworks 2. Training Reinforcement Learning Agents Using OpenAI Gym 3. Markov Decision Process 4. Policy Gradients 5. Q-Learning and Deep Q-Networks 6. Asynchronous Methods 7. Robo Everything – Real Strategy Gaming 8. AlphaGo – Reinforcement Learning at Its Best 9. Reinforcement Learning in Autonomous Driving 10. Financial Portfolio Management 11. Reinforcement Learning in Robotics 12. Deep Reinforcement Learning in Ad Tech 13. Reinforcement Learning in Image Processing 14. Deep Reinforcement Learning in NLP 1. Further topics in Reinforcement Learning 2. Other Books You May Enjoy Index

Chapter 12. Deep Reinforcement Learning in Ad Tech

So far in this unit of discussing reinforcement learning application research domains, we saw how reinforcement learning is disrupting the field of robotics, autonomous driving, financial portfolio management, and solving games of extremely high complexity, such as Go. Another important domain which is likely to be disrupted by reinforcement learning is advertisement technology.

Before getting into the details of the problem statement and it's solution based on reinforcement learning, let's understand the challenges, business models, and bidding strategies involved, which will work as a basic prerequisite in understanding the problem that we will try to solve using a reinforcement learning framework. The topics that we will be covering in this chapter are as follows:

  • Computational advertising challenges and bidding strategies

  • Real-time bidding by reinforcement learning in display advertising

Computational advertising challenges and bidding strategies


Advertising is a mode of conveying information. The core task of computational advertising is to find the best match between a given user in a given context and an advertisement, where the following factors apply:

  • Context/Auctioneer: A platform visited by a user and that is deemed fit for advertisements, for example:
    • A user using a search engine. Therefore, sponsored advertisements in such a scenario form a good plan.
    • A user reading a web page. Therefore, display advertisements fit such cases.
    • A user watching any video (movie, clips, short videos, and so on). Therefore, short video advertisements are good.
  • Constraints: The biggest constraint of all for the advertiser is limited budget and limited time period.

The core challenges to meet regarding the preceding goals are as follows:

  • Designing markets and exchanges that can facilitate the task and maximize value for all the participating stake holders, which are users, advertisers, and...

Real-time bidding by reinforcement learning in display advertising


Online displays are majorly served through real-time bidding where each impression of the display advertisement is auctioned in real time simultaneously when generated from a user visit. Placing a bid automatically, and in real time, is highly critical for advertisers to maximize their profits. Thus, a learning algorithm needs to be devised that can devise an optimal learning strategy in real time based on historical data, so that dynamic allocation of the budget takes place across different impressions according to immediate and future returns. Here, we will discuss formulating a bid-decision process in terms of a reinforcement learning framework published in Real-Time Bidding by Reinforcement Learning in Display Advertising by Cai et. al. 2017.

In this research by Cai et. al., the machine bidding in the context of display advertising is considered, where real-time bidding is a highly challenging task because, in the case...

Summary


In this chapter, we understood the basic concepts and challenges in the domain of advertising technology. We also learned about the relevant business models, such as CPC, CPM, and CPA, and real-time strategy bidding and why there's a need for an autonomous agent to automate the process. Moreover, we discussed a basic approach to converting the problem state of real-time bidding in online advertising into a reinforcement-learning framework. This is a totally new domain for reinforcement learning to disrupt. Many more exploratory works utilizing reinforcement learning for advertising technology, and their results, are yet to be published.

In the next chapter, we will study how reinforcement learning is being used in the field of computer vision, especially for object detection.

 

lock icon The rest of the chapter is locked
You have been reading a chapter from
Reinforcement Learning with TensorFlow
Published in: Apr 2018 Publisher: Packt ISBN-13: 9781788835725
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}