In the preceding chapter, we explored an area of machine learning (ML) that was new to us: reinforcement learning. We saw how reinforcement learning algorithms can be augmented with neural networks, and how we can learn approximate functions that map game states to the possible actions the agent may take. The network's predictions for these actions are then compared to a moving target variable, which in turn is defined by what we called the Bellman equation. Strictly speaking, this is a self-supervised ML technique, because our predictions are compared against a target computed from the Bellman equation rather than against a set of labeled target variables, as would be the case in a supervised learning approach (for example, game screens labeled with the optimal action to take in each state). The latter, while possible, proves to be much more computationally intensive for the given use case. Now we will...