Q-learning

Free
Added: 2015-12-28
2
48
3556
Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a gi...
Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter.
Biggerplate logo

Improve Your Skills: Upgrade to Biggerplate Plus!

Software Courses
250+ Premium Videos
Live Virtual Events
Software Discounts
View Details

Upcoming Webinars:

Module 1: Principles & Process
Module 1: Principles & Process
In this live training webinar you will learn core principles for effective mind mapping, whether cre…
Speaker: Liam Hughes
Date 07 July 2026
Module 2: Complexity & Creativity
Module 2: Complexity & Creativity
In this live training webinar you will learn how mind mapping can help you tackle complexity and sup…
Speaker: Liam Hughes
Date 14 July 2026