Q-learning

Free

Added: 2015-12-28

3641

wojciechkorsak

Created with: MindManager

Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a gi...