Q-learning

Added: 2015-12-28
2
48
2941
Created with: MindManager
Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a gi...
Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP). It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter.
Arts & Entertainment
Books & Writing
Career
Communication
Creativity & Innovation
Finance & Economics
Geography & Travel
Health & Home
History
Languages
Leadership & Management
Mathematics
Personal Development
Politics & Law
Productivity
Project Management
Sales & Marketing
Science & Technology
Teaching & Learning
Biggerplate logo

Go Further with Mind Mapping: Upgrade to Biggerplate Plus!

Software Courses
250+ Premium Videos
Live Virtual Events
Software Discounts
View Details

Upcoming Webinars:

Mind Mapping Module 1: Principles
Mind Mapping Module 1: Principles
In this live training webinar you will learn the core principles for effective mind mapping, whether…
Speaker: Liam Hughes
Date 07 May 2025
Mind Mapping Module 2: Complexity
Mind Mapping Module 2: Complexity
In this live training webinar you will learn how mind mapping can help you tackle complexity and wor…
Speaker: Liam Hughes
Date 14 May 2025
Copyright 2008 - 2025 Biggerplate.com Ltd. All rights reserved.