Artificial neural networks/reinforcement learning
Weekly overview
- Reinforcement Learning for Bandit problems
- Markov Decision Processes and Convergence of SARSA
- Variants of TD-learning methods and eligibility traces
- TD-learning and function approximation
- Policy-gradient methods
- Model-Free Deep Reinforcement Learning
- Model-Based Deep Reinforcement Learning