Week: 20 March - 26 March | Theory and Methods for Reinforcement Learning | Moodle

Section outline

- Select activity Lecture 5
  
  Lecture 5 File
  
  Policy gradient methods II: NPG, Sample Based NPG, TRPO, exploration in policy gradients
- Select activity Dynamic Programming Notebook
  
  Dynamic Programming Notebook File
  
  Exercises on Value Iteration, Policy Iteration, Modified Policy Iteration and Q Learning

Contact
EPFL CH-1015 Lausanne
+41 21 693 11 11

Follow the pulses of EPFL on social networks

© 2023 EPFL, all rights reserved