Links to some lecture videos
| 1 | Modeling Sequential Decision Making | ||
| 2 | Planning Problem and Dynamic Programming | Video | |
| 3 | Model-free policy evaluation | Video | |
| 4 | Model-free control | Video | |
| 5 | Stochastic Approximation theory | ||
| 6 | Function Approximation | ||
| 7 | Game Theory: Multiple Agents | Video | |
| 8 | Policy Gradient Methods | Video | |
| 9 | Information theory and Natural Policy gradient | Video | |
| 10 | KL divergence and TRPO | Video | |
| 11 | Importance sampling and PPO | Video | |
| 12 | Exploration vs Exploitation 1 | Video | |
| 13 | Exploration vs Exploitation 2 | ||
| 14 | Model-based algorithms and Sample Complexity | ||
