rl_theory

We are currently following this book: RL Theory. The slides from the past meetings are available here:

2020/08/27: MDPs
2020/09/08: Value Iteration, Code for Value Iteration
2020/09/23: Policy Iteration
2020/10/06: Q* = TQ* Proof, Concentration Inequalities (Section 1.1, 1.2)
2020/10/20: Sample Complexity