Reinforcement Learning 4: Dynamic programming
Slides:
Colab:
Introduction
- definition
- examples
- planning in an MDP
Policy evaluation
- definition
- synchronous algorithm
Policy iteration
- policy improvement
- definition
- modified policy iteration
Value iteration
- definition
- summary and extensions
#reinforcementlearning #dynamicprogramming #MDPs #policyevaluation #policyiteration #valueiteration #planning