References
Reinforcing Learning
Preface
1
Multi-armed Bandits
2
Markov Decision Processes
3
Dynamic Programming
References
References
3
Dynamic Programming