A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning - Foundations and Trends® in Machine Learning
Alborz Geramifard
A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such algorithms.
92 pages
| Media type | Paperback book (perfect-bound) |
| Published | December 19, 2013 |
| ISBN13 | 9781601987600 |
| Publisher | now publishers Inc |
| Pages | 92 |
| Product dimensions | 156 × 234 × 5 mm · 140 g |
| Language | English |