Approximate Dynamic Programming and Reinforcement Learning - Algorithms, Analysis and an Application
Problems involving optimal sequential making in uncertain dynamic systems arise in domains such as engineering, science and economics. Such problems can often be cast in the framework of Markov Decision Process (MDP). Solving an MDP requires computing the optimal value function and the optimal polic...
Main Author: | |
---|---|
Other Authors: | |
Language: | en_US |
Published: |
2018
|
Subjects: | |
Online Access: | http://etd.iisc.ernet.in/2005/3963 http://etd.iisc.ernet.in/abstracts/4850/G27265-Abs.pdf |