Approximate Dynamic Programming and Reinforcement Learning - Algorithms, Analysis and an Application

Problems involving optimal sequential making in uncertain dynamic systems arise in domains such as engineering, science and economics. Such problems can often be cast in the framework of Markov Decision Process (MDP). Solving an MDP requires computing the optimal value function and the optimal polic...

Full description

Bibliographic Details
Main Author: Lakshminarayanan, Chandrashekar
Other Authors: Bhatnagar, Shalabh
Language:en_US
Published: 2018
Subjects:
Online Access:http://etd.iisc.ernet.in/2005/3963
http://etd.iisc.ernet.in/abstracts/4850/G27265-Abs.pdf