Simulation Based Algorithms For Markov Decision Process And Stochastic Optimization

In Chapter 2, we propose several two-timescale simulation-based actor-critic algorithms for solution of infinite horizon Markov Decision Processes (MDPs) with finite state-space under the average cost criterion. On the slower timescale, all the algorithms perform a gradient search over corresponding...

Full description

Bibliographic Details
Main Author: Abdulla, Mohammed Shahid
Other Authors: Bhatnagar, Shalabh
Language:en_US
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/2005/812