Simulation Based Algorithms For Markov Decision Process And Stochastic Optimization
In Chapter 2, we propose several two-timescale simulation-based actor-critic algorithms for solution of infinite horizon Markov Decision Processes (MDPs) with finite state-space under the average cost criterion. On the slower timescale, all the algorithms perform a gradient search over corresponding...
Main Author: | |
---|---|
Other Authors: | |
Language: | en_US |
Published: |
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/2005/812 |