Feature Adaptation Algorithms for Reinforcement Learning with Applications to Wireless Sensor Networks And Road Traffic Control

Many sequential decision making problems under uncertainty arising in engineering, science and economics are often modelled as Markov Decision Processes (MDPs). In the setting of MDPs, the goal is to and a state dependent optimal sequence of actions that minimizes a certain long-term performance cri...

Full description

Bibliographic Details
Main Author: Prabuchandran, K J
Other Authors: Bhatnagar, Shalabh
Language:en_US
Published: 2017
Subjects:
Online Access:http://etd.iisc.ernet.in/handle/2005/2664
http://etd.ncsi.iisc.ernet.in/abstracts/3481/G27183-Abs.pdf