Feature Adaptation Algorithms for Reinforcement Learning with Applications to Wireless Sensor Networks And Road Traffic Control
Many sequential decision making problems under uncertainty arising in engineering, science and economics are often modelled as Markov Decision Processes (MDPs). In the setting of MDPs, the goal is to and a state dependent optimal sequence of actions that minimizes a certain long-term performance cri...
Main Author: | |
---|---|
Other Authors: | |
Language: | en_US |
Published: |
2017
|
Subjects: | |
Online Access: | http://etd.iisc.ernet.in/handle/2005/2664 http://etd.ncsi.iisc.ernet.in/abstracts/3481/G27183-Abs.pdf |