Optimal learning: Computational procedures for Bayes -adaptive Markov decision processes

This dissertation considers a particular aspect of sequential decision making under uncertainty in which, at each stage, a decision-making agent operating in an uncertain world takes an action that elicits a reinforcement signal and causes the state of the world to change. Optimal learning is a patt...

Full description

Bibliographic Details
Main Author:	Duff, Michael O'Gordon
Language:	ENG
Published:	ScholarWorks@UMass Amherst 2002
Subjects:	Computer science\|Operations research\|Statistics\|Artificial intelligence
Online Access:	https://scholarworks.umass.edu/dissertations/AAI3039353

Internet

https://scholarworks.umass.edu/dissertations/AAI3039353

Optimal learning: Computational procedures for Bayes -adaptive Markov decision processes

Internet

Similar Items