Regret Minimization in Structured Reinforcement Learning

We consider a class of sequential decision making problems in the presence of uncertainty, which belongs to the field of Reinforcement Learning (RL). Specifically, we study discrete Markov decision Processes (MDPs) which model a decision maker or agent that interacts with a stochastic and dynamic en...

Full description

Bibliographic Details
Main Author: Tranos, Damianos
Format: Others
Language:English
Published: KTH, Reglerteknik 2021
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-296238
http://nbn-resolving.de/urn:isbn:978-91-7873-839-7