Structured Stochastic Bandits

In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlated arms. In particular, we investigate the case where the expected rewards are a Lipschitz function of the arm, and the learning-to-rank problem as viewed from a MAB perspective. For the former, we deri...


Bibliographic Details
Main Author:	Magureanu, Stefan
Format:	Others
Language:	English
Published:	KTH, Reglerteknik 2016
Subjects:	Multi-armed bandits; Learning to rank; Reinforcement learning; Lipschitz bandits
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-182816 http://nbn-resolving.de/urn:isbn:978-91-7595-880-4
