Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits

We consider the nonstationary multiarmed bandit framework and propose a Kolmogorov-Smirnov (KS) test based Thompson sampling (TS) algorithm named TS-KS that actively detects change points and resets the TS parameters once a change is detected. In particular, for the two-armed bandit case, we derive...

Full description

Bibliographic Details
Main Authors: Ghatak, G. (Author), Mohanty, H. (Author), Rahman, A.U (Author)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2022
Subjects:
5G
Online Access:View Fulltext in Publisher