On Reinforcement Learning for Turn-based Zero-sum Markov Games

© 2020 Owner/Author. We consider the problem of finding Nash equilibrium for two-player turn-based zero-sum games. Inspired by the AlphaGo Zero (AGZ) algorithm, we develop a Reinforcement Learning based approach. Specifically, we propose Explore-Improve-Supervise (EIS) method that combines "exp...

Full description

Bibliographic Details
Main Authors: Shah, D (Author), Somani, V (Author), Xie, Q (Author), Xu, Z (Author)
Format: Article
Language:English
Published: ACM, 2021-11-02T17:41:39Z.
Subjects:
Online Access:Get fulltext