Counterfactual-Based Action Evaluation Algorithm in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) algorithms have made great achievements in various scenarios, but there are still many problems in solving sequential social dilemmas (SSDs). In SSDs, the agent’s actions not only change the instantaneous state of the environment but also affect the latent s...

Full description

Bibliographic Details
Main Authors: Guo, T. (Author), Jiang, H. (Author), Yuan, Y. (Author), Zhao, P. (Author)
Format: Article
Language:English
Published: MDPI 2022
Subjects:
Online Access:View Fulltext in Publisher