Reinforcement learning with constraint based on mirror descent algorithm

An important issue in reinforcement learning is to make the agent avoid the dangers and risks during the task such as physical collisions. We propose the reinforcement learning algorithm based on the CoMirror algorithm, named CoMDS, for the problem that has a functional constraint. Besides, we modif...

Full description

Bibliographic Details
Main Authors: Megumi Miyashita, Toshiyuki Kondo, Shiro Yano
Format: Article
Language:English
Published: Elsevier 2021-09-01
Series:Results in Control and Optimization
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S266672072100028X