Supervised Reinforcement Learning via Value Function

Using expert samples to improve the performance of reinforcement learning (RL) algorithms has become an active focus of research. However, across application scenarios it is hard to guarantee both the quantity and the quality of expert samples, which prohibits the practical applicatio...

Bibliographic Details
Main Authors: Yaozong Pan, Jian Zhang, Chunhui Yuan, Haitao Yang
Format: Article
Language: English
Published: MDPI AG 2019-04-01
Series: Symmetry
Subjects: DQN
Online Access: https://www.mdpi.com/2073-8994/11/4/590