Diversity Evolutionary Policy Deep Reinforcement Learning

The reinforcement learning algorithms based on policy gradient may fall into local optimal due to gradient disappearance during the update process, which in turn affects the exploration ability of the reinforcement learning agent. In order to solve the above problem, in this paper, the cross-entropy...

Full description

Bibliographic Details
Main Authors: Jian Liu, Liming Feng
Format: Article
Language:English
Published: Hindawi Limited 2021-01-01
Series:Computational Intelligence and Neuroscience
Online Access:http://dx.doi.org/10.1155/2021/5300189