An Improved DDPG and Its Application Based on the Double-Layer BP Neural Network

This paper focused on three application problems of the traditional Deep Deterministic Policy Gradient(DDPG) algorithm. That is, the agent exploration is insufficient, the neural network performance is unsatisfied, the agent output fluctuates greatly. In terms of agent exploration strategy, network...

Full description

Bibliographic Details
Main Authors: Mingli Zhang, Yijie Zhang, Zhengjie Gao, Xiaolong He
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9181588/