UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient

Deep deterministic policy gradient (DDPG) algorithm is a reinforcement learning method, which has been widely used in UAV path planning. However, the critic network of DDPG is frequently updated in the training process. It leads to an inevitable overestimation problem and increases the training comp...

Full description

Bibliographic Details
Main Authors: Gu, F. (Author), Liu, H.-L (Author), Shi, H. (Author), Wu, R. (Author)
Format: Article
Language:English
Published: Hindawi Limited 2022
Subjects:
Online Access:View Fulltext in Publisher