UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient

Deep deterministic policy gradient (DDPG) algorithm is a reinforcement learning method, which has been widely used in UAV path planning. However, the critic network of DDPG is frequently updated in the training process. It leads to an inevitable overestimation problem and increases the training comp...

Full description

Bibliographic Details
Main Authors:	Gu, F. (Author), Liu, H.-L (Author), Shi, H. (Author), Wu, R. (Author)
Format:	Article
Language:	English
Published:	Hindawi Limited 2022
Subjects:	Complex networks Critic network Deterministics Gradient algorithm Learning methods Motion planning Policy gradient Policy gradient methods Real environments Reinforcement learning Reinforcement learning method Training process UAV missions Unmanned aerial vehicles (UAV)
Online Access:	View Fulltext in Publisher

Internet

View Fulltext in Publisher

UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient

Internet

Similar Items