Efficient TD3 based path planning of mobile robot in dynamic environments using prioritized experience replay and LSTM

Bibliographic Details
Published in: Scientific Reports
Main Authors: Yunhan Lin, Zhijie Zhang, Yijian Tan, Hao Fu, Huasong Min
Format: Article
Language: English
Published: Nature Portfolio 2025-05-01
Online Access: https://doi.org/10.1038/s41598-025-02244-z
Description
Summary: To address the challenges of sample utilization efficiency and temporal dependencies, this paper proposes an efficient path planning method for mobile robots in dynamic environments based on an improved twin delayed deep deterministic policy gradient (TD3) algorithm. The proposed method, named PL-TD3, integrates prioritized experience replay (PER) and long short-term memory (LSTM) neural networks, which improve sample efficiency and the ability to handle time-series data. To verify the effectiveness of the proposed method, simulation and practical experiments were designed and conducted. The simulation experiments included both static and dynamic obstacles in the test environment, along with experiments to assess generalization capability. The algorithm demonstrated superior performance in both execution time and path efficiency. The practical experiments, based on the assumptions from the simulation tests, further confirmed that PL-TD3 improves the effectiveness and robustness of path planning for mobile robots in dynamic environments.
ISSN: 2045-2322
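
Note: The summary names prioritized experience replay (PER) as one of the two additions to TD3 but does not describe how transitions are prioritized. As an illustration only, the following is a minimal sketch of the commonly used proportional PER scheme, in which a transition is sampled with probability proportional to its priority raised to a power alpha and the resulting bias is corrected with importance-sampling weights. The class name `PrioritizedReplayBuffer`, its method names, and the default values of `alpha`, `beta`, and `eps` are assumptions made for this example and are not taken from the paper.

```python
import random


class PrioritizedReplayBuffer:
    """Hypothetical proportional PER buffer (list-based, O(N) sampling)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha      # 0 gives uniform sampling, 1 gives fully proportional
        self.eps = eps          # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0            # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions receive the current maximum priority so that they
        # are likely to be replayed before their TD error is known.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4, rng=random):
        # Sampling probability: P(i) = p_i^alpha / sum_k p_k^alpha
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        idxs = rng.choices(range(len(self.data)), weights=probs, k=batch_size)

        # Importance-sampling weights: w_i = (N * P(i))^(-beta),
        # normalized by the largest weight in the batch.
        n = len(self.data)
        weights = [(n * probs[i]) ** (-beta) for i in idxs]
        max_w = max(weights)
        weights = [w / max_w for w in weights]

        batch = [self.data[i] for i in idxs]
        return batch, idxs, weights

    def update_priorities(self, idxs, td_errors):
        # After a learning step, the priority becomes |TD error| + eps.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a TD3-style training loop, the critic loss for each sampled transition would typically be multiplied by its returned weight, and `update_priorities` would be called with the new TD errors. How PL-TD3 computes those errors from its twin critics is not stated in this record.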