Decision-making models on perceptual uncertainty with distributional reinforcement learning

Decision-making for autonomous vehicles in the presence of obstacle occlusions is difficult because the lack of accurate information affects the judgment. Existing methods may lead to overly conservative strategies and time-consuming computations that cannot be balanced with efficiency. We propose t...

全面介紹

書目詳細資料
發表在:Green Energy and Intelligent Transportation
Main Authors: Shuyuan Xu, Qiao Liu, Yuhui Hu, Mengtian Xu, Jiachen Hao
格式: Article
語言:英语
出版: Elsevier 2023-04-01
主題:
在線閱讀:http://www.sciencedirect.com/science/article/pii/S2773153722000627
_version_ 1852672789593456640
author Shuyuan Xu
Qiao Liu
Yuhui Hu
Mengtian Xu
Jiachen Hao
author_facet Shuyuan Xu
Qiao Liu
Yuhui Hu
Mengtian Xu
Jiachen Hao
author_sort Shuyuan Xu
collection DOAJ
container_title Green Energy and Intelligent Transportation
description Decision-making for autonomous vehicles in the presence of obstacle occlusions is difficult because the lack of accurate information affects the judgment. Existing methods may lead to overly conservative strategies and time-consuming computations that cannot be balanced with efficiency. We propose to use distributional reinforcement learning to hedge the risk of strategies, optimize the worse cases, and improve the efficiency of the algorithm so that the agent learns better actions. A batch of smaller values is used to replace the average value to optimize the worse case, and combined with frame stacking, we call it Efficient-Fully parameterized Quantile Function (E-FQF). This model is used to evaluate signal-free intersection crossing scenarios and makes more efficient moves and reduces the collision rate compared to conventional reinforcement learning algorithms in the presence of perceived occlusion. The model also has robustness in the case of data loss compared to the method with embedded long and short term memory.
format Article
id doaj-art-57ff9c488bee4ffca80ca3f30c089335
institution Directory of Open Access Journals
issn 2773-1537
language English
publishDate 2023-04-01
publisher Elsevier
record_format Article
spelling doaj-art-57ff9c488bee4ffca80ca3f30c0893352025-08-19T21:32:25ZengElsevierGreen Energy and Intelligent Transportation2773-15372023-04-012210006210.1016/j.geits.2022.100062Decision-making models on perceptual uncertainty with distributional reinforcement learningShuyuan Xu0Qiao Liu1Yuhui Hu2Mengtian Xu3Jiachen Hao4School of Mechanical Engineering, Beijing Institute of Technology, 100081 Beijing, ChinaSchool of Mechanical Engineering, Beijing Institute of Technology, 100081 Beijing, ChinaSchool of Mechanical Engineering, Beijing Institute of Technology, 100081 Beijing, China; Corresponding author.School of Mechanical Engineering, Beijing Institute of Technology, 100081 Beijing, China; Beili Huidong (Changshu) Vehicle Technology Company, 215513 Suzhou, ChinaSchool of Mechanical Engineering, Beijing Institute of Technology, 100081 Beijing, ChinaDecision-making for autonomous vehicles in the presence of obstacle occlusions is difficult because the lack of accurate information affects the judgment. Existing methods may lead to overly conservative strategies and time-consuming computations that cannot be balanced with efficiency. We propose to use distributional reinforcement learning to hedge the risk of strategies, optimize the worse cases, and improve the efficiency of the algorithm so that the agent learns better actions. A batch of smaller values is used to replace the average value to optimize the worse case, and combined with frame stacking, we call it Efficient-Fully parameterized Quantile Function (E-FQF). This model is used to evaluate signal-free intersection crossing scenarios and makes more efficient moves and reduces the collision rate compared to conventional reinforcement learning algorithms in the presence of perceived occlusion. The model also has robustness in the case of data loss compared to the method with embedded long and short term memory.http://www.sciencedirect.com/science/article/pii/S2773153722000627Autonomous vehiclesReinforcement learningSensing occlusionPartially observable markov decision processUnsiganlized intersections
spellingShingle Shuyuan Xu
Qiao Liu
Yuhui Hu
Mengtian Xu
Jiachen Hao
Decision-making models on perceptual uncertainty with distributional reinforcement learning
Autonomous vehicles
Reinforcement learning
Sensing occlusion
Partially observable markov decision process
Unsiganlized intersections
title Decision-making models on perceptual uncertainty with distributional reinforcement learning
title_full Decision-making models on perceptual uncertainty with distributional reinforcement learning
title_fullStr Decision-making models on perceptual uncertainty with distributional reinforcement learning
title_full_unstemmed Decision-making models on perceptual uncertainty with distributional reinforcement learning
title_short Decision-making models on perceptual uncertainty with distributional reinforcement learning
title_sort decision making models on perceptual uncertainty with distributional reinforcement learning
topic Autonomous vehicles
Reinforcement learning
Sensing occlusion
Partially observable markov decision process
Unsiganlized intersections
url http://www.sciencedirect.com/science/article/pii/S2773153722000627
work_keys_str_mv AT shuyuanxu decisionmakingmodelsonperceptualuncertaintywithdistributionalreinforcementlearning
AT qiaoliu decisionmakingmodelsonperceptualuncertaintywithdistributionalreinforcementlearning
AT yuhuihu decisionmakingmodelsonperceptualuncertaintywithdistributionalreinforcementlearning
AT mengtianxu decisionmakingmodelsonperceptualuncertaintywithdistributionalreinforcementlearning
AT jiachenhao decisionmakingmodelsonperceptualuncertaintywithdistributionalreinforcementlearning