Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
Edge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian tra...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9082663/ |
id |
doaj-08091d6560c442ddb6da922e1aea5dd3 |
---|---|
record_format |
Article |
spelling |
doaj-08091d6560c442ddb6da922e1aea5dd32021-03-30T01:46:11ZengIEEEIEEE Access2169-35362020-01-018833218333210.1109/ACCESS.2020.29914359082663Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal GraphXiangyu Zou0https://orcid.org/0000-0002-2993-9498Bin Sun1https://orcid.org/0000-0003-0652-3999Duan Zhao2https://orcid.org/0000-0002-9679-3943Zongwei Zhu3https://orcid.org/0000-0003-3607-2631Jinjin Zhao4https://orcid.org/0000-0002-7047-0775Yongxin He5School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSuzhou Institute for Advanced Study, University of Science and Technology of China, Suzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaEdge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian trajectory in the dynamic scene, so as to avoid collision. However, predicting pedestrian trajectories in dynamic scenes is not an easy task, because it is necessary to comprehensively consider the spatial-temporal structure of human-environment interaction, visual attention, and the multi-modal behavior of human walking. In this paper, a scalable spatial-temporal graph generation adversarial network architecture (STG-GAN) is introduced, which can comprehensively consider the influence of human-environment interaction and generate a reasonable multi-modal prediction trajectory. First, we use LSTM nodes to flexibly transform the spatial-temporal graph of human-environment interactions into feed-forward differentiable feature coding, and innovatively propose the global node to integrate scene context information. Then, we capture the relative importance of global interactions on pedestrian trajectories through scaled dot product attention, and use recurrent sequence modeling and generative adversarial network architecture for common training, so as to generate reasonable pedestrian future trajectory distributions based on rich mixed features. Experiments on public data sets show that STG-GAN is superior to previous work in terms of accuracy, reasoning speed and rationality of trajectory prediction.https://ieeexplore.ieee.org/document/9082663/Trajectory predictionspatial-temporal graphgenerative adversarial networkglobal nodescaled dot product attention |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiangyu Zou Bin Sun Duan Zhao Zongwei Zhu Jinjin Zhao Yongxin He |
spellingShingle |
Xiangyu Zou Bin Sun Duan Zhao Zongwei Zhu Jinjin Zhao Yongxin He Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph IEEE Access Trajectory prediction spatial-temporal graph generative adversarial network global node scaled dot product attention |
author_facet |
Xiangyu Zou Bin Sun Duan Zhao Zongwei Zhu Jinjin Zhao Yongxin He |
author_sort |
Xiangyu Zou |
title |
Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph |
title_short |
Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph |
title_full |
Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph |
title_fullStr |
Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph |
title_full_unstemmed |
Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph |
title_sort |
multi-modal pedestrian trajectory prediction for edge agents based on spatial-temporal graph |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Edge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian trajectory in the dynamic scene, so as to avoid collision. However, predicting pedestrian trajectories in dynamic scenes is not an easy task, because it is necessary to comprehensively consider the spatial-temporal structure of human-environment interaction, visual attention, and the multi-modal behavior of human walking. In this paper, a scalable spatial-temporal graph generation adversarial network architecture (STG-GAN) is introduced, which can comprehensively consider the influence of human-environment interaction and generate a reasonable multi-modal prediction trajectory. First, we use LSTM nodes to flexibly transform the spatial-temporal graph of human-environment interactions into feed-forward differentiable feature coding, and innovatively propose the global node to integrate scene context information. Then, we capture the relative importance of global interactions on pedestrian trajectories through scaled dot product attention, and use recurrent sequence modeling and generative adversarial network architecture for common training, so as to generate reasonable pedestrian future trajectory distributions based on rich mixed features. Experiments on public data sets show that STG-GAN is superior to previous work in terms of accuracy, reasoning speed and rationality of trajectory prediction. |
topic |
Trajectory prediction spatial-temporal graph generative adversarial network global node scaled dot product attention |
url |
https://ieeexplore.ieee.org/document/9082663/ |
work_keys_str_mv |
AT xiangyuzou multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT binsun multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT duanzhao multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT zongweizhu multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT jinjinzhao multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT yongxinhe multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph |
_version_ |
1724186399331057664 |