Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph

Edge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian tra...

Full description

Bibliographic Details
Main Authors:	Xiangyu Zou, Bin Sun, Duan Zhao, Zongwei Zhu, Jinjin Zhao, Yongxin He
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Trajectory prediction spatial-temporal graph generative adversarial network global node scaled dot product attention
Online Access:	https://ieeexplore.ieee.org/document/9082663/

id	doaj-08091d6560c442ddb6da922e1aea5dd3
record_format	Article
spelling	doaj-08091d6560c442ddb6da922e1aea5dd32021-03-30T01:46:11ZengIEEEIEEE Access2169-35362020-01-018833218333210.1109/ACCESS.2020.29914359082663Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal GraphXiangyu Zou0https://orcid.org/0000-0002-2993-9498Bin Sun1https://orcid.org/0000-0003-0652-3999Duan Zhao2https://orcid.org/0000-0002-9679-3943Zongwei Zhu3https://orcid.org/0000-0003-3607-2631Jinjin Zhao4https://orcid.org/0000-0002-7047-0775Yongxin He5School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSuzhou Institute for Advanced Study, University of Science and Technology of China, Suzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaEdge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian trajectory in the dynamic scene, so as to avoid collision. However, predicting pedestrian trajectories in dynamic scenes is not an easy task, because it is necessary to comprehensively consider the spatial-temporal structure of human-environment interaction, visual attention, and the multi-modal behavior of human walking. In this paper, a scalable spatial-temporal graph generation adversarial network architecture (STG-GAN) is introduced, which can comprehensively consider the influence of human-environment interaction and generate a reasonable multi-modal prediction trajectory. First, we use LSTM nodes to flexibly transform the spatial-temporal graph of human-environment interactions into feed-forward differentiable feature coding, and innovatively propose the global node to integrate scene context information. Then, we capture the relative importance of global interactions on pedestrian trajectories through scaled dot product attention, and use recurrent sequence modeling and generative adversarial network architecture for common training, so as to generate reasonable pedestrian future trajectory distributions based on rich mixed features. Experiments on public data sets show that STG-GAN is superior to previous work in terms of accuracy, reasoning speed and rationality of trajectory prediction.https://ieeexplore.ieee.org/document/9082663/Trajectory predictionspatial-temporal graphgenerative adversarial networkglobal nodescaled dot product attention
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Xiangyu Zou Bin Sun Duan Zhao Zongwei Zhu Jinjin Zhao Yongxin He
spellingShingle	Xiangyu Zou Bin Sun Duan Zhao Zongwei Zhu Jinjin Zhao Yongxin He Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph IEEE Access Trajectory prediction spatial-temporal graph generative adversarial network global node scaled dot product attention
author_facet	Xiangyu Zou Bin Sun Duan Zhao Zongwei Zhu Jinjin Zhao Yongxin He
author_sort	Xiangyu Zou
title	Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_short	Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_full	Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_fullStr	Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_full_unstemmed	Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_sort	multi-modal pedestrian trajectory prediction for edge agents based on spatial-temporal graph
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	Edge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian trajectory in the dynamic scene, so as to avoid collision. However, predicting pedestrian trajectories in dynamic scenes is not an easy task, because it is necessary to comprehensively consider the spatial-temporal structure of human-environment interaction, visual attention, and the multi-modal behavior of human walking. In this paper, a scalable spatial-temporal graph generation adversarial network architecture (STG-GAN) is introduced, which can comprehensively consider the influence of human-environment interaction and generate a reasonable multi-modal prediction trajectory. First, we use LSTM nodes to flexibly transform the spatial-temporal graph of human-environment interactions into feed-forward differentiable feature coding, and innovatively propose the global node to integrate scene context information. Then, we capture the relative importance of global interactions on pedestrian trajectories through scaled dot product attention, and use recurrent sequence modeling and generative adversarial network architecture for common training, so as to generate reasonable pedestrian future trajectory distributions based on rich mixed features. Experiments on public data sets show that STG-GAN is superior to previous work in terms of accuracy, reasoning speed and rationality of trajectory prediction.
topic	Trajectory prediction spatial-temporal graph generative adversarial network global node scaled dot product attention
url	https://ieeexplore.ieee.org/document/9082663/
work_keys_str_mv	AT xiangyuzou multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT binsun multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT duanzhao multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT zongweizhu multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT jinjinzhao multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph AT yongxinhe multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph
_version_	1724186399331057664

Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph

Similar Items