Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph

Edge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian tra...

Full description

Bibliographic Details
Main Authors: Xiangyu Zou, Bin Sun, Duan Zhao, Zongwei Zhu, Jinjin Zhao, Yongxin He
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9082663/
id doaj-08091d6560c442ddb6da922e1aea5dd3
record_format Article
spelling doaj-08091d6560c442ddb6da922e1aea5dd32021-03-30T01:46:11ZengIEEEIEEE Access2169-35362020-01-018833218333210.1109/ACCESS.2020.29914359082663Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal GraphXiangyu Zou0https://orcid.org/0000-0002-2993-9498Bin Sun1https://orcid.org/0000-0003-0652-3999Duan Zhao2https://orcid.org/0000-0002-9679-3943Zongwei Zhu3https://orcid.org/0000-0003-3607-2631Jinjin Zhao4https://orcid.org/0000-0002-7047-0775Yongxin He5School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSuzhou Institute for Advanced Study, University of Science and Technology of China, Suzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaSchool of Information and Control Engineering, China University of Mining and Technology, Xuzhou, ChinaEdge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian trajectory in the dynamic scene, so as to avoid collision. However, predicting pedestrian trajectories in dynamic scenes is not an easy task, because it is necessary to comprehensively consider the spatial-temporal structure of human-environment interaction, visual attention, and the multi-modal behavior of human walking. In this paper, a scalable spatial-temporal graph generation adversarial network architecture (STG-GAN) is introduced, which can comprehensively consider the influence of human-environment interaction and generate a reasonable multi-modal prediction trajectory. First, we use LSTM nodes to flexibly transform the spatial-temporal graph of human-environment interactions into feed-forward differentiable feature coding, and innovatively propose the global node to integrate scene context information. Then, we capture the relative importance of global interactions on pedestrian trajectories through scaled dot product attention, and use recurrent sequence modeling and generative adversarial network architecture for common training, so as to generate reasonable pedestrian future trajectory distributions based on rich mixed features. Experiments on public data sets show that STG-GAN is superior to previous work in terms of accuracy, reasoning speed and rationality of trajectory prediction.https://ieeexplore.ieee.org/document/9082663/Trajectory predictionspatial-temporal graphgenerative adversarial networkglobal nodescaled dot product attention
collection DOAJ
language English
format Article
sources DOAJ
author Xiangyu Zou
Bin Sun
Duan Zhao
Zongwei Zhu
Jinjin Zhao
Yongxin He
spellingShingle Xiangyu Zou
Bin Sun
Duan Zhao
Zongwei Zhu
Jinjin Zhao
Yongxin He
Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
IEEE Access
Trajectory prediction
spatial-temporal graph
generative adversarial network
global node
scaled dot product attention
author_facet Xiangyu Zou
Bin Sun
Duan Zhao
Zongwei Zhu
Jinjin Zhao
Yongxin He
author_sort Xiangyu Zou
title Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_short Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_full Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_fullStr Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_full_unstemmed Multi-Modal Pedestrian Trajectory Prediction for Edge Agents Based on Spatial-Temporal Graph
title_sort multi-modal pedestrian trajectory prediction for edge agents based on spatial-temporal graph
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Edge agents, represented by socially-aware robots and autonomous vehicles, have gradually been integrated into human society. The safety navigation system in interactive scenes is of great importance to them. The key of this system is that the edge agent has the ability to predict the pedestrian trajectory in the dynamic scene, so as to avoid collision. However, predicting pedestrian trajectories in dynamic scenes is not an easy task, because it is necessary to comprehensively consider the spatial-temporal structure of human-environment interaction, visual attention, and the multi-modal behavior of human walking. In this paper, a scalable spatial-temporal graph generation adversarial network architecture (STG-GAN) is introduced, which can comprehensively consider the influence of human-environment interaction and generate a reasonable multi-modal prediction trajectory. First, we use LSTM nodes to flexibly transform the spatial-temporal graph of human-environment interactions into feed-forward differentiable feature coding, and innovatively propose the global node to integrate scene context information. Then, we capture the relative importance of global interactions on pedestrian trajectories through scaled dot product attention, and use recurrent sequence modeling and generative adversarial network architecture for common training, so as to generate reasonable pedestrian future trajectory distributions based on rich mixed features. Experiments on public data sets show that STG-GAN is superior to previous work in terms of accuracy, reasoning speed and rationality of trajectory prediction.
topic Trajectory prediction
spatial-temporal graph
generative adversarial network
global node
scaled dot product attention
url https://ieeexplore.ieee.org/document/9082663/
work_keys_str_mv AT xiangyuzou multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph
AT binsun multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph
AT duanzhao multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph
AT zongweizhu multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph
AT jinjinzhao multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph
AT yongxinhe multimodalpedestriantrajectorypredictionforedgeagentsbasedonspatialtemporalgraph
_version_ 1724186399331057664