Spatio-Temporal Segmented Traffic Flow Prediction with ANPRS Data Based on Improved XGBoost

Traffic prediction is highly significant for intelligent traffic systems and traffic management. eXtreme Gradient Boosting (XGBoost), a scalable tree lifting algorithm, is proposed and improved to predict more high-resolution traffic state by utilizing origin-destination (OD) relationship of segment...

Full description

Bibliographic Details
Main Authors: Bo Sun, Tuo Sun, Pengpeng Jiao
Format: Article
Language:English
Published: Hindawi-Wiley 2021-01-01
Series:Journal of Advanced Transportation
Online Access:http://dx.doi.org/10.1155/2021/5559562
id doaj-132f72b526214b1f968a50069908de2d
record_format Article
spelling doaj-132f72b526214b1f968a50069908de2d2021-06-14T00:16:43ZengHindawi-WileyJournal of Advanced Transportation2042-31952021-01-01202110.1155/2021/5559562Spatio-Temporal Segmented Traffic Flow Prediction with ANPRS Data Based on Improved XGBoostBo Sun0Tuo Sun1Pengpeng Jiao2Beijing Key Laboratory of General Aviation TechnologyBeijing Key Laboratory of General Aviation TechnologyBeijing Key Laboratory of General Aviation TechnologyTraffic prediction is highly significant for intelligent traffic systems and traffic management. eXtreme Gradient Boosting (XGBoost), a scalable tree lifting algorithm, is proposed and improved to predict more high-resolution traffic state by utilizing origin-destination (OD) relationship of segment flow data between upstream and downstream on the highway. In order to achieve fine prediction, a generalized extended-segment data acquirement mode is added by incorporating information of Automatic Number Plate Recognition System (ANPRS) from exits and entrances of toll stations and acquired by mathematical OD calculation indirectly without cameras. Abnormal data preprocessing and spatio-temporal relationship matching are conducted to ensure the effectiveness of prediction. Pearson analysis of spatial correlation is performed to find the relevance between adjacent roads, and the relative importance of input modes can be verified by spatial lag input and ordinary input. Two improved models, independent XGBoost (XGBoost-I) with individual adjustment parameters of different sections and static XGBoost (XGBoost-S) with overall adjustment of parameters, are conducted and combined with temporal relevant intervals and spatial staggered sectional lag. The early_stopping_rounds adjustment mechanism (EAM) is introduced to improve the effect of the XGBoost model. The prediction accuracy of XGBoost-I-lag is generally higher than XGBoost-I, XGBoost-S-lag, XGBoost-S, and other baseline methods for short-term and long-term multistep ahead. Additionally, the accuracy of the XGBoost-I-lag is evaluated well in nonrecurrent conditions and missing cases with considerable running time. The experiment results indicate that the proposed framework is convincing, satisfactory, and computationally reasonable.http://dx.doi.org/10.1155/2021/5559562
collection DOAJ
language English
format Article
sources DOAJ
author Bo Sun
Tuo Sun
Pengpeng Jiao
spellingShingle Bo Sun
Tuo Sun
Pengpeng Jiao
Spatio-Temporal Segmented Traffic Flow Prediction with ANPRS Data Based on Improved XGBoost
Journal of Advanced Transportation
author_facet Bo Sun
Tuo Sun
Pengpeng Jiao
author_sort Bo Sun
title Spatio-Temporal Segmented Traffic Flow Prediction with ANPRS Data Based on Improved XGBoost
title_short Spatio-Temporal Segmented Traffic Flow Prediction with ANPRS Data Based on Improved XGBoost
title_full Spatio-Temporal Segmented Traffic Flow Prediction with ANPRS Data Based on Improved XGBoost
title_fullStr Spatio-Temporal Segmented Traffic Flow Prediction with ANPRS Data Based on Improved XGBoost
title_full_unstemmed Spatio-Temporal Segmented Traffic Flow Prediction with ANPRS Data Based on Improved XGBoost
title_sort spatio-temporal segmented traffic flow prediction with anprs data based on improved xgboost
publisher Hindawi-Wiley
series Journal of Advanced Transportation
issn 2042-3195
publishDate 2021-01-01
description Traffic prediction is highly significant for intelligent traffic systems and traffic management. eXtreme Gradient Boosting (XGBoost), a scalable tree lifting algorithm, is proposed and improved to predict more high-resolution traffic state by utilizing origin-destination (OD) relationship of segment flow data between upstream and downstream on the highway. In order to achieve fine prediction, a generalized extended-segment data acquirement mode is added by incorporating information of Automatic Number Plate Recognition System (ANPRS) from exits and entrances of toll stations and acquired by mathematical OD calculation indirectly without cameras. Abnormal data preprocessing and spatio-temporal relationship matching are conducted to ensure the effectiveness of prediction. Pearson analysis of spatial correlation is performed to find the relevance between adjacent roads, and the relative importance of input modes can be verified by spatial lag input and ordinary input. Two improved models, independent XGBoost (XGBoost-I) with individual adjustment parameters of different sections and static XGBoost (XGBoost-S) with overall adjustment of parameters, are conducted and combined with temporal relevant intervals and spatial staggered sectional lag. The early_stopping_rounds adjustment mechanism (EAM) is introduced to improve the effect of the XGBoost model. The prediction accuracy of XGBoost-I-lag is generally higher than XGBoost-I, XGBoost-S-lag, XGBoost-S, and other baseline methods for short-term and long-term multistep ahead. Additionally, the accuracy of the XGBoost-I-lag is evaluated well in nonrecurrent conditions and missing cases with considerable running time. The experiment results indicate that the proposed framework is convincing, satisfactory, and computationally reasonable.
url http://dx.doi.org/10.1155/2021/5559562
work_keys_str_mv AT bosun spatiotemporalsegmentedtrafficflowpredictionwithanprsdatabasedonimprovedxgboost
AT tuosun spatiotemporalsegmentedtrafficflowpredictionwithanprsdatabasedonimprovedxgboost
AT pengpengjiao spatiotemporalsegmentedtrafficflowpredictionwithanprsdatabasedonimprovedxgboost
_version_ 1721378880585990144