Exploring an Ensemble of Textual Machine Learning Methodologies for Traffic Event Detection and Classification

Late research has established the critical environmental, health and social impacts of traffic in highly populated urban regions. Apart from traffic monitoring, textual analysis of geo-located social media responses can provide an intelligent means in detecting and classifying traffic related events...

Full description

Bibliographic Details
Main Authors: Kokkinos Konstantinos, Nathanail Eftihia
Format: Article
Language:English
Published: Sciendo 2020-12-01
Series:Transport and Telecommunication
Subjects:
Online Access:https://doi.org/10.2478/ttj-2020-0023
id doaj-5dce7e3c009d45fd85b0f6a0cf1da128
record_format Article
spelling doaj-5dce7e3c009d45fd85b0f6a0cf1da1282021-09-05T21:24:16ZengSciendoTransport and Telecommunication1407-61792020-12-0121428529410.2478/ttj-2020-0023ttj-2020-0023Exploring an Ensemble of Textual Machine Learning Methodologies for Traffic Event Detection and ClassificationKokkinos Konstantinos0Nathanail Eftihia1Energy Systems Department, University of Thessaly, Larissa, GreeceCivil Engineering Department, University of Thessaly, Volos, GreeceLate research has established the critical environmental, health and social impacts of traffic in highly populated urban regions. Apart from traffic monitoring, textual analysis of geo-located social media responses can provide an intelligent means in detecting and classifying traffic related events. This paper deals with the content analysis of Twitter textual data using an ensemble of supervised and unsupervised Machine Learning methods in order to cluster and properly classify traffic related events. Voluminous textual data was gathered using innovative Twitter APIs and managed by Big Data cloud methodologies via an Apache Spark system. Events were detected using a traffic related typology and the clustering K-Means model, where related event classification was achieved applying Support Vector Machines (SVM), Convolutional Neural Networks (CNN) and Long Short Term Memory (LSTM) networks. We provide experimental results for 2-class and 3-class classification examples indicating that the ensemble performs with accuracy and F-score reaching 98.5%.https://doi.org/10.2478/ttj-2020-0023textualtrafficclusteringclassificationensembledeep-learning
collection DOAJ
language English
format Article
sources DOAJ
author Kokkinos Konstantinos
Nathanail Eftihia
spellingShingle Kokkinos Konstantinos
Nathanail Eftihia
Exploring an Ensemble of Textual Machine Learning Methodologies for Traffic Event Detection and Classification
Transport and Telecommunication
textual
traffic
clustering
classification
ensemble
deep-learning
author_facet Kokkinos Konstantinos
Nathanail Eftihia
author_sort Kokkinos Konstantinos
title Exploring an Ensemble of Textual Machine Learning Methodologies for Traffic Event Detection and Classification
title_short Exploring an Ensemble of Textual Machine Learning Methodologies for Traffic Event Detection and Classification
title_full Exploring an Ensemble of Textual Machine Learning Methodologies for Traffic Event Detection and Classification
title_fullStr Exploring an Ensemble of Textual Machine Learning Methodologies for Traffic Event Detection and Classification
title_full_unstemmed Exploring an Ensemble of Textual Machine Learning Methodologies for Traffic Event Detection and Classification
title_sort exploring an ensemble of textual machine learning methodologies for traffic event detection and classification
publisher Sciendo
series Transport and Telecommunication
issn 1407-6179
publishDate 2020-12-01
description Late research has established the critical environmental, health and social impacts of traffic in highly populated urban regions. Apart from traffic monitoring, textual analysis of geo-located social media responses can provide an intelligent means in detecting and classifying traffic related events. This paper deals with the content analysis of Twitter textual data using an ensemble of supervised and unsupervised Machine Learning methods in order to cluster and properly classify traffic related events. Voluminous textual data was gathered using innovative Twitter APIs and managed by Big Data cloud methodologies via an Apache Spark system. Events were detected using a traffic related typology and the clustering K-Means model, where related event classification was achieved applying Support Vector Machines (SVM), Convolutional Neural Networks (CNN) and Long Short Term Memory (LSTM) networks. We provide experimental results for 2-class and 3-class classification examples indicating that the ensemble performs with accuracy and F-score reaching 98.5%.
topic textual
traffic
clustering
classification
ensemble
deep-learning
url https://doi.org/10.2478/ttj-2020-0023
work_keys_str_mv AT kokkinoskonstantinos exploringanensembleoftextualmachinelearningmethodologiesfortrafficeventdetectionandclassification
AT nathanaileftihia exploringanensembleoftextualmachinelearningmethodologiesfortrafficeventdetectionandclassification
_version_ 1717780582927171584