Predicting the Volume of Response to Tweets Posted by a Single Twitter Account

Social media users, including organizations, often struggle to acquire the maximum number of responses from other users, but predicting the responses that a post will receive before publication is highly desirable. Previous studies have analyzed why a given tweet may become more popular than others,...

Full description

Bibliographic Details
Main Authors: Krzysztof Fiok, Waldemar Karwowski, Edgar Gutierrez, Tareq Ahram
Format: Article
Language:English
Published: MDPI AG 2020-06-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/6/1054
id doaj-223a46245a3648d584f52f4fb9b784ed
record_format Article
spelling doaj-223a46245a3648d584f52f4fb9b784ed2020-11-25T03:14:12ZengMDPI AGSymmetry2073-89942020-06-01121054105410.3390/sym12061054Predicting the Volume of Response to Tweets Posted by a Single Twitter AccountKrzysztof Fiok0Waldemar Karwowski1Edgar Gutierrez2Tareq Ahram3Department of Industrial Engineering and Management Systems, University of Central Florida, Orlando, FL 32816, USADepartment of Industrial Engineering and Management Systems, University of Central Florida, Orlando, FL 32816, USADepartment of Industrial Engineering and Management Systems, University of Central Florida, Orlando, FL 32816, USADepartment of Industrial Engineering and Management Systems, University of Central Florida, Orlando, FL 32816, USASocial media users, including organizations, often struggle to acquire the maximum number of responses from other users, but predicting the responses that a post will receive before publication is highly desirable. Previous studies have analyzed why a given tweet may become more popular than others, and have used a variety of models trained to predict the response that a given tweet will receive. The present research addresses the prediction of response measures available on Twitter, including likes, replies and retweets. Data from a single publisher, the official US Navy Twitter account, were used to develop a feature-based model derived from structured tweet-related data. Most importantly, a deep learning feature extraction approach for analyzing unstructured tweet text was applied. A classification task with three classes, representing low, moderate and high responses to tweets, was defined and addressed using four machine learning classifiers. All proposed models were symmetrically trained in a fivefold cross-validation regime using various feature configurations, which allowed for the methodically sound comparison of prediction approaches. The best models achieved F1 scores of 0.655. Our study also used SHapley Additive exPlanations (SHAP) to demonstrate limitations in the research on explainable AI methods involving Deep Learning Language Modeling in NLP. We conclude that model performance can be significantly improved by leveraging additional information from the images and links included in tweets.https://www.mdpi.com/2073-8994/12/6/1054natural language processingdeep learningpredictionmachine learningtwitterexplainability
collection DOAJ
language English
format Article
sources DOAJ
author Krzysztof Fiok
Waldemar Karwowski
Edgar Gutierrez
Tareq Ahram
spellingShingle Krzysztof Fiok
Waldemar Karwowski
Edgar Gutierrez
Tareq Ahram
Predicting the Volume of Response to Tweets Posted by a Single Twitter Account
Symmetry
natural language processing
deep learning
prediction
machine learning
twitter
explainability
author_facet Krzysztof Fiok
Waldemar Karwowski
Edgar Gutierrez
Tareq Ahram
author_sort Krzysztof Fiok
title Predicting the Volume of Response to Tweets Posted by a Single Twitter Account
title_short Predicting the Volume of Response to Tweets Posted by a Single Twitter Account
title_full Predicting the Volume of Response to Tweets Posted by a Single Twitter Account
title_fullStr Predicting the Volume of Response to Tweets Posted by a Single Twitter Account
title_full_unstemmed Predicting the Volume of Response to Tweets Posted by a Single Twitter Account
title_sort predicting the volume of response to tweets posted by a single twitter account
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2020-06-01
description Social media users, including organizations, often struggle to acquire the maximum number of responses from other users, but predicting the responses that a post will receive before publication is highly desirable. Previous studies have analyzed why a given tweet may become more popular than others, and have used a variety of models trained to predict the response that a given tweet will receive. The present research addresses the prediction of response measures available on Twitter, including likes, replies and retweets. Data from a single publisher, the official US Navy Twitter account, were used to develop a feature-based model derived from structured tweet-related data. Most importantly, a deep learning feature extraction approach for analyzing unstructured tweet text was applied. A classification task with three classes, representing low, moderate and high responses to tweets, was defined and addressed using four machine learning classifiers. All proposed models were symmetrically trained in a fivefold cross-validation regime using various feature configurations, which allowed for the methodically sound comparison of prediction approaches. The best models achieved F1 scores of 0.655. Our study also used SHapley Additive exPlanations (SHAP) to demonstrate limitations in the research on explainable AI methods involving Deep Learning Language Modeling in NLP. We conclude that model performance can be significantly improved by leveraging additional information from the images and links included in tweets.
topic natural language processing
deep learning
prediction
machine learning
twitter
explainability
url https://www.mdpi.com/2073-8994/12/6/1054
work_keys_str_mv AT krzysztoffiok predictingthevolumeofresponsetotweetspostedbyasingletwitteraccount
AT waldemarkarwowski predictingthevolumeofresponsetotweetspostedbyasingletwitteraccount
AT edgargutierrez predictingthevolumeofresponsetotweetspostedbyasingletwitteraccount
AT tareqahram predictingthevolumeofresponsetotweetspostedbyasingletwitteraccount
_version_ 1724643959127408640