Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies

The freedom of speech in online spaces has substantially promoted engagement on social media platforms, where cyberbullying has emerged as a significant consequence. While extensive research has been conducted on cyberbullying detection in English, efforts in the Arabic language remain limited. To a...

Full description

Bibliographic Details
Published in:IEEE Access
Main Authors: Huda Aljalaoud, Kia Dashtipour, Ahmed Y. Al-Dubai
Format: Article
Language:English
Published: IEEE 2025-01-01
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10966006/
_version_ 1849629783109926912
author Huda Aljalaoud
Kia Dashtipour
Ahmed Y. Al-Dubai
author_facet Huda Aljalaoud
Kia Dashtipour
Ahmed Y. Al-Dubai
author_sort Huda Aljalaoud
collection DOAJ
container_title IEEE Access
description The freedom of speech in online spaces has substantially promoted engagement on social media platforms, where cyberbullying has emerged as a significant consequence. While extensive research has been conducted on cyberbullying detection in English, efforts in the Arabic language remain limited. To address this gap, the current study provides a comprehensive, state-of-the-art review of datasets and methodologies specifically focused on Arabic cyberbullying detection. It systematically reviews different relevant studies from six academic databases, examining their methodologies, dataset characteristics, and performance in terms of classification accuracy and limitations. The paper critically evaluates existing Arabic cyberbullying datasets according to criteria such as dataset size, dialectal diversity, annotation processes, and accessibility. Additionally, this review identifies critical limitations, including dataset scarcity, dialectal imbalance, annotation subjectivity, and methodological constraints. By synthesizing current knowledge, identifying research gaps, and suggesting future directions, this review supports the development of more robust, effective, and linguistically inclusive analytical methods. Ultimately, this work contributes significantly to natural language processing research and advances the creation of safer online environments for Arabic-speaking users.
format Article
id doaj-art-e3714b4e63dc4bba95fe1ac0f7b974ce
institution Directory of Open Access Journals
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
spelling doaj-art-e3714b4e63dc4bba95fe1ac0f7b974ce2025-08-20T02:25:08ZengIEEEIEEE Access2169-35362025-01-0113690216903810.1109/ACCESS.2025.356113210966006Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and MethodologiesHuda Aljalaoud0https://orcid.org/0009-0007-8459-713XKia Dashtipour1https://orcid.org/0000-0002-9651-6487Ahmed Y. Al-Dubai2https://orcid.org/0000-0001-9758-5540School of Computing, Edinburgh Napier University, Merchiston Campus, Edinburgh, U.K.School of Computing, Edinburgh Napier University, Merchiston Campus, Edinburgh, U.K.School of Computing, Edinburgh Napier University, Merchiston Campus, Edinburgh, U.K.The freedom of speech in online spaces has substantially promoted engagement on social media platforms, where cyberbullying has emerged as a significant consequence. While extensive research has been conducted on cyberbullying detection in English, efforts in the Arabic language remain limited. To address this gap, the current study provides a comprehensive, state-of-the-art review of datasets and methodologies specifically focused on Arabic cyberbullying detection. It systematically reviews different relevant studies from six academic databases, examining their methodologies, dataset characteristics, and performance in terms of classification accuracy and limitations. The paper critically evaluates existing Arabic cyberbullying datasets according to criteria such as dataset size, dialectal diversity, annotation processes, and accessibility. Additionally, this review identifies critical limitations, including dataset scarcity, dialectal imbalance, annotation subjectivity, and methodological constraints. By synthesizing current knowledge, identifying research gaps, and suggesting future directions, this review supports the development of more robust, effective, and linguistically inclusive analytical methods. Ultimately, this work contributes significantly to natural language processing research and advances the creation of safer online environments for Arabic-speaking users.https://ieeexplore.ieee.org/document/10966006/Arabic cyberbullying detectionArabic cyberbullying datasetdeep learningmachine learningtransformers-based
spellingShingle Huda Aljalaoud
Kia Dashtipour
Ahmed Y. Al-Dubai
Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies
Arabic cyberbullying detection
Arabic cyberbullying dataset
deep learning
machine learning
transformers-based
title Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies
title_full Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies
title_fullStr Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies
title_full_unstemmed Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies
title_short Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies
title_sort arabic cyberbullying detection a comprehensive review of datasets and methodologies
topic Arabic cyberbullying detection
Arabic cyberbullying dataset
deep learning
machine learning
transformers-based
url https://ieeexplore.ieee.org/document/10966006/
work_keys_str_mv AT hudaaljalaoud arabiccyberbullyingdetectionacomprehensivereviewofdatasetsandmethodologies
AT kiadashtipour arabiccyberbullyingdetectionacomprehensivereviewofdatasetsandmethodologies
AT ahmedyaldubai arabiccyberbullyingdetectionacomprehensivereviewofdatasetsandmethodologies