Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies
The freedom of speech in online spaces has substantially promoted engagement on social media platforms, where cyberbullying has emerged as a significant consequence. While extensive research has been conducted on cyberbullying detection in English, efforts in the Arabic language remain limited. To a...
| Published in: | IEEE Access |
|---|---|
| Main Authors: | , , |
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10966006/ |
| _version_ | 1849629783109926912 |
|---|---|
| author | Huda Aljalaoud Kia Dashtipour Ahmed Y. Al-Dubai |
| author_facet | Huda Aljalaoud Kia Dashtipour Ahmed Y. Al-Dubai |
| author_sort | Huda Aljalaoud |
| collection | DOAJ |
| container_title | IEEE Access |
| description | The freedom of speech in online spaces has substantially promoted engagement on social media platforms, where cyberbullying has emerged as a significant consequence. While extensive research has been conducted on cyberbullying detection in English, efforts in the Arabic language remain limited. To address this gap, the current study provides a comprehensive, state-of-the-art review of datasets and methodologies specifically focused on Arabic cyberbullying detection. It systematically reviews different relevant studies from six academic databases, examining their methodologies, dataset characteristics, and performance in terms of classification accuracy and limitations. The paper critically evaluates existing Arabic cyberbullying datasets according to criteria such as dataset size, dialectal diversity, annotation processes, and accessibility. Additionally, this review identifies critical limitations, including dataset scarcity, dialectal imbalance, annotation subjectivity, and methodological constraints. By synthesizing current knowledge, identifying research gaps, and suggesting future directions, this review supports the development of more robust, effective, and linguistically inclusive analytical methods. Ultimately, this work contributes significantly to natural language processing research and advances the creation of safer online environments for Arabic-speaking users. |
| format | Article |
| id | doaj-art-e3714b4e63dc4bba95fe1ac0f7b974ce |
| institution | Directory of Open Access Journals |
| issn | 2169-3536 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | IEEE |
| record_format | Article |
| spelling | doaj-art-e3714b4e63dc4bba95fe1ac0f7b974ce2025-08-20T02:25:08ZengIEEEIEEE Access2169-35362025-01-0113690216903810.1109/ACCESS.2025.356113210966006Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and MethodologiesHuda Aljalaoud0https://orcid.org/0009-0007-8459-713XKia Dashtipour1https://orcid.org/0000-0002-9651-6487Ahmed Y. Al-Dubai2https://orcid.org/0000-0001-9758-5540School of Computing, Edinburgh Napier University, Merchiston Campus, Edinburgh, U.K.School of Computing, Edinburgh Napier University, Merchiston Campus, Edinburgh, U.K.School of Computing, Edinburgh Napier University, Merchiston Campus, Edinburgh, U.K.The freedom of speech in online spaces has substantially promoted engagement on social media platforms, where cyberbullying has emerged as a significant consequence. While extensive research has been conducted on cyberbullying detection in English, efforts in the Arabic language remain limited. To address this gap, the current study provides a comprehensive, state-of-the-art review of datasets and methodologies specifically focused on Arabic cyberbullying detection. It systematically reviews different relevant studies from six academic databases, examining their methodologies, dataset characteristics, and performance in terms of classification accuracy and limitations. The paper critically evaluates existing Arabic cyberbullying datasets according to criteria such as dataset size, dialectal diversity, annotation processes, and accessibility. Additionally, this review identifies critical limitations, including dataset scarcity, dialectal imbalance, annotation subjectivity, and methodological constraints. By synthesizing current knowledge, identifying research gaps, and suggesting future directions, this review supports the development of more robust, effective, and linguistically inclusive analytical methods. Ultimately, this work contributes significantly to natural language processing research and advances the creation of safer online environments for Arabic-speaking users.https://ieeexplore.ieee.org/document/10966006/Arabic cyberbullying detectionArabic cyberbullying datasetdeep learningmachine learningtransformers-based |
| spellingShingle | Huda Aljalaoud Kia Dashtipour Ahmed Y. Al-Dubai Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies Arabic cyberbullying detection Arabic cyberbullying dataset deep learning machine learning transformers-based |
| title | Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies |
| title_full | Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies |
| title_fullStr | Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies |
| title_full_unstemmed | Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies |
| title_short | Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies |
| title_sort | arabic cyberbullying detection a comprehensive review of datasets and methodologies |
| topic | Arabic cyberbullying detection Arabic cyberbullying dataset deep learning machine learning transformers-based |
| url | https://ieeexplore.ieee.org/document/10966006/ |
| work_keys_str_mv | AT hudaaljalaoud arabiccyberbullyingdetectionacomprehensivereviewofdatasetsandmethodologies AT kiadashtipour arabiccyberbullyingdetectionacomprehensivereviewofdatasetsandmethodologies AT ahmedyaldubai arabiccyberbullyingdetectionacomprehensivereviewofdatasetsandmethodologies |
