Enhancing the Classification of Imbalanced Arabic Medical Questions Using DeepSMOTE

The growing demand for telemedicine has highlighted the need for automated healthcare services, particularly in medical question classification. This study presents a deep learning model designed to address key challenges in telemedicine, including class imbalance and accurate routing of Arabic medi...

詳細記述

書誌詳細
出版年:AI
主要な著者: Bushra Al-Smadi, Bassam Hammo, Hossam Faris, Pedro A. Castillo
フォーマット: 論文
言語:英語
出版事項: MDPI AG 2025-04-01
主題:
オンライン・アクセス:https://www.mdpi.com/2673-2688/6/4/77
その他の書誌記述
要約:The growing demand for telemedicine has highlighted the need for automated healthcare services, particularly in medical question classification. This study presents a deep learning model designed to address key challenges in telemedicine, including class imbalance and accurate routing of Arabic medical questions to the correct specialties. The model combines AraBERTv0.2-Twitter, fine-tuned for informal Arabic, with Bidirectional Long Short-Term Memory (BiLSTM) networks to capture deep semantic relationships in medical text. We used a labeled dataset of 5000 Arabic consultation records from Altibbi, covering five key medical specialties selected for their clinical relevance and frequency. The data underwent preprocessing to remove noise and normalize text. We employed stratified sampling to ensure representative distribution across the selected medical specialties. We evaluate multiple models using macro precision, macro recall, macro F1-score, weighted F1-score, and G-Mean. Our results demonstrate that DeepSMOTE combined with cross-entropy loss achieves the best performance. The findings offer statistically significant improvements and have practical implications for improving screening and patient routing in telemedicine platforms.
ISSN:2673-2688