Empowering Diagnostics: An Ensemble Machine Learning Model for Early Liver Disease Detection

Early and accurate detection of liver disease is critical to improving patient outcomes yet remains challenging due to class imbalance and noisy clinical data. In this study, we present a robust ensemble learning framework applied to the Indian Liver Patient Dataset, incorporating systematic data c...

詳細記述

書誌詳細
出版年:Al-Iraqia Journal for Scientific Engineering Research
主要な著者: Abdulrahman Ahmed Jasim, Hajer Alwindawi, Layth Rafea Hazim
フォーマット: 論文
言語:英語
出版事項: Al-Iraqia University - College of Engineering 2025-06-01
主題:
オンライン・アクセス:https://ijser.aliraqia.edu.iq/index.php/ijser/article/view/314
その他の書誌記述
要約:Early and accurate detection of liver disease is critical to improving patient outcomes yet remains challenging due to class imbalance and noisy clinical data. In this study, we present a robust ensemble learning framework applied to the Indian Liver Patient Dataset, incorporating systematic data cleaning, normalization, and Synthetic Minority Over‑Sampling (SMOTE) to address missing values, outliers, and class skew. We then perform correlation-based feature reduction before training a stacking classifier that combines Random Forest, XGBoost, and ExtraTrees base learners with an ExtraTrees meta‑learner. Using stratified 10‑fold cross‑validation on the balanced cohort (n = 792), our ensemble achieves 91.6 % accuracy, 92 % F1‑score, and a high area under the ROC curve, outperforming individual models and prior published approaches. These results demonstrate the potential of heterogeneous ensembles for clinical decision support in hepatology and lay the groundwork for prospective validation in diverse patient populations.
ISSN:2710-2165