Comparing of Data Mining Techniques for Predicting in-Hospital Mortality Among Patients with COVID-19

Introduction: The COVID-19 epidemic is currently fronting the worldwide health care systems with many qualms and unexpected challenges in medical decision-making and the effective sharing of medical resources. Machine Learning (ML)-based prediction models can be potentially advantageous to overcome...

Full description

Bibliographic Details
Main Authors: Mostafa Shanbehzadeh, Azam Orooji, Hadi Kazemi-Arpanahi
Format: Article
Language:English
Published: Tehran University of Medical Sciences 2021-07-01
Series:Journal of Biostatistics and Epidemiology
Subjects:
Online Access:https://jbe.tums.ac.ir/index.php/jbe/article/view/504
id doaj-1a9c062375804a67a51a4d9b541dce2c
record_format Article
spelling doaj-1a9c062375804a67a51a4d9b541dce2c2021-09-11T05:30:39ZengTehran University of Medical SciencesJournal of Biostatistics and Epidemiology2383-41962383-420X2021-07-017210.18502/jbe.v7i2.6725Comparing of Data Mining Techniques for Predicting in-Hospital Mortality Among Patients with COVID-19Mostafa Shanbehzadeh0Azam Orooji1Hadi Kazemi-Arpanahi2Department of Health Information Technology, School of Paramedical, Ilam University of Medical Sciences, Ilam, IranDepartment of Advanced Technologies, School of Medicine, North Khorasan University of Medical Science, North Khorasan, Iran.Department of Health Information Technology, Abadan University of Medical Sciences, Abadan, Iran, Student Research Committee, Abadan University of Medical Sciences, Abadan, Iran Introduction: The COVID-19 epidemic is currently fronting the worldwide health care systems with many qualms and unexpected challenges in medical decision-making and the effective sharing of medical resources. Machine Learning (ML)-based prediction models can be potentially advantageous to overcome these uncertainties. Objective: This study aims to train several ML algorithms to predict the COVID-19 in-hospital mortality and compare their performance to choose the best performing algorithm. Finally, the contributing factors scored using some feature selection methods.  Material and Methods: Using a single-center registry, we studied the records of 1353 confirmed COVID19 hospitalized patients from Ayatollah Taleghani hospital, Abadan city, Iran. We applied six feature scoring techniques and nine well-known ML algorithms. To evaluate the models’ performances, the metrics derived from the confusion matrix calculated.  Results: The study participants were 1353 patients, the male sex found to be higher than the women (742 vs. 611), and the median age was 57.25 (interquartile 18-100). After feature scoring, out of 54 variables, absolute neutrophil/lymphocyte count and loss of taste and smell were found the top three predictors. On the other hand, platelet count, magnesium, and headache gained the lowest importance for predicting the COVID-19 mortality. Experimental results indicated that the Bayesian network algorithm with an accuracy of 89.31% and a sensitivity of 64.2 % has been more successful in predicting mortality.  Conclusion: ML provides a reasonable level of accuracy in predicting. So, using the ML-based prediction models facilitate more responsive health systems and would be beneficial for timely identification of vulnerable patients to inform appropriate judgment by the health care providers. Abbreviation: Coronavirus Disease 2019 (COVID‐19), World Health Organization (WHO), Machine Learning (ML), Artificial Intelligence (AI), Multilayer Perceptron (MLP), Support Vector Machine (SVM), Locally Weighted Learning (LWL), Clinical Decision Support System (CDSS)  https://jbe.tums.ac.ir/index.php/jbe/article/view/504COVID‐19CoronavirusArtificial intelligenceMachine learningMortality
collection DOAJ
language English
format Article
sources DOAJ
author Mostafa Shanbehzadeh
Azam Orooji
Hadi Kazemi-Arpanahi
spellingShingle Mostafa Shanbehzadeh
Azam Orooji
Hadi Kazemi-Arpanahi
Comparing of Data Mining Techniques for Predicting in-Hospital Mortality Among Patients with COVID-19
Journal of Biostatistics and Epidemiology
COVID‐19
Coronavirus
Artificial intelligence
Machine learning
Mortality
author_facet Mostafa Shanbehzadeh
Azam Orooji
Hadi Kazemi-Arpanahi
author_sort Mostafa Shanbehzadeh
title Comparing of Data Mining Techniques for Predicting in-Hospital Mortality Among Patients with COVID-19
title_short Comparing of Data Mining Techniques for Predicting in-Hospital Mortality Among Patients with COVID-19
title_full Comparing of Data Mining Techniques for Predicting in-Hospital Mortality Among Patients with COVID-19
title_fullStr Comparing of Data Mining Techniques for Predicting in-Hospital Mortality Among Patients with COVID-19
title_full_unstemmed Comparing of Data Mining Techniques for Predicting in-Hospital Mortality Among Patients with COVID-19
title_sort comparing of data mining techniques for predicting in-hospital mortality among patients with covid-19
publisher Tehran University of Medical Sciences
series Journal of Biostatistics and Epidemiology
issn 2383-4196
2383-420X
publishDate 2021-07-01
description Introduction: The COVID-19 epidemic is currently fronting the worldwide health care systems with many qualms and unexpected challenges in medical decision-making and the effective sharing of medical resources. Machine Learning (ML)-based prediction models can be potentially advantageous to overcome these uncertainties. Objective: This study aims to train several ML algorithms to predict the COVID-19 in-hospital mortality and compare their performance to choose the best performing algorithm. Finally, the contributing factors scored using some feature selection methods.  Material and Methods: Using a single-center registry, we studied the records of 1353 confirmed COVID19 hospitalized patients from Ayatollah Taleghani hospital, Abadan city, Iran. We applied six feature scoring techniques and nine well-known ML algorithms. To evaluate the models’ performances, the metrics derived from the confusion matrix calculated.  Results: The study participants were 1353 patients, the male sex found to be higher than the women (742 vs. 611), and the median age was 57.25 (interquartile 18-100). After feature scoring, out of 54 variables, absolute neutrophil/lymphocyte count and loss of taste and smell were found the top three predictors. On the other hand, platelet count, magnesium, and headache gained the lowest importance for predicting the COVID-19 mortality. Experimental results indicated that the Bayesian network algorithm with an accuracy of 89.31% and a sensitivity of 64.2 % has been more successful in predicting mortality.  Conclusion: ML provides a reasonable level of accuracy in predicting. So, using the ML-based prediction models facilitate more responsive health systems and would be beneficial for timely identification of vulnerable patients to inform appropriate judgment by the health care providers. Abbreviation: Coronavirus Disease 2019 (COVID‐19), World Health Organization (WHO), Machine Learning (ML), Artificial Intelligence (AI), Multilayer Perceptron (MLP), Support Vector Machine (SVM), Locally Weighted Learning (LWL), Clinical Decision Support System (CDSS) 
topic COVID‐19
Coronavirus
Artificial intelligence
Machine learning
Mortality
url https://jbe.tums.ac.ir/index.php/jbe/article/view/504
work_keys_str_mv AT mostafashanbehzadeh comparingofdataminingtechniquesforpredictinginhospitalmortalityamongpatientswithcovid19
AT azamorooji comparingofdataminingtechniquesforpredictinginhospitalmortalityamongpatientswithcovid19
AT hadikazemiarpanahi comparingofdataminingtechniquesforpredictinginhospitalmortalityamongpatientswithcovid19
_version_ 1717756922675855360