Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from Mexico

Heart diseases are highly ranked among the leading causes of mortality in the world. They have various types including vascular, ischemic, and hypertensive heart disease. A large number of medical features are reported for patients in the Electronic Health Records (EHR) that allow physicians to diag...

Full description

Bibliographic Details
Main Authors: Asma Baccouche, Begonya Garcia-Zapirain, Cristian Castillo Olea, Adel Elmaghraby
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/11/4/207
id doaj-e651badb5fe045918083613607f043cb
record_format Article
spelling doaj-e651badb5fe045918083613607f043cb2020-11-25T03:10:56ZengMDPI AGInformation2078-24892020-04-011120720710.3390/info11040207Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from MexicoAsma Baccouche0Begonya Garcia-Zapirain1Cristian Castillo Olea2Adel Elmaghraby3Department of Computer Science and Engineering, University of Louisville, Louisville, KY 40292, USAeVida Research Group, University of Deusto, 48007 Bilbao, SpaineVida Research Group, University of Deusto, 48007 Bilbao, SpainDepartment of Computer Science and Engineering, University of Louisville, Louisville, KY 40292, USAHeart diseases are highly ranked among the leading causes of mortality in the world. They have various types including vascular, ischemic, and hypertensive heart disease. A large number of medical features are reported for patients in the Electronic Health Records (EHR) that allow physicians to diagnose and monitor heart disease. We collected a dataset from <i>Medica Norte Hospital</i> in Mexico that includes 800 records and 141 indicators such as age, weight, glucose, blood pressure rate, and clinical symptoms. Distribution of the collected records is very unbalanced on the different types of heart disease, where 17% of records have hypertensive heart disease, 16% of records have ischemic heart disease, 7% of records have mixed heart disease, and 8% of records have valvular heart disease. Herein, we propose an ensemble-learning framework of different neural network models, and a method of aggregating random under-sampling. To improve the performance of the classification algorithms, we implement a data preprocessing step with features selection. Experiments were conducted with unidirectional and bidirectional neural network models and results showed that an ensemble classifier with a BiLSTM or BiGRU model with a CNN model had the best classification performance with accuracy and F1-score between 91% and 96% for the different types of heart disease. These results are competitive and promising for heart disease dataset. We showed that ensemble-learning framework based on deep models could overcome the problem of classifying an unbalanced heart disease dataset. Our proposed framework can lead to highly accurate models that are adapted for clinical real data and diagnosis use.https://www.mdpi.com/2078-2489/11/4/207heart disease classificationneural networkensemble-learning modelunder-samplingfeatures selectiondeep learning
collection DOAJ
language English
format Article
sources DOAJ
author Asma Baccouche
Begonya Garcia-Zapirain
Cristian Castillo Olea
Adel Elmaghraby
spellingShingle Asma Baccouche
Begonya Garcia-Zapirain
Cristian Castillo Olea
Adel Elmaghraby
Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from Mexico
Information
heart disease classification
neural network
ensemble-learning model
under-sampling
features selection
deep learning
author_facet Asma Baccouche
Begonya Garcia-Zapirain
Cristian Castillo Olea
Adel Elmaghraby
author_sort Asma Baccouche
title Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from Mexico
title_short Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from Mexico
title_full Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from Mexico
title_fullStr Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from Mexico
title_full_unstemmed Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from Mexico
title_sort ensemble deep learning models for heart disease classification: a case study from mexico
publisher MDPI AG
series Information
issn 2078-2489
publishDate 2020-04-01
description Heart diseases are highly ranked among the leading causes of mortality in the world. They have various types including vascular, ischemic, and hypertensive heart disease. A large number of medical features are reported for patients in the Electronic Health Records (EHR) that allow physicians to diagnose and monitor heart disease. We collected a dataset from <i>Medica Norte Hospital</i> in Mexico that includes 800 records and 141 indicators such as age, weight, glucose, blood pressure rate, and clinical symptoms. Distribution of the collected records is very unbalanced on the different types of heart disease, where 17% of records have hypertensive heart disease, 16% of records have ischemic heart disease, 7% of records have mixed heart disease, and 8% of records have valvular heart disease. Herein, we propose an ensemble-learning framework of different neural network models, and a method of aggregating random under-sampling. To improve the performance of the classification algorithms, we implement a data preprocessing step with features selection. Experiments were conducted with unidirectional and bidirectional neural network models and results showed that an ensemble classifier with a BiLSTM or BiGRU model with a CNN model had the best classification performance with accuracy and F1-score between 91% and 96% for the different types of heart disease. These results are competitive and promising for heart disease dataset. We showed that ensemble-learning framework based on deep models could overcome the problem of classifying an unbalanced heart disease dataset. Our proposed framework can lead to highly accurate models that are adapted for clinical real data and diagnosis use.
topic heart disease classification
neural network
ensemble-learning model
under-sampling
features selection
deep learning
url https://www.mdpi.com/2078-2489/11/4/207
work_keys_str_mv AT asmabaccouche ensembledeeplearningmodelsforheartdiseaseclassificationacasestudyfrommexico
AT begonyagarciazapirain ensembledeeplearningmodelsforheartdiseaseclassificationacasestudyfrommexico
AT cristiancastilloolea ensembledeeplearningmodelsforheartdiseaseclassificationacasestudyfrommexico
AT adelelmaghraby ensembledeeplearningmodelsforheartdiseaseclassificationacasestudyfrommexico
_version_ 1724656239184445440