Predicting Stroke Risk With an Interpretable Classifier

Predicting an individual's risk of getting a stroke has been a research subject for many authors worldwide since it is a frequent illness and there is strong evidence that early awareness of having that risk can be beneficial for prevention and treatment. Many Governments have been collecting m...

Full description

Bibliographic Details
Main Authors: Sergio Penafiel, Nelson Baloian, Horacio Sanson, Jose A. Pino
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9306826/
id doaj-76ee79c608674afdacd2e8f6d7c8e86c
record_format Article
spelling doaj-76ee79c608674afdacd2e8f6d7c8e86c2021-03-30T14:56:36ZengIEEEIEEE Access2169-35362021-01-0191154116610.1109/ACCESS.2020.30471959306826Predicting Stroke Risk With an Interpretable ClassifierSergio Penafiel0https://orcid.org/0000-0002-0025-7805Nelson Baloian1Horacio Sanson2Jose A. Pino3https://orcid.org/0000-0002-5835-988XDepartment of Computer Science, Universidad de Chile, Santiago, ChileDepartment of Computer Science, Universidad de Chile, Santiago, ChileAllm Inc., Tokyo, JapanDepartment of Computer Science, Universidad de Chile, Santiago, ChilePredicting an individual's risk of getting a stroke has been a research subject for many authors worldwide since it is a frequent illness and there is strong evidence that early awareness of having that risk can be beneficial for prevention and treatment. Many Governments have been collecting medical data about their own population with the purpose of using artificial intelligence methods for making those predictions. The most accurate ones are based on so called black-box methods which give little or no information about why they make a certain prediction. However, in the medical field the explanations are sometimes more important than the accuracy since they allow specialists to gain insight about the factors that influence the risk level. It is also frequent to find medical information records with some missing data. In this work, we present the development of a prediction method which not only outperforms some other existing ones but it also gives information about the most probable causes of a high stroke risk and can deal with incomplete data records. It is based on the Dempster-Shafer theory of plausibility. For the testing we used data provided by the regional hospital in Okayama, Japan, a country in which people are compelled to undergo annual health checkups by law. This article presents experiments comparing the results of the Dempster-Shafer method with the ones obtained using other well-known machine learning methods like Multilayer perceptron, Support Vector Machines and Naive Bayes. Our approach performed the best in these experiments with some missing data. It also presents an analysis of the interpretation of rules produced by the method for doing the classification. The rules were validated by both medical literature and human specialists.https://ieeexplore.ieee.org/document/9306826/Dempster-Shafer theorystrokeexpert systemsinterpretable classification
collection DOAJ
language English
format Article
sources DOAJ
author Sergio Penafiel
Nelson Baloian
Horacio Sanson
Jose A. Pino
spellingShingle Sergio Penafiel
Nelson Baloian
Horacio Sanson
Jose A. Pino
Predicting Stroke Risk With an Interpretable Classifier
IEEE Access
Dempster-Shafer theory
stroke
expert systems
interpretable classification
author_facet Sergio Penafiel
Nelson Baloian
Horacio Sanson
Jose A. Pino
author_sort Sergio Penafiel
title Predicting Stroke Risk With an Interpretable Classifier
title_short Predicting Stroke Risk With an Interpretable Classifier
title_full Predicting Stroke Risk With an Interpretable Classifier
title_fullStr Predicting Stroke Risk With an Interpretable Classifier
title_full_unstemmed Predicting Stroke Risk With an Interpretable Classifier
title_sort predicting stroke risk with an interpretable classifier
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Predicting an individual's risk of getting a stroke has been a research subject for many authors worldwide since it is a frequent illness and there is strong evidence that early awareness of having that risk can be beneficial for prevention and treatment. Many Governments have been collecting medical data about their own population with the purpose of using artificial intelligence methods for making those predictions. The most accurate ones are based on so called black-box methods which give little or no information about why they make a certain prediction. However, in the medical field the explanations are sometimes more important than the accuracy since they allow specialists to gain insight about the factors that influence the risk level. It is also frequent to find medical information records with some missing data. In this work, we present the development of a prediction method which not only outperforms some other existing ones but it also gives information about the most probable causes of a high stroke risk and can deal with incomplete data records. It is based on the Dempster-Shafer theory of plausibility. For the testing we used data provided by the regional hospital in Okayama, Japan, a country in which people are compelled to undergo annual health checkups by law. This article presents experiments comparing the results of the Dempster-Shafer method with the ones obtained using other well-known machine learning methods like Multilayer perceptron, Support Vector Machines and Naive Bayes. Our approach performed the best in these experiments with some missing data. It also presents an analysis of the interpretation of rules produced by the method for doing the classification. The rules were validated by both medical literature and human specialists.
topic Dempster-Shafer theory
stroke
expert systems
interpretable classification
url https://ieeexplore.ieee.org/document/9306826/
work_keys_str_mv AT sergiopenafiel predictingstrokeriskwithaninterpretableclassifier
AT nelsonbaloian predictingstrokeriskwithaninterpretableclassifier
AT horaciosanson predictingstrokeriskwithaninterpretableclassifier
AT joseapino predictingstrokeriskwithaninterpretableclassifier
_version_ 1724180348397420544