SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments

Deciding on the continuous treatment of chronic diseases is vital in terms of economy, quality of life, and time. We present a holistic data mining approach that addresses the prediction of the therapeutic response in a panoramic and feedback way while unveiling relevant medical factors. Panoramic p...

Full description

Bibliographic Details
Main Authors: Franklin Parrales Bravo, Alberto A. Del Barrio Garcia, Ana Beatriz Gago Veiga, Maria Mercedes Gallego De La Sacristana, Marina Ruiz Pinero, Angel Guerrero Peral, Saso Dzeroski, Jose L. Ayala
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8756249/
id doaj-f1d8fcb2893b419295782aae8e842a6e
record_format Article
spelling doaj-f1d8fcb2893b419295782aae8e842a6e2021-03-29T23:32:35ZengIEEEIEEE Access2169-35362019-01-017925989261410.1109/ACCESS.2019.29274298756249SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease TreatmentsFranklin Parrales Bravo0https://orcid.org/0000-0002-6283-8197Alberto A. Del Barrio Garcia1https://orcid.org/0000-0002-6769-1200Ana Beatriz Gago Veiga2Maria Mercedes Gallego De La Sacristana3Marina Ruiz Pinero4Angel Guerrero Peral5Saso Dzeroski6Jose L. Ayala7Department of Computer Architecture and Automation, Complutense University of Madrid, Madrid, SpainDepartment of Computer Architecture and Automation, Complutense University of Madrid, Madrid, SpainNeurology Department, La Princesa University Hospital, Madrid, SpainNeurology Department, La Princesa University Hospital, Madrid, SpainHeadache Unit, Department of Neurology, Hospital Clínico Universitario de Valladolid, Valladolid, SpainHeadache Unit, Department of Neurology, Hospital Clínico Universitario de Valladolid, Valladolid, SpainDepartment of Knowledge Technologies, Jozef Stefan Institute, Ljubljana, SloveniaDepartment of Computer Architecture and Automation, Complutense University of Madrid, Madrid, SpainDeciding on the continuous treatment of chronic diseases is vital in terms of economy, quality of life, and time. We present a holistic data mining approach that addresses the prediction of the therapeutic response in a panoramic and feedback way while unveiling relevant medical factors. Panoramic prediction makes it possible to decide whether the treatment will be beneficial without using previous knowledge and without involving unnecessary treatments. Feedback prediction can be more accurate prediction since it considers the results of previous stages of the treatment. A novel label encoding called simulated annealing and rounding (SAR) encoding is also proposed to help improve the accuracy of prediction in both approaches. To unveil the medical factors that make the treatment effective for patients, various techniques are applied to the prediction models found through the proposed approaches. Finally, this methodology is applied in the realistic scenario of analyzing electronic medical records of migraineurs under BoNT-A treatment. The results show a significant improvement in accuracy due to the use of SAR encoding, from close to 60% (baseline) to 75% with panoramic prediction, and up to around 90% when using feedback prediction. Furthermore, the following factors have been found to be relevant when predicting the migraine treatment responses: migraine time evolution, unilateral pain, analgesic abuse, headache days, and the retroocular component. According to doctors, these factors are also medically relevant and in alignment with the medical literature.https://ieeexplore.ieee.org/document/8756249/Multi-target predictionclassification algorithmsdata miningsimulated annealing
collection DOAJ
language English
format Article
sources DOAJ
author Franklin Parrales Bravo
Alberto A. Del Barrio Garcia
Ana Beatriz Gago Veiga
Maria Mercedes Gallego De La Sacristana
Marina Ruiz Pinero
Angel Guerrero Peral
Saso Dzeroski
Jose L. Ayala
spellingShingle Franklin Parrales Bravo
Alberto A. Del Barrio Garcia
Ana Beatriz Gago Veiga
Maria Mercedes Gallego De La Sacristana
Marina Ruiz Pinero
Angel Guerrero Peral
Saso Dzeroski
Jose L. Ayala
SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments
IEEE Access
Multi-target prediction
classification algorithms
data mining
simulated annealing
author_facet Franklin Parrales Bravo
Alberto A. Del Barrio Garcia
Ana Beatriz Gago Veiga
Maria Mercedes Gallego De La Sacristana
Marina Ruiz Pinero
Angel Guerrero Peral
Saso Dzeroski
Jose L. Ayala
author_sort Franklin Parrales Bravo
title SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments
title_short SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments
title_full SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments
title_fullStr SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments
title_full_unstemmed SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments
title_sort smurf: systematic methodology for unveiling relevant factors in retrospective data on chronic disease treatments
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Deciding on the continuous treatment of chronic diseases is vital in terms of economy, quality of life, and time. We present a holistic data mining approach that addresses the prediction of the therapeutic response in a panoramic and feedback way while unveiling relevant medical factors. Panoramic prediction makes it possible to decide whether the treatment will be beneficial without using previous knowledge and without involving unnecessary treatments. Feedback prediction can be more accurate prediction since it considers the results of previous stages of the treatment. A novel label encoding called simulated annealing and rounding (SAR) encoding is also proposed to help improve the accuracy of prediction in both approaches. To unveil the medical factors that make the treatment effective for patients, various techniques are applied to the prediction models found through the proposed approaches. Finally, this methodology is applied in the realistic scenario of analyzing electronic medical records of migraineurs under BoNT-A treatment. The results show a significant improvement in accuracy due to the use of SAR encoding, from close to 60% (baseline) to 75% with panoramic prediction, and up to around 90% when using feedback prediction. Furthermore, the following factors have been found to be relevant when predicting the migraine treatment responses: migraine time evolution, unilateral pain, analgesic abuse, headache days, and the retroocular component. According to doctors, these factors are also medically relevant and in alignment with the medical literature.
topic Multi-target prediction
classification algorithms
data mining
simulated annealing
url https://ieeexplore.ieee.org/document/8756249/
work_keys_str_mv AT franklinparralesbravo smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments
AT albertoadelbarriogarcia smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments
AT anabeatrizgagoveiga smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments
AT mariamercedesgallegodelasacristana smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments
AT marinaruizpinero smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments
AT angelguerreroperal smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments
AT sasodzeroski smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments
AT joselayala smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments
_version_ 1724189374888804352