SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments
Deciding on the continuous treatment of chronic diseases is vital in terms of economy, quality of life, and time. We present a holistic data mining approach that addresses the prediction of the therapeutic response in a panoramic and feedback way while unveiling relevant medical factors. Panoramic p...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8756249/ |
id |
doaj-f1d8fcb2893b419295782aae8e842a6e |
---|---|
record_format |
Article |
spelling |
doaj-f1d8fcb2893b419295782aae8e842a6e2021-03-29T23:32:35ZengIEEEIEEE Access2169-35362019-01-017925989261410.1109/ACCESS.2019.29274298756249SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease TreatmentsFranklin Parrales Bravo0https://orcid.org/0000-0002-6283-8197Alberto A. Del Barrio Garcia1https://orcid.org/0000-0002-6769-1200Ana Beatriz Gago Veiga2Maria Mercedes Gallego De La Sacristana3Marina Ruiz Pinero4Angel Guerrero Peral5Saso Dzeroski6Jose L. Ayala7Department of Computer Architecture and Automation, Complutense University of Madrid, Madrid, SpainDepartment of Computer Architecture and Automation, Complutense University of Madrid, Madrid, SpainNeurology Department, La Princesa University Hospital, Madrid, SpainNeurology Department, La Princesa University Hospital, Madrid, SpainHeadache Unit, Department of Neurology, Hospital Clínico Universitario de Valladolid, Valladolid, SpainHeadache Unit, Department of Neurology, Hospital Clínico Universitario de Valladolid, Valladolid, SpainDepartment of Knowledge Technologies, Jozef Stefan Institute, Ljubljana, SloveniaDepartment of Computer Architecture and Automation, Complutense University of Madrid, Madrid, SpainDeciding on the continuous treatment of chronic diseases is vital in terms of economy, quality of life, and time. We present a holistic data mining approach that addresses the prediction of the therapeutic response in a panoramic and feedback way while unveiling relevant medical factors. Panoramic prediction makes it possible to decide whether the treatment will be beneficial without using previous knowledge and without involving unnecessary treatments. Feedback prediction can be more accurate prediction since it considers the results of previous stages of the treatment. A novel label encoding called simulated annealing and rounding (SAR) encoding is also proposed to help improve the accuracy of prediction in both approaches. To unveil the medical factors that make the treatment effective for patients, various techniques are applied to the prediction models found through the proposed approaches. Finally, this methodology is applied in the realistic scenario of analyzing electronic medical records of migraineurs under BoNT-A treatment. The results show a significant improvement in accuracy due to the use of SAR encoding, from close to 60% (baseline) to 75% with panoramic prediction, and up to around 90% when using feedback prediction. Furthermore, the following factors have been found to be relevant when predicting the migraine treatment responses: migraine time evolution, unilateral pain, analgesic abuse, headache days, and the retroocular component. According to doctors, these factors are also medically relevant and in alignment with the medical literature.https://ieeexplore.ieee.org/document/8756249/Multi-target predictionclassification algorithmsdata miningsimulated annealing |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Franklin Parrales Bravo Alberto A. Del Barrio Garcia Ana Beatriz Gago Veiga Maria Mercedes Gallego De La Sacristana Marina Ruiz Pinero Angel Guerrero Peral Saso Dzeroski Jose L. Ayala |
spellingShingle |
Franklin Parrales Bravo Alberto A. Del Barrio Garcia Ana Beatriz Gago Veiga Maria Mercedes Gallego De La Sacristana Marina Ruiz Pinero Angel Guerrero Peral Saso Dzeroski Jose L. Ayala SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments IEEE Access Multi-target prediction classification algorithms data mining simulated annealing |
author_facet |
Franklin Parrales Bravo Alberto A. Del Barrio Garcia Ana Beatriz Gago Veiga Maria Mercedes Gallego De La Sacristana Marina Ruiz Pinero Angel Guerrero Peral Saso Dzeroski Jose L. Ayala |
author_sort |
Franklin Parrales Bravo |
title |
SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments |
title_short |
SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments |
title_full |
SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments |
title_fullStr |
SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments |
title_full_unstemmed |
SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments |
title_sort |
smurf: systematic methodology for unveiling relevant factors in retrospective data on chronic disease treatments |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Deciding on the continuous treatment of chronic diseases is vital in terms of economy, quality of life, and time. We present a holistic data mining approach that addresses the prediction of the therapeutic response in a panoramic and feedback way while unveiling relevant medical factors. Panoramic prediction makes it possible to decide whether the treatment will be beneficial without using previous knowledge and without involving unnecessary treatments. Feedback prediction can be more accurate prediction since it considers the results of previous stages of the treatment. A novel label encoding called simulated annealing and rounding (SAR) encoding is also proposed to help improve the accuracy of prediction in both approaches. To unveil the medical factors that make the treatment effective for patients, various techniques are applied to the prediction models found through the proposed approaches. Finally, this methodology is applied in the realistic scenario of analyzing electronic medical records of migraineurs under BoNT-A treatment. The results show a significant improvement in accuracy due to the use of SAR encoding, from close to 60% (baseline) to 75% with panoramic prediction, and up to around 90% when using feedback prediction. Furthermore, the following factors have been found to be relevant when predicting the migraine treatment responses: migraine time evolution, unilateral pain, analgesic abuse, headache days, and the retroocular component. According to doctors, these factors are also medically relevant and in alignment with the medical literature. |
topic |
Multi-target prediction classification algorithms data mining simulated annealing |
url |
https://ieeexplore.ieee.org/document/8756249/ |
work_keys_str_mv |
AT franklinparralesbravo smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments AT albertoadelbarriogarcia smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments AT anabeatrizgagoveiga smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments AT mariamercedesgallegodelasacristana smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments AT marinaruizpinero smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments AT angelguerreroperal smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments AT sasodzeroski smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments AT joselayala smurfsystematicmethodologyforunveilingrelevantfactorsinretrospectivedataonchronicdiseasetreatments |
_version_ |
1724189374888804352 |