Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison

BackgroundUric acid is associated with noncommunicable diseases such as cardiovascular diseases, chronic kidney disease, coronary artery disease, stroke, diabetes, metabolic syndrome, vascular dementia, and hypertension. Therefore, uric acid is considered to be a risk factor...

Full description

Bibliographic Details
Main Authors: Sampa, Masuda Begum, Hossain, Md Nazmul, Hoque, Md Rakibul, Islam, Rafiqul, Yokota, Fumihiko, Nishikitani, Mariko, Ahmed, Ashir
Format: Article
Language:English
Published: JMIR Publications 2020-10-01
Series:JMIR Medical Informatics
Online Access:https://medinform.jmir.org/2020/10/e18331
id doaj-8511e8ca44484f6bbfc55f0b46953cbc
record_format Article
spelling doaj-8511e8ca44484f6bbfc55f0b46953cbc2021-05-03T02:53:38ZengJMIR PublicationsJMIR Medical Informatics2291-96942020-10-01810e1833110.2196/18331Blood Uric Acid Prediction With Machine Learning: Model Development and Performance ComparisonSampa, Masuda BegumHossain, Md NazmulHoque, Md RakibulIslam, RafiqulYokota, FumihikoNishikitani, MarikoAhmed, Ashir BackgroundUric acid is associated with noncommunicable diseases such as cardiovascular diseases, chronic kidney disease, coronary artery disease, stroke, diabetes, metabolic syndrome, vascular dementia, and hypertension. Therefore, uric acid is considered to be a risk factor for the development of noncommunicable diseases. Most studies on uric acid have been performed in developed countries, and the application of machine-learning approaches in uric acid prediction in developing countries is rare. Different machine-learning algorithms will work differently on different types of data in various diseases; therefore, a different investigation is needed for different types of data to identify the most accurate algorithms. Specifically, no study has yet focused on the urban corporate population in Bangladesh, despite the high risk of developing noncommunicable diseases for this population. ObjectiveThe aim of this study was to develop a model for predicting blood uric acid values based on basic health checkup test results, dietary information, and sociodemographic characteristics using machine-learning algorithms. The prediction of health checkup test measurements can be very helpful to reduce health management costs. MethodsVarious machine-learning approaches were used in this study because clinical input data are not completely independent and exhibit complex interactions. Conventional statistical models have limitations to consider these complex interactions, whereas machine learning can consider all possible interactions among input data. We used boosted decision tree regression, decision forest regression, Bayesian linear regression, and linear regression to predict personalized blood uric acid based on basic health checkup test results, dietary information, and sociodemographic characteristics. We evaluated the performance of these five widely used machine-learning models using data collected from 271 employees in the Grameen Bank complex of Dhaka, Bangladesh. ResultsThe mean uric acid level was 6.63 mg/dL, indicating a borderline result for the majority of the sample (normal range <7.0 mg/dL). Therefore, these individuals should be monitoring their uric acid regularly. The boosted decision tree regression model showed the best performance among the models tested based on the root mean squared error of 0.03, which is also better than that of any previously reported model. ConclusionsA uric acid prediction model was developed based on personal characteristics, dietary information, and some basic health checkup measurements. This model will be useful for improving awareness among high-risk individuals and populations, which can help to save medical costs. A future study could include additional features (eg, work stress, daily physical activity, alcohol intake, eating red meat) in improving prediction.https://medinform.jmir.org/2020/10/e18331
collection DOAJ
language English
format Article
sources DOAJ
author Sampa, Masuda Begum
Hossain, Md Nazmul
Hoque, Md Rakibul
Islam, Rafiqul
Yokota, Fumihiko
Nishikitani, Mariko
Ahmed, Ashir
spellingShingle Sampa, Masuda Begum
Hossain, Md Nazmul
Hoque, Md Rakibul
Islam, Rafiqul
Yokota, Fumihiko
Nishikitani, Mariko
Ahmed, Ashir
Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison
JMIR Medical Informatics
author_facet Sampa, Masuda Begum
Hossain, Md Nazmul
Hoque, Md Rakibul
Islam, Rafiqul
Yokota, Fumihiko
Nishikitani, Mariko
Ahmed, Ashir
author_sort Sampa, Masuda Begum
title Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison
title_short Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison
title_full Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison
title_fullStr Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison
title_full_unstemmed Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison
title_sort blood uric acid prediction with machine learning: model development and performance comparison
publisher JMIR Publications
series JMIR Medical Informatics
issn 2291-9694
publishDate 2020-10-01
description BackgroundUric acid is associated with noncommunicable diseases such as cardiovascular diseases, chronic kidney disease, coronary artery disease, stroke, diabetes, metabolic syndrome, vascular dementia, and hypertension. Therefore, uric acid is considered to be a risk factor for the development of noncommunicable diseases. Most studies on uric acid have been performed in developed countries, and the application of machine-learning approaches in uric acid prediction in developing countries is rare. Different machine-learning algorithms will work differently on different types of data in various diseases; therefore, a different investigation is needed for different types of data to identify the most accurate algorithms. Specifically, no study has yet focused on the urban corporate population in Bangladesh, despite the high risk of developing noncommunicable diseases for this population. ObjectiveThe aim of this study was to develop a model for predicting blood uric acid values based on basic health checkup test results, dietary information, and sociodemographic characteristics using machine-learning algorithms. The prediction of health checkup test measurements can be very helpful to reduce health management costs. MethodsVarious machine-learning approaches were used in this study because clinical input data are not completely independent and exhibit complex interactions. Conventional statistical models have limitations to consider these complex interactions, whereas machine learning can consider all possible interactions among input data. We used boosted decision tree regression, decision forest regression, Bayesian linear regression, and linear regression to predict personalized blood uric acid based on basic health checkup test results, dietary information, and sociodemographic characteristics. We evaluated the performance of these five widely used machine-learning models using data collected from 271 employees in the Grameen Bank complex of Dhaka, Bangladesh. ResultsThe mean uric acid level was 6.63 mg/dL, indicating a borderline result for the majority of the sample (normal range <7.0 mg/dL). Therefore, these individuals should be monitoring their uric acid regularly. The boosted decision tree regression model showed the best performance among the models tested based on the root mean squared error of 0.03, which is also better than that of any previously reported model. ConclusionsA uric acid prediction model was developed based on personal characteristics, dietary information, and some basic health checkup measurements. This model will be useful for improving awareness among high-risk individuals and populations, which can help to save medical costs. A future study could include additional features (eg, work stress, daily physical activity, alcohol intake, eating red meat) in improving prediction.
url https://medinform.jmir.org/2020/10/e18331
work_keys_str_mv AT sampamasudabegum blooduricacidpredictionwithmachinelearningmodeldevelopmentandperformancecomparison
AT hossainmdnazmul blooduricacidpredictionwithmachinelearningmodeldevelopmentandperformancecomparison
AT hoquemdrakibul blooduricacidpredictionwithmachinelearningmodeldevelopmentandperformancecomparison
AT islamrafiqul blooduricacidpredictionwithmachinelearningmodeldevelopmentandperformancecomparison
AT yokotafumihiko blooduricacidpredictionwithmachinelearningmodeldevelopmentandperformancecomparison
AT nishikitanimariko blooduricacidpredictionwithmachinelearningmodeldevelopmentandperformancecomparison
AT ahmedashir blooduricacidpredictionwithmachinelearningmodeldevelopmentandperformancecomparison
_version_ 1721484991756500992